STT skill — speech-to-text (ears) via Deepgram/Whisper #8

@jbold

Summary

Give Kit ears: speech-to-text for voice messages and real-time audio.

Tasks

  • Evaluate Deepgram Nova-3 vs local Whisper
  • Build skill following openclaw conventions
  • Real-time streaming for Discord voice
  • Async STT for voice messages (WhatsApp, Discord)
  • Integrate with the existing openai-whisper skill, or replace it
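
For the Deepgram vs. Whisper evaluation, one common yardstick is word error rate (WER) against a hand-checked reference transcript. The issue does not name a metric, so WER is an assumption here, and `word_error_rate` is a hypothetical helper rather than part of any existing skill. A minimal sketch:

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the reference length.

    Hypothetical helper: case-insensitive, splits on whitespace only,
    and does no punctuation normalization.
    """
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference transcript must not be empty")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)
```

Running both backends over the same set of recorded voice messages and comparing the average WER (alongside latency and cost) would give a rough basis for the decision.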

Notes

A hybrid approach is likely best: Deepgram for real-time audio, Whisper for offline transcription.
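
The hybrid split could be sketched as a small routing function. All names below (`Backend`, `AudioJob`, `choose_backend`) are hypothetical and not taken from openclaw or any existing skill, and the sketch assumes "offline" means non-real-time input such as recorded voice messages:

```python
from dataclasses import dataclass
from enum import Enum


class Backend(Enum):
    DEEPGRAM = "deepgram"  # hosted, used for streaming audio
    WHISPER = "whisper"    # local, used for recorded audio


@dataclass(frozen=True)
class AudioJob:
    """Hypothetical description of one transcription request."""
    source: str     # e.g. "discord-voice" or "whatsapp-voice-message"
    realtime: bool  # True for live streams, False for recorded clips


def choose_backend(job: AudioJob) -> Backend:
    # Assumption: live streams go to Deepgram, everything else to Whisper.
    return Backend.DEEPGRAM if job.realtime else Backend.WHISPER
```

For example, `choose_backend(AudioJob("discord-voice", realtime=True))` selects Deepgram, while a WhatsApp voice message with `realtime=False` is routed to Whisper. Keeping the decision in one function would make it easy to change the split once the evaluation is done.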
