Add local voice model transcription by pdufour · Pull Request #1 · pdufour/codex

pdufour · 2026-03-15T18:07:07Z

Description

This adds the ability to transcribe voice input with a local model, as part of the in-development transcription feature.
There are already some ways to load local models (e.g. you can run with ollama using --local-provider ollama), so this extends that theme with the ability to also do transcription locally. This can be useful, for instance, if you are temporarily offline.

output.mp4

Specific Changes

Add new /voicemodel slash command that lets you switch between remote and local transcription
Change hold-space-to-record to use the model selected above for transcription
Add ability to download models on the fly with hf_hub
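
As a rough illustration of the selection behaviour described above (switching between remote and local transcription, validating against an allowed-id list, and falling back to OpenAI when nothing is selected), here is a minimal sketch. The type and method names (`VoiceModel`, `VoiceSettings`, `from_id`, `effective`) and the id strings are assumptions for illustration only; `try_set` borrows its name from a commit title, but this is not the PR's actual code.

```rust
/// Transcription backends named in this PR.
/// Hypothetical type: the real code may model this differently.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum VoiceModel {
    OpenAi,
    Parakeet,
}

impl VoiceModel {
    /// Assumed allowed-id list; the real ids may differ.
    pub const ALLOWED_IDS: [&'static str; 2] = ["openai", "parakeet"];

    /// Maps an id to a model, rejecting anything outside the allowed list.
    pub fn from_id(id: &str) -> Option<Self> {
        match id {
            "openai" => Some(Self::OpenAi),
            "parakeet" => Some(Self::Parakeet),
            _ => None,
        }
    }

    /// Route name, matching the `transcribe_bytes route: ...` log lines.
    pub fn route(self) -> &'static str {
        match self {
            Self::OpenAi => "openai",
            Self::Parakeet => "parakeet",
        }
    }
}

/// Holds the user's selection, if any.
#[derive(Debug, Default)]
pub struct VoiceSettings {
    selected: Option<VoiceModel>,
}

impl VoiceSettings {
    /// Validates the id inline and stores it only if it is allowed.
    pub fn try_set(&mut self, id: &str) -> Result<VoiceModel, String> {
        match VoiceModel::from_id(id) {
            Some(model) => {
                self.selected = Some(model);
                Ok(model)
            }
            None => Err(format!(
                "unknown voice model '{id}', expected one of {:?}",
                VoiceModel::ALLOWED_IDS
            )),
        }
    }

    /// Model used for transcription: OpenAI when nothing was selected.
    pub fn effective(&self) -> VoiceModel {
        self.selected.unwrap_or(VoiceModel::OpenAi)
    }
}
```

In this sketch a rejected id leaves the previous selection untouched, so a typo in the slash command would not silently change the route.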

Test Plan

Test recording with onnx model

Enable the voice-input feature:
Edit codex-rs/tui/BUILD.bazel
Add voice-input to crate_features, like so:

codex_rust_crate(
    name = "tui",
    crate_name = "codex_tui",
    crate_features = ["voice-input"],
    # ... remaining attributes unchanged
)

Run the app with voice transcription enabled:

RUST_LOG=error,transcribe_rs=debug,codex_tui::voice=debug bazel run //codex-rs/cli:codex -- --enable voice_transcription --enable realtime_conversation

Select a voice-model:

/voice-model

Select Parakeet
Look at logs in another terminal:

tail -f ~/.codex/log/codex-tui.log

Hold down space and record something
See the transcription come through
Verify in the logs that it used Parakeet:

2026-03-16T00:59:43.065655Z DEBUG codex_tui::voice: transcribe_bytes route: parakeet

Test recording with remote OpenAI selection

Repeat the same steps but select the OpenAI model (this is already the default)
Hold down space and record something
See the transcription come through
Verify in the logs that it used OpenAI:

2026-03-16T00:59:43.065655Z DEBUG codex_tui::voice: transcribe_bytes route: openai

Test recording with no voice-model selected

Repeat the above steps but skip the /voice-model step
Hold down space and record something
View the logs and confirm that it fell back to the openai route:

2026-03-16T00:59:43.065655Z DEBUG codex_tui::voice: transcribe_bytes route: openai
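
The log checks in the test plan above could also be scripted. The following is a small sketch that pulls the route name out of a `transcribe_bytes route: ...` line; the helper names `transcribe_route` and `last_route` are hypothetical and not part of the PR, and the log format is assumed to match the lines shown above.

```rust
/// Extracts the route name from a log line such as
/// `... DEBUG codex_tui::voice: transcribe_bytes route: parakeet`.
/// Returns None when the line does not contain the marker.
pub fn transcribe_route(line: &str) -> Option<&str> {
    const MARKER: &str = "transcribe_bytes route: ";
    let start = line.find(MARKER)? + MARKER.len();
    // The route is the first whitespace-delimited token after the marker.
    line[start..].split_whitespace().next()
}

/// Returns the route of the most recent transcription in a log dump.
pub fn last_route(log: &str) -> Option<&str> {
    log.lines().rev().find_map(|line| transcribe_route(line))
}
```

Reading the contents of ~/.codex/log/codex-tui.log into a string and passing it to `last_route` would then report which backend handled the latest recording.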

pdufour added 10 commits March 15, 2026 11:28

wip: rip out openai-only transcription, wire up transcribe-rs with parakeet onnx backend

cdc85f4

Made-with: Cursor

download missing parakeet model files from huggingface on first use

50f9cee

Made-with: Cursor

add /voicemodel slash command, voice model popup, bring back openai transcription path

0c379cb

Made-with: Cursor

restore chat_composer.rs voice keybinding block that got nuked

f338a8e

Made-with: Cursor

switch model download from curl to hf-hub api, add tracing, kill /tmp debug log

7e29b12

Made-with: Cursor

clean up voice model validation, replace alias map with allowed-id list, trim chatwidget

61b409b

Made-with: Cursor

drop normalize_selected_model and validate wrapper, inline validation into try_set

a4849dc

Made-with: Cursor

add 24k to 16k wav resampling for local models, simplify voice popup and set_voice_model

d633852

Made-with: Cursor

revert .gitignore additions for model artifacts and sample.ogg

c0cd0c4

Made-with: Cursor

restore openai transcribe duration_seconds param and 24k sample rate test expectations from main

b7b0c33

Made-with: Cursor
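
Commit d633852 above mentions resampling 24k WAV audio to 16k for local models. How the PR does this is not shown here; the following is only a sketch of one simple approach, linear interpolation over mono f32 samples, with a hypothetical function name. It applies no low-pass filter, so a production resampler would likely need more care to avoid aliasing.

```rust
/// Resamples mono samples from `from_hz` to `to_hz` by linear interpolation.
/// Hypothetical helper for illustration; not the PR's implementation.
pub fn resample_linear(input: &[f32], from_hz: u32, to_hz: u32) -> Vec<f32> {
    if input.is_empty() || from_hz == 0 || to_hz == 0 {
        return Vec::new();
    }
    if from_hz == to_hz {
        return input.to_vec();
    }

    // Number of output samples covering the same duration as the input.
    let out_len = (input.len() as u64 * to_hz as u64 / from_hz as u64) as usize;
    // Distance, in input samples, between consecutive output samples.
    let step = from_hz as f64 / to_hz as f64;

    (0..out_len)
        .map(|i| {
            let pos = i as f64 * step;
            let idx = pos.floor() as usize;
            let frac = (pos - idx as f64) as f32;
            let a = input[idx];
            // Clamp the right neighbour at the end of the buffer.
            let b = input[(idx + 1).min(input.len() - 1)];
            a + (b - a) * frac
        })
        .collect()
}
```

For 24 kHz to 16 kHz the step is 1.5 input samples per output sample, so one second of input (24000 samples) yields 16000 output samples.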

pdufour force-pushed the paul.dufour/test-local-trans branch from 1ae3151 to b7b0c33 on March 15, 2026 18:30

pdufour added 2 commits March 15, 2026 11:32

fix normalize_chatgpt_base_url: use parameter name base_url instead of input

185279f

Made-with: Cursor

voice: repo-to-class map, generic SpeechModel loading, voice_model_picker_options()

18a084d

pdufour changed the title from "Paul.dufour/test local trans" to "Add local voice model transcription" on Mar 16, 2026

pdufour added 4 commits March 15, 2026 17:57

voice: default to OpenAI when no /voicemodel selected

7cf9b15

Re-enable features

d88d05f

Disable voice-model when voice-input feature is off

a333f38

Add tests

8971a82

pdufour wants to merge 16 commits into main from paul.dufour/test-local-trans
