Whisper tools

These are tools that use whisper, ffmpeg etc to process foriegn language videos

Install brew and then install ffmpeg

/bin/bash -c "$(curl -fsSL https://raw.githubusercontent.com/Homebrew/install/HEAD/install.sh)"
brew install FFmpeg
brew install cmake

Install Python Poetry Environment (2.0 version)

brew install pyenv
pyenv install 3.12.7
pyenv global 3.12.7
poetry install
poetry env activate

Compile and build Whisper.cpp

git clone https://github.com/ggerganov/whisper.cpp
cd whisper.cpp
cmake -B build
cmake --build build --config Release
./models/download-ggml-model.sh large-v3

create a simlink so the main script can find main

ln -s ./build/bin/whisper-cli main

Create a CSV file to do conversion, example headers:

Source_File_Path|Source_File_Name|Subject_Name|Subject_Tag|Campaign

Folders will be created at the location of CSV file

Run output splitter to create folders, convert audio, and run whisper and split to final versions

python whisper_output_splitter.py -a all -p ~/Pictures/hfunds/content/HearOurStories/Interviews_Dec_8_HearOurStories.csv

For debugging you can run an individual stage with the -a flag and you can modify other flags as follows

To split to 5 words per line, and only first 60 seconds of the project name matching 'Irina', only processing the SRT generation step.

python whisper_output_splitter.py -a create_srt -m 5 -n 1 -d 60  -f "Irina"  -p ~/Pictures/hfunds/content/HearOurStories/Interviews_Dec_8_HearOurStories.csv

FAQ

Translations are repeating over and over again.

openai/whisper#81 This happens when the model is unsure about the output (according to the compression_ratio_threshold and logprob_threshold settings). The most common failure mode is that it falls into a repeat loop, where it likely triggers the compression_ratio_threshold. The default setting tries temperatures 0, 0.2, 0.4, 0.6, 0.8, 1.0 until it gives up, at which it is less likely to be in a repeat loop but is also less likely to be correct.

Add library via poetry

poetry add nltk poetry update poetry lock

Name		Name	Last commit message	Last commit date
Latest commit History 45 Commits
.gitignore		.gitignore
.python-version		.python-version
LICENSE		LICENSE
README.md		README.md
google_dump_comments.py		google_dump_comments.py
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml
scene_summarizer.py		scene_summarizer.py
whisper_output_splitter.py		whisper_output_splitter.py
whisper_srt.py		whisper_srt.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Whisper tools

Install brew and then install ffmpeg

Install Python Poetry Environment (2.0 version)

Compile and build Whisper.cpp

Create a CSV file to do conversion, example headers:

Run output splitter to create folders, convert audio, and run whisper and split to final versions

For debugging you can run an individual stage with the -a flag and you can modify other flags as follows

To split to 5 words per line, and only first 60 seconds of the project name matching 'Irina', only processing the SRT generation step.

FAQ

Translations are repeating over and over again.

Add library via poetry

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 2

Uh oh!

Languages

License

jsrawan-mobo/whisper_tools

Folders and files

Latest commit

History

Repository files navigation

Whisper tools

Install brew and then install ffmpeg

Install Python Poetry Environment (2.0 version)

Compile and build Whisper.cpp

Create a CSV file to do conversion, example headers:

Run output splitter to create folders, convert audio, and run whisper and split to final versions

For debugging you can run an individual stage with the -a flag and you can modify other flags as follows

To split to 5 words per line, and only first 60 seconds of the project name matching 'Irina', only processing the SRT generation step.

FAQ

Translations are repeating over and over again.

Add library via poetry

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages