whisper-local-transcribe

kobim/whisper-local-transcribe

Fork 0

T

Kristofer Rolf Söderström 03fcd5798f Update README.md

2023-03-22 13:24:41 +01:00

sample_audio

added example

2023-03-22 12:07:29 +01:00

example.ipynb

Rename examples.ipynb to example.ipynb

2023-03-22 12:08:43 +01:00

LICENSE

Initial commit

2023-03-22 11:21:02 +01:00

README.md

Update README.md

2023-03-22 13:24:41 +01:00

transcribe.py

added example

2023-03-22 12:07:29 +01:00

README.md

transcribe

Simple implementation of OpenAI's Whisper to transcribe audio files from your local folders.

Instructions

Requirements

This script was made and tested in an Anaconda environment with python 3.10, if you're not familiar with python I would recommend this method. See here for instructions. You might need administrator rights.
Whisper requires some additional libraries. The setup page states: "The codebase also depends on a few Python packages, most notably HuggingFace Transformers for their fast tokenizer implementation and ffmpeg-python for reading audio files." Users might not need to specifically install Transfomers. However, a conda installation might be needed for ffmepg, which takes care of setting up PATH variables. Install with:

 conda install -c conda-forge ffmpeg-python

More advanced users can use pip install ffmpeg-python but be ready to deal with some PATH issues, which I encountered in Windows 11. 3. The main functionality comes from openai-whisper. See their page for details. As of 2023-03-22 you can install via:

pip install -U openai-whisper

Using the script

This is a simple script with no installation. You can either clone the repository with

git clone https://github.com/soderstromkr/transcribe.git

and use the example.ipynb template to use the script.
OR download the .py file into your work folder. Then you can either import it to another script or notebook for use. I recommend jupyter notebook for new users.

Example

See the example implementation on jupyter notebook.