oreille

Oreille is a wrapper on OpenAPI Whisper API. It's provide support for long audio files.


Keywords
oreille, openai-api, speech-to-text
License
Apache-2.0
Install
pip install oreille==0.1.4

Documentation

Oreille

Oreille is a wrapper on OpenAPI Whisper API. It provides support for long audio files.

OpenAPI Whisper support only files that are less than 25 MB. Oreille will break the audio file into chunks of 25 MB's or less. https://platform.openai.com/docs/guides/speech-to-text/longer-inputs

Oreille will also compute the correct timing of the subtitle when merging the output of Whisper. So once you export the subtitle the timestamp of the subtitle will be right.

You can open and save WAV files with pure python. For opening and saving non-wav files – like mp3 – you'll need ffmpeg or libav.

Features

  • Drop-in replacement to the OpenAPI openai.Audio.transcribe
  • Support for large audio file
  • Export as text, srt, vtt, json

Usage

You need to set your OPENAI_API_KEY environment variable first.

export OPENAI_API_KEY=sk-xxxxx
pip install oreille

Then you can use it like this:

import oreille

print(oreille.transcribe("whisper-1", open("examples/test.wav", "rb"), language="fr",
        prompt="Test son", response_format="verbose_json", audio_format="wav"))

See more examples in the examples directory.

License

Free software: Apache Software License 2.0

FAQ

Why Oreille?

Oreille is the french word for ear. The original author of Oreille is french.