vernacular-ai-speech

Vernacular Speech API python client


Keywords
asr, multilingual-speech-recognition, speech-recognition, speech-recognition-api, speech-to-text
License
Apache-2.0
Install
pip install vernacular-ai-speech==0.1.2

Documentation

Speech-to-Text API

Converts audio to text

We support these ten indian languages (language codes).

  • Hindi
  • English
  • Marathi
  • Kannada
  • Malayalam
  • Bengali
  • Gujarati
  • Punjabi
  • Telugu
  • Tamil

Authentication

To get access to our APIs reach out to us at hello@vernacular.ai

Ways to use the Service

  • Transcribing short audios [audios upto 1 min]
  • Transcribing long audios [more than 1 min]
  • Transcribing audio from streaming input

We recommend that you call this service using Vernacular provided client libraries. If your application needs to call this service using your own libraries, you should use the HTTP Endpoints.

Supported SDKs: Python

REST Reference

ServiceHost: https://asr.vernacular.ai

Speech Recognition

Name Description
recognize Performs synchronous speech recognition: receive results after all audio has been sent and processed.
longrunningrecognize Performs asynchronous speech recognition. Generally used for long audios

RPC Reference

Speech Recognition

Methods Description
Recognize Performs synchronous speech recognition: receive results after all audio has been sent and processed.
LongRunningRecognize Performs asynchronous speech recognition: receive results via the longrunning.Operations interface.
StreamingRecognize Performs streaming speech recognition: receive results while sending audio. Supports both unidirectional and bidirectional streaming.