Whisper
Bases: FoundationalModel
Whisper is a state-of-the-art transcription model from OpenAI. It is trained on 680,000 hours of multilingual and multitask supervised data.
Examples:
Invocation with a URL
from baseten.models import Whisper

# Instantiate the model, then call it with the URL of an audio file.
model = Whisper()
transcript = model("https://baseten.s3.amazonaws.com/whisper/whisper_test.wav")
Invocation with a local file
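A minimal sketch of the local-file case, assuming a local path is passed to the model the same way as a URL (the `__call__` description says it accepts either). The helper `transcribe_local` and the file name `recording.wav` are hypothetical and not part of baseten. The helper only checks that the file exists before calling the model.

```python
from pathlib import Path


def transcribe_local(model, audio_path):
    """Hypothetical helper: validate a local audio path, then call the model with it."""
    path = Path(audio_path).expanduser()
    if not path.is_file():
        # Fail early with a clear message instead of passing a bad path to the model.
        raise FileNotFoundError(f"Audio file not found: {path}")
    # The model is called with a plain string, as in the URL example.
    return model(str(path))


# Usage (requires the baseten package and credentials; the file name is a placeholder):
# from baseten.models import Whisper
# transcript = transcribe_local(Whisper(), "recording.wav")
```

Taking the model as an argument keeps the path check separate from the model call, so the helper can be exercised with a stand-in callable.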
__call__
Generate text from an audio file. Supports local file paths or URLs.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
path | str | Path to audio file. Can be a local file path or a URL. | required |
Returns:
Name | Type | Description |
---|---|---|
transcript | dict | Dictionary containing 3 keys: |
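The key names are not listed here, so the sketch below makes no assumption about them. It shows one way to inspect whatever dictionary the call returns. The helper `describe_transcript` is hypothetical and not part of baseten.

```python
def describe_transcript(transcript):
    """Hypothetical helper: map each key of the returned dict to the type name of its value."""
    if not isinstance(transcript, dict):
        raise TypeError(f"Expected dict, got {type(transcript).__name__}")
    return {key: type(value).__name__ for key, value in transcript.items()}


# Usage (assumes `transcript` came from a Whisper call as in the examples):
# for key, type_name in describe_transcript(transcript).items():
#     print(f"{key}: {type_name}")
```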