- transcribe - Audio transcriptions
Given an audio file, the model transcribes it into text.
import os
from friendli import SyncFriendli
with SyncFriendli(
token=os.getenv("FRIENDLI_TOKEN", ""),
) as friendli:
res = friendli.container.audio.transcribe(
file="@/path/to/file/audio.mp3".encode(), model="(adapter-route)"
)
# Handle response
print(res)| Parameter | Type | Required | Description | Example |
|---|---|---|---|---|
file |
bytes | ✔️ | The audio file object (not file name) to transcribe, in one of these formats: mp3, wav, flac, ogg, and many other standard audio formats. | |
model |
OptionalNullable[str] | ➖ | Routes the request to a specific adapter. | (adapter-route) |
chunking_strategy |
OptionalNullable[models.ContainerAudioTranscriptionBodyChunkingStrategy] | ➖ | Controls how the audio is cut into chunks. When set to "auto", the server first normalizes loudness and then uses voice activity detection (VAD) to choose boundaries. server_vad object can be provided to tweak VAD detection parameters manually. If unset, the audio is transcribed as a single block. |
|
language |
OptionalNullable[str] | ➖ | The language of the input audio. Supplying the input language in ISO-639-1 (e.g. en) format will improve accuracy and latency. |
|
temperature |
OptionalNullable[float] | ➖ | The sampling temperature, between 0 and 1. Higher values like 0.8 will make the output more random, while lower values like 0.2 will make it more focused and deterministic. | |
retries |
Optional[utils.RetryConfig] | ➖ | Configuration to override the default retry behavior of the client. | |
server_url |
Optional[str] | ➖ | An optional server URL to use. | http://localhost:8080 |
models.ContainerAudioTranscriptionSuccess
| Error Type | Status Code | Content Type |
|---|---|---|
| models.SDKError | 4XX, 5XX | */* |