Welcome to Sarvam AI Documentation
Sarvam provides Models & APIs across the stack to help developers build powerful applications. Whether you’re looking to translate text, convert speech to text, or combine speech recognition with translation, Sarvam has you covered.
Key Features
Translate Text
Use the /translate endpoint to translate text from one language to another. It supports 10 Indic languages along with English, achieving best-in-class performance.
Speech to Text
Use the /speech-to-text endpoint to convert spoken language into written text. The output is returned in the same language as the input.
Speech to Text Translate
Use the /speech-to-text-translate endpoint to combine speech recognition and translation, converting spoken language directly into translated text.
Text to Speech
Use the /text-to-speech endpoint to convert written text into spoken words. It supports natural-sounding voices across 10 languages.
Call Analytics
Use the /call-analytics endpoint to perform intelligent question-answering on recorded calls or conversations.
Text Analytics
Use the /text-analytics endpoint to conduct advanced question-answering on written text.
Getting Started
To get started with Sarvam APIs, follow these steps:
- Authenticate: Learn how to authenticate your API requests in the Authentication Guide.
- Meta Prompt: Use our Meta Prompt to give any AI chat model the context it needs to use Sarvam’s APIs effectively. An example on AI Studio uses Gemini’s latest model.
- Try Examples: Use the Usage Guides to see examples and best practices.
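As a starting point before turning to the Usage Guides, the sketch below shows how a request to the /translate endpoint might be assembled in Python using only the standard library. This page does not state the base URL, the authentication header, the JSON field names, or the language-code format, so `BASE_URL`, `AUTH_HEADER`, the `input`, `source_language_code`, and `target_language_code` fields, and codes such as `en-IN` are all assumptions made for illustration. Check them against the Authentication Guide and the endpoint reference before relying on them.

```python
import json
import urllib.request

# Assumptions (not stated on this page): base URL, auth header name,
# JSON field names, and the language-code format.
BASE_URL = "https://api.sarvam.ai"
AUTH_HEADER = "api-subscription-key"


def build_translate_request(api_key, text, source_lang, target_lang):
    """Build (but do not send) a POST request for the /translate endpoint."""
    payload = {
        "input": text,
        "source_language_code": source_lang,
        "target_language_code": target_lang,
    }
    return urllib.request.Request(
        url=BASE_URL + "/translate",
        data=json.dumps(payload).encode("utf-8"),
        headers={AUTH_HEADER: api_key, "Content-Type": "application/json"},
        method="POST",
    )


def send(request, timeout=30):
    """Send a prepared request and decode the JSON response body."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    req = build_translate_request(
        "YOUR_API_KEY", "Hello, how are you?", "en-IN", "hi-IN"
    )
    # Inspect the request without touching the network.
    print(req.get_method(), req.full_url)
    # send(req) would perform the actual call; it needs a valid API key.
```

Building the request separately from sending it makes it possible to inspect the URL, headers, and payload without a network call or a valid key. The same pattern could be adapted to the other JSON-based endpoints by changing the path and payload.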