pdf-icon

Models

Explore all available models and compare their capabilities.

Reasoning models

Flagship chat models

Multimodal models

Text-to-speech

Models that can convert text into natural sounding spoken audio.

Transcription

Model that can transcribe and translate audio into text.

Keyword spotting

Models that can detect specific keywords in audio streams.

Voice activity detection

Models that can detect whether there is speech in an audio stream.

Automatic Speech Recognition

Models that can convert spoken language into text.

Vision

Models that can process images and perform tasks like object detection, etc.

Model Pages

On This Page