AIPortalX

Selected filter: Task = Speech recognition (ASR)

By default, only production models are shown

30 Models found

Gemini Robotics-ER 1.5
By Google DeepMind
Domain: Vision, Language, Speech
Task: Instruction interpretation, Robotic manipulation, Image captioning, Object detection, +5 more

Qwen3-Omni-30B-A3B
By Alibaba
Domain: Multimodal, Language, Vision, Speech, +1 more
Task: Language modeling, Language generation, Question answering, Visual question answering, +6 more

gpt-realtime
By OpenAI
Domain: Speech, Vision, Language
Task: Speech recognition (ASR), Speech synthesis, Visual question answering, Speech-to-speech, +1 more

Canary 1B v2
By NVIDIA
Domain: Speech
Task: Speech recognition (ASR), Translation, Speech-to-text

Parakeet-tdt-0.6b-v3
By NVIDIA
Domain: Speech
Task: Speech-to-text, Speech recognition (ASR)

Gemini 2.5 Deep Think
By Google
Domain: Language, Multimodal, Vision, Video, +2 more
Task: Language modeling, Language generation, Mathematical reasoning, Code generation, +6 more

Gemini 2.5 Flash-Lite Jun 2024
By Google DeepMind
Domain: Language, Vision, Video, Speech, +1 more
Task: Language modeling, Language generation, Question answering, Translation, +9 more

Gemma 3n
By Google
Domain: Language, Multimodal, Speech, Vision
Task: Language modeling, Language generation, Question answering, Chat, +7 more

Gemini 2.5 Flash
By Google DeepMind
Domain: Language, Multimodal, Vision, Speech, +1 more
Task: Language modeling, Language generation, Question answering, Code generation, +9 more

Gemini 2.5 Pro
By Google DeepMind
Domain: Language, Vision, Video, Multimodal, +1 more
Task: Language modeling, Language generation, Question answering, Code generation, +6 more

Chirp 3 Speech-to-Text
By Google
Domain: Speech
Task: Speech recognition (ASR), Speech-to-text, Translation

Reka Flash 3
By Reka AI
Domain: Multimodal, Language, Vision, Video, +1 more
Task: Chat, Code generation, Language modeling, Language generation, +6 more

Baichuan-Omni-1.5
By Baichuan
Domain: Multimodal, Language, Speech, Vision, +2 more
Task: Language modeling, Language generation, Question answering, Audio question answering, +8 more

Gemini 2.0 Flash
By Google DeepMind
Domain: Language, Vision, Audio, Speech, +2 more
Task: Language modeling, Language generation, Question answering, Visual question answering, +9 more

Gemini 2.0 Pro
By Google DeepMind
Domain: Language, Multimodal, Vision, Video, +1 more
Task: Code generation, Language modeling, Language generation, Question answering, +3 more

Chirp 2 Speech-to-Text
By Google
Domain: Speech
Task: Speech recognition (ASR), Speech-to-text, Translation