AiPortalX
Selected filter: Domain = Speech
By default, only production models are shown.
42 Models found
Gemini Robotics-ER 1.5
By: Google DeepMind
Domain: Vision, Language, Speech
Task: Instruction interpretation, Robotic manipulation, Image captioning, Object detection, +5 more

Qwen3-Omni-30B-A3B
By: Alibaba
Domain: Multimodal, Language, Vision, Speech, +1 more
Task: Language modeling, Language generation, Question answering, Visual question answering, +6 more

Chatterbox Multilingual
By: Resemble AI
Domain: Speech
Task: Text-to-speech (TTS), Speech synthesis

MAI-Voice-1
By: Microsoft
Domain: Speech
Task: Text-to-speech (TTS), Speech synthesis

gpt-realtime
By: OpenAI
Domain: Speech, Vision, Language
Task: Speech recognition (ASR), Speech synthesis, Visual question answering, Speech-to-speech, +1 more

Canary 1B v2
By: NVIDIA
Domain: Speech
Task: Speech recognition (ASR), Translation, Speech-to-text

Parakeet-tdt-0.6b-v3
By: NVIDIA
Domain: Speech
Task: Speech-to-text, Speech recognition (ASR)

Gemini 2.5 Flash-Lite Jun 2024
By: Google DeepMind
Domain: Language, Vision, Video, Speech, +1 more
Task: Language modeling, Language generation, Question answering, Translation, +9 more

Gemini 2.5 Flash Native Audio
By: Google DeepMind
Domain: Speech
Task: Speech-to-speech, Audio question answering, Text-to-speech (TTS), Speech synthesis

OpenAudio-S1-mini
By: Fish Audio
Domain: Speech
Task: Speech synthesis, Text-to-speech (TTS)

Gemma 3n
By: Google
Domain: Language, Multimodal, Speech, Vision
Task: Language modeling, Language generation, Question answering, Chat, +7 more

Gemini 2.5 Flash
By: Google DeepMind
Domain: Language, Multimodal, Vision, Speech, +1 more
Task: Language modeling, Language generation, Question answering, Code generation, +9 more

Gemini 2.5 Pro
By: Google DeepMind
Domain: Language, Vision, Video, Multimodal, +1 more
Task: Language modeling, Language generation, Question answering, Code generation, +6 more

Chirp 3 HD Text-to-Speech
By: Google
Domain: Speech
Task: Text-to-speech (TTS), Speech synthesis

Chirp 3 Speech-to-Text
By: Google
Domain: Speech
Task: Speech recognition (ASR), Speech-to-text, Translation

Reka Flash 3
By: Reka AI
Domain: Multimodal, Language, Vision, Video, +1 more
Task: Chat, Code generation, Language modeling, Language generation, +6 more