AiPortalX
Search...
⌘
K
Log in
Filters
Selected Filters
Audio question answering
Task
1
Domain
Organization
Country
Include Other Tiers
Active Research
Legacy Models
By default, only production models are shown
Toggle Sidebar
3 Models found
gpt-realtime
By
OpenAI
Domain
Speech
Vision
Language
Task
Speech recognition ASR
Speech synthesis
Visual question answering
Speech-to-speech
+1 more
Gemini 2.5 Flash Native Audio
By
Google DeepMind
Domain
Speech
Task
Speech-to-speech
Audio question answering
Text-to-speech TTS
Speech synthesis
Baichuan-Omni-1.5
By
Baichuan
Domain
Multimodal
Language
Speech
Vision
+2 more
Task
Language modeling
Language generation
Question answering
Audio question answering
+8 more
No more models