AIPortalX

Selected Filters: Audio

By default, only production models are shown.

19 Models found

Google

Gemini 2.5 Deep Think

By Google
Domain
Language
Multimodal
Vision
Video
+2 more
Task
Language modeling
Language generation
Mathematical reasoning
Code generation
+6 more
Google DeepMind

Lyria RealTime

By Google DeepMind
Domain
Audio
Task
Audio generation
Baichuan

Baichuan-Omni-1.5

By Baichuan
Domain
Multimodal
Language
Speech
Vision
+2 more
Task
Language modeling
Language generation
Question answering
Audio question answering
+8 more
Google DeepMind

Gemini 2.0 Flash

By Google DeepMind
Domain
Language
Vision
Audio
Speech
+2 more
Task
Language modeling
Language generation
Question answering
Visual question answering
+9 more
Google DeepMind

Gemini 2.0 Pro

By Google DeepMind
Domain
Language
Multimodal
Vision
Video
+1 more
Task
Code generation
Language modeling
Language generation
Question answering
+3 more
Google Research

Whale Bioacoustics Model

By Google Research
Domain
Audio
Task
Audio classification
OpenAI

GPT-4o May 2024

By OpenAI
Domain
Multimodal
Language
Audio
Speech
+1 more
Task
Chat
Image generation
Audio generation
Vision-language generation
+6 more
Google DeepMind

Gemini 1.5 Flash May 2024

By Google DeepMind
Domain
Multimodal
Language
Vision
Audio
Task
Chat
Image captioning
Visual question answering
Translation
+4 more
Google DeepMind

Gemini 1.5 Flash 8B

By Google DeepMind
Domain
Multimodal
Language
Vision
Audio
Task
Chat
Image captioning
Visual question answering
Translation
+4 more
Meta AI

MAGNeT

By Meta AI
Domain
Audio
Task
Audio generation
Google DeepMind

Gemini Nano-1

By Google DeepMind
Domain
Multimodal
Language
Vision
Audio
Task
Chat
Image captioning
Speech recognition (ASR)
Google DeepMind

Lyria

By Google DeepMind
Domain
Audio
Task
Audio generation
Meta AI

MultiBand Diffusion

By Meta AI
Domain
Audio
Speech
Task
Audio generation
Google Research

AudioLM

By Google Research
Domain
Audio
Task
Audio generation
Meta AI

MusicGen

By Meta AI
Domain
Audio
Task
Audio generation
Suno

Suno Bark Model

By Suno
Domain
Audio
Task
Audio generation