AIPortalX

Selected filter: Audio

By default, only production models are shown

19 Models found

Google

Gemini 2.5 Deep Think

By Google
Domain
Language, Multimodal, Vision, +2 more
Task
Language modeling, Language generation, Mathematical reasoning, +6 more
Google DeepMind

Lyria RealTime

By Google DeepMind
Domain
Audio
Task
Audio generation
Baichuan

Baichuan-Omni-1.5

By Baichuan
Domain
Multimodal, Language, Speech, +2 more
Task
Language modeling, Language generation, Question answering, +8 more
Google DeepMind

Gemini 2.0 Flash

By Google DeepMind
Domain
Language, Vision, Audio, +2 more
Task
Language modeling, Language generation, Question answering, +9 more
Google DeepMind

Gemini 2.0 Pro

By Google DeepMind
Domain
Language, Multimodal, Vision, +1 more
Task
Code generation, Language modeling, Language generation, +3 more
Google Research

Whale Bioacoustics Model

By Google Research
Domain
Audio
Task
Audio classification
OpenAI

GPT-4o May 2024

By OpenAI
Domain
Multimodal, Language, Audio, +1 more
Task
Chat, Image generation, Audio generation, +6 more
Google DeepMind

Gemini 1.5 Flash May 2024

By Google DeepMind
Domain
Multimodal, Language, Vision
Task
Chat, Image captioning, Visual question answering, +4 more
Google DeepMind

Gemini 1.5 Flash 8B

By Google DeepMind
Domain
Multimodal, Language, Vision
Task
Chat, Image captioning, Visual question answering, +4 more
Meta AI

MAGNeT

By Meta AI
Domain
Audio
Task
Audio generation
Google DeepMind

Gemini Nano-1

By Google DeepMind
Domain
Multimodal, Language, Vision
Task
Chat, Image captioning, Speech recognition (ASR)
Google DeepMind

Lyria

By Google DeepMind
Domain
Audio
Task
Audio generation
Meta AI

MultiBand Diffusion

By Meta AI
Domain
Audio, Speech
Task
Audio generation
Google Research

AudioLM

By Google Research
Domain
Audio
Task
Audio generation
Meta AI

MusicGen

By Meta AI
Domain
Audio
Task
Audio generation