AiPortalXAIPortalX Logo

Filters

Selected Filters

Speech To Text
Task1
Organization
Country

Include Other Tiers

By default, only production models are shown

11 Models found

Alibaba

Qwen3-Omni-30B-A3B

By Alibaba
Domain
MultimodalMultimodalLanguageLanguageVisionVision+1 more
Task
Language modelingLanguage modelingLanguage generationLanguage generationQuestion answeringQuestion answering+6 more
NVIDIA

Canary 1B v2

By NVIDIA
Domain
SpeechSpeech
Task
Speech recognition ASRSpeech recognition ASRTranslationTranslationSpeech-to-textSpeech-to-text
NVIDIA

Parakeet-tdt-0.6b-v3

By NVIDIA
Domain
SpeechSpeech
Task
Speech-to-textSpeech-to-textSpeech recognition ASRSpeech recognition ASR
Google

Gemini 2.5 Deep Think

By Google
Domain
LanguageLanguageMultimodalMultimodalVisionVision+2 more
Task
Language modelingLanguage modelingLanguage generationLanguage generationMathematical reasoningMathematical reasoning+6 more
Google

Gemma 3n

By Google
Domain
LanguageLanguageMultimodalMultimodalSpeechSpeech
Task
Language modelingLanguage modelingLanguage generationLanguage generationQuestion answeringQuestion answering+7 more
Google

Chirp 3 Speech-to-Text

By Google
Domain
SpeechSpeech
Task
Speech recognition ASRSpeech recognition ASRSpeech-to-textSpeech-to-textTranslationTranslation
Baichuan

Baichuan-Omni-1.5

By Baichuan
Domain
MultimodalMultimodalLanguageLanguageSpeechSpeech+2 more
Task
Language modelingLanguage modelingLanguage generationLanguage generationQuestion answeringQuestion answering+8 more
Google

Chirp 2 Speech-to-Text

By Google
Domain
SpeechSpeech
Task
Speech recognition ASRSpeech recognition ASRSpeech-to-textSpeech-to-textTranslationTranslation
OpenAI

GPT-4o May 2024

By OpenAI
Domain
MultimodalMultimodalLanguageLanguageAudioAudio+1 more
Task
ChatChatImage generationImage generationAudio generationAudio generation+6 more
NVIDIA

Parakeet ASR rnnt 1.1B

By NVIDIA
Domain
SpeechSpeech
Task
Speech recognition ASRSpeech recognition ASRSpeech-to-textSpeech-to-text
Google

Chirp

By Google
Domain
SpeechSpeech
Task
Speech recognition ASRSpeech recognition ASRSpeech-to-textSpeech-to-text
No more models