AiPortalXAIPortalX Logo

Filters

Selected Filters

Character recognition ocr
Task1
Domain
Organization
Country

Include Other Tiers

By default, only production models are shown

21 Models found

Anthropic

Claude Opus 4.5

By Anthropic
Domain
LanguageLanguage
MultimodalMultimodal
VisionVision
Task
Code generationCode generation
Language modelingLanguage modeling
Language generationLanguage generation
Quantitative reasoningQuantitative reasoning
+13 more
Zhipu AI

GLM-4.5-Air

By Zhipu AI
Domain
LanguageLanguage
Task
Language modelingLanguage modeling
Language generationLanguage generation
Question answeringQuestion answering
Visual question answeringVisual question answering
+4 more
Google DeepMind

Aeneas

By Google DeepMind
Domain
VisionVision
MultimodalMultimodal
LanguageLanguage
Task
Character recognition OCRCharacter recognition OCR
Visual question answeringVisual question answering
xAI

Grok 4

By xAI
Domain
LanguageLanguage
MultimodalMultimodal
VisionVision
Task
Language modelingLanguage modeling
Language generationLanguage generation
Question answeringQuestion answering
SearchSearch
+4 more
Google DeepMind

Gemini 2.5 Flash-Lite Jun 2024

By Google DeepMind
Domain
LanguageLanguage
VisionVision
VideoVideo
SpeechSpeech
+1 more
Task
Language modelingLanguage modeling
Language generationLanguage generation
Question answeringQuestion answering
TranslationTranslation
+9 more
Anthropic

Claude Opus 4

By Anthropic
Domain
LanguageLanguage
MultimodalMultimodal
VisionVision
Task
Code generationCode generation
Language modelingLanguage modeling
Language generationLanguage generation
Quantitative reasoningQuantitative reasoning
+13 more
Anthropic

Claude Sonnet 4

By Anthropic
Domain
LanguageLanguage
MultimodalMultimodal
VisionVision
Task
Code generationCode generation
Language modelingLanguage modeling
Language generationLanguage generation
Quantitative reasoningQuantitative reasoning
+13 more
Google

Gemma 3n

By Google
Domain
LanguageLanguage
MultimodalMultimodal
SpeechSpeech
VisionVision
Task
Language modelingLanguage modeling
Language generationLanguage generation
Question answeringQuestion answering
ChatChat
+7 more
Reka AI

Reka Flash 3

By Reka AI
Domain
MultimodalMultimodal
LanguageLanguage
VisionVision
VideoVideo
+1 more
Task
ChatChat
Code generationCode generation
Language modelingLanguage modeling
Language generationLanguage generation
+6 more
Mistral AI

Mistral OCR

By Mistral AI
Domain
MultimodalMultimodal
VisionVision
LanguageLanguage
Task
Character recognition OCRCharacter recognition OCR
ChatChat
Language generationLanguage generation
NVIDIA

NVILA 15B

By NVIDIA
Domain
VisionVision
LanguageLanguage
MultimodalMultimodal
VideoVideo
Task
Visual question answeringVisual question answering
Video descriptionVideo description
Language modelingLanguage modeling
Language generationLanguage generation
+2 more
Rhymes AI

Aria

By Rhymes AI
Domain
MultimodalMultimodal
LanguageLanguage
VideoVideo
VisionVision
Task
Language modelingLanguage modeling
Language generationLanguage generation
Visual question answeringVisual question answering
Image captioningImage captioning
+4 more
Alibaba

Qwen2-VL-2B

By Alibaba
Domain
LanguageLanguage
VisionVision
MultimodalMultimodal
Task
Visual question answeringVisual question answering
Video descriptionVideo description
Language modelingLanguage modeling
Language generationLanguage generation
+4 more
Alibaba

Qwen2-VL-72B

By Alibaba
Domain
LanguageLanguage
VisionVision
MultimodalMultimodal
Task
Visual question answeringVisual question answering
Video descriptionVideo description
Language modelingLanguage modeling
Language generationLanguage generation
+4 more
Alibaba

Qwen2-VL-7B

By Alibaba
Domain
LanguageLanguage
VisionVision
MultimodalMultimodal
Task
Visual question answeringVisual question answering
Video descriptionVideo description
Language modelingLanguage modeling
Language generationLanguage generation
+4 more
New York University NYU

Cambrian-1-13B

By New York University NYU
Domain
MultimodalMultimodal
VisionVision
LanguageLanguage
Task
Image captioningImage captioning
Visual question answeringVisual question answering
Character recognition OCRCharacter recognition OCR