Back to Models
Meta Llama 3.2 11B Vision Instruct
Standard
Instruction-tuned image reasoning generative model (text + images in / text out) optimized for visual recognition, image reasoning, captioning and answering general questions about the image.
Features
VisionPDF ComprehensionImage generation
Model details
- Provider
- Meta
- Context
- Unknown
- Multimodal
- No
Benchmark performance
via Artificial AnalysisIntelligence
Coding
Math
Knowledge
29%
Creative Writing
21%
Speed
14%
Value
28%