
Meta
meta-llama/llama-3.2-11b-vision-instructLlama 3.2 11B Vision is a multimodal model with 11 billion parameters, designed to handle tasks combining visual and textual data. It excels in tasks such as image captioning and...
3
credits / gen
Llama 3.2 11B Vision is a multimodal model with 11 billion parameters, designed to handle tasks combining visual and textual data. It excels in tasks such as image captioning and...
Provider
Meta
Type
Chat
Context Window
131,072 tokens
Pricing
3 credits
Knowledge Cutoff
2023-12-31
Vision
Can process and understand images
File Support
Can read PDF, DOCX, XLSX & more
131K Context
Large context window for long documents
Vision (OR)
OpenRouter reports vision support