Back to Explore

Llama 3.2 11B Vision Instruct

Chat

Meta

meta-llama/llama-3.2-11b-vision-instruct

Llama 3.2 11B Vision is a multimodal model with 11 billion parameters, designed to handle tasks combining visual and textual data. It excels in tasks such as image captioning and...

3

credits / gen

Try this model
Vision File Support 131K Context Vision (OR)

About this model

Llama 3.2 11B Vision is a multimodal model with 11 billion parameters, designed to handle tasks combining visual and textual data. It excels in tasks such as image captioning and...

Technical Specifications

Provider

Meta

Type

Chat

Context Window

131,072 tokens

Pricing

3 credits

Knowledge Cutoff

2023-12-31

Capabilities

Vision

Can process and understand images

File Support

Can read PDF, DOCX, XLSX & more

131K Context

Large context window for long documents

Vision (OR)

OpenRouter reports vision support