Grok-1.5V

Name: Grok-1.5V
Author: xAI

A multimodal model that processes text together with visual input, including documents, diagrams, charts, screenshots, and photos. It offers strong real-world spatial understanding.

Key Specifications

Release Date

April 12, 2024

Average Score

71.9%

Timeline

Key dates in the model's history

Announcement

April 12, 2024

Last Update

July 19, 2025

Technical Specifications

Capabilities

Multimodal, ZeroEval

Benchmark Results

Model performance metrics across various tests and benchmarks

Multimodal

Working with images and visual data

AI2D

Zero-shot evaluation • Self-reported

88.3%

ChartQA

Zero-shot evaluation • Self-reported

76.1%

DocVQA

Zero-shot evaluation • Self-reported

85.6%

MathVista

Zero-shot evaluation • Self-reported

52.8%

MMMU

Zero-shot evaluation • Self-reported

53.6%

Other Tests

Specialized benchmarks

RealWorldQA

Zero-shot evaluation • Self-reported

68.7%

TextVQA

Zero-shot evaluation • Self-reported

78.1%
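
The Average Score of 71.9% listed under Key Specifications appears consistent with a plain unweighted mean of the seven self-reported benchmark scores. The page does not state how the average is computed, so the averaging method in this minimal sketch is an assumption, and the variable names are illustrative only.

```python
from statistics import mean

# Self-reported scores (percent) copied from the benchmark results on this page.
scores = {
    "AI2D": 88.3,
    "ChartQA": 76.1,
    "DocVQA": 85.6,
    "MathVista": 52.8,
    "MMMU": 53.6,
    "RealWorldQA": 68.7,
    "TextVQA": 78.1,
}

# Assumption: the catalog's "Average Score" is an unweighted arithmetic mean,
# rounded to one decimal place. The source does not confirm this method.
average = round(mean(scores.values()), 1)

print(f"Average over {len(scores)} benchmarks: {average}%")
```

Under that assumption the result agrees with the listed figure. A weighted or category-level average would generally give a different number.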

License & Metadata

License

proprietary

Similar Models

Grok-2

xAI

Best score: 0.9 (HumanEval)

Released: Aug 2024

Price: $2.00/1M tokens

Grok-4

xAI

Best score: 0.9 (GPQA)

Released: Jul 2025

Price: $3.00/1M tokens

Grok-2 mini

xAI

Best score: 0.9 (MMLU)

Released: Aug 2024

Grok 4.20

xAI

Released: Mar 2026

Grok-4.1 Fast Non-Reasoning

xAI

Released: Nov 2025

Price: $0.20/1M tokens

Grok-4.1 Fast Reasoning

xAI

Released: Nov 2025

Price: $0.20/1M tokens

Grok-4 Fast Non-Reasoning

xAI

Released: Aug 2025

Price: $0.20/1M tokens

Grok-4 Fast Reasoning

xAI

Released: Aug 2025

Price: $0.20/1M tokens

Recommendations are based on similarity in developer organization, multimodality, parameter size, and benchmark performance.