Llama 3.3 70B Instruct
Llama 3.3 is a multilingual large language model optimized for conversational use cases across multiple languages. It is a pre-trained and instruction-tuned generative model with 70 billion parameters, outperforming many open and closed chat models on common industry benchmarks. Llama 3.3 supports a 128,000 token context length and is intended for commercial and research use across multiple languages.
Key Specifications
Timeline
Technical Specifications
Pricing & Availability
Benchmark Results
Model performance metrics across various tests and benchmarks
General Knowledge
Programming
Mathematics
Reasoning
Other Tests
License & Metadata
Similar Models
All ModelsLlama 3.1 70B Instruct
Meta
Llama 3.1 405B Instruct
Meta
Phi 4 Reasoning Plus
Microsoft
Phi 4 Reasoning
Microsoft
Hermes 3 70B
Nous Research
Phi 4
Microsoft
Magistral Small 2506
Mistral AI
ERNIE 4.5
Baidu
Recommendations are based on similarity of characteristics: developer organization, multimodality, parameter size, and benchmark performance. Choose a model to compare or go to the full catalog to browse all available AI models.