Llama 3

Llama 3

The open-source AI models you can fine-tune, distill and deploy anywhere. Choose from our collection of models: Llama 3.1, Llama 3.2, Llama 3.3.
Learn more

Llama 3 models

Llama includes multilingual text-only models (1B, 3B), including quantized versions, text-image models (11B, 90B) and Llama 3.3 70B model offering similar performance to the Llama 3.1 405B model, allowing developers to achieve greater quality and performance on text-based applications at a fraction of the cost.
Start building
Multilingual

Llama 3.1

•
8B: Light-weight, ultra-fast model you can run anywhere.
•
405B: Flagship foundation model driving widest variety of use cases
Download models
Lightweight and Multimodal

Llama 3.2

•
1B and 3B: Light-weight, efficient models you can run everywhere on mobile and on edge devices.
•
11B and 90B: Multimodal models that are flexible and can reason on high resolution images.
Download models
Multilingual

Llama 3.3

•
70B: Experience leading performance and quality at a fraction of the cost with our latest release.
Download models

Download the 405B model

Learn more about building with Llama 3

placeholder-image

Llama case studies

See how other innovators are building with Llama.
Learn more
placeholder-image

Llama Cookbooks

The Llama Cookbook Github repo has what you need to get started from recipes to notebooks.
Learn more
placeholder-image

Videos

Check out our video page to watch tutorials on Llama.
Learn more

Do more with Llama

On-Device
Multimodal
Llama Stack
On-device
On-device
Use our 1B or 3B models for on device applications such as summarizing a discussion from your phone or calling on-device tools like calendar.
Multimodal
Multimodal
Use our 11B or 90B models for image use cases such as transforming an existing image into something new or getting more information from an image of your surroundings.
Llama Stack
Llama Stack
Seamlessly build agentic applications from a comprehensive toolchain.
On-device
Use our 1B or 3B models for on device applications such as summarizing a discussion from your phone or calling on-device tools like calendar.

Model evaluations

We evaluated performance on over 150 benchmark datasets that span a wide range of languages. For the vision LLMs, we evaluated performance on benchmarks for image understanding and visual reasoning. In addition, we performed extensive human evaluations that compare Llama with competing models in real-world scenarios.

Instruction-tuned benchmarks

Category
Benchmark

General

MMLU Chat

(0-shot, CoT)

MMLU PRO

(5-shot, CoT)

Instruction Following

IFEval

Code

HumanEval

(0-shot)

MBPP EvalPlus

(base) (0-shot)

Math

MATH

(0-sho, CoT)

Reasoning

GPQA Diamond

(0-shot, CoT)

Tool use

BFCL v2

(0-shot)

Long context

NIH/Multi-needle

Multilingual

Multilingual MGSM

(0-shot)

Pricing*

1M Input tokens

(Cheapest among providers)*

1M Output tokens

(Cheapest among providers)*

Llama 3.1 70B

86.0

66.4

87.5

80.5

86.0

67.8

48.0

77.5

97.5

86.9

$0.1

$0.4

Llama 3.3 70B

86.0

68.9

92.1

88.4

87.6

77.0

50.5

77.3

97.5

91.1

$0.1

$0.4

Amazon Nova
Pro

85.9

-

92.1

89.0

-

76.6

-

-

-

-

$0.80

$3.20

Llama 3.1 405B

88.6

73.4

88.6

89.0

88.6

73.9

49.0

81.1

98.1

91.6

$1.0

$1.8

Gemini Pro
1.5

87.1

76.1

81.9

89.0

87.8

82.9

53.5

80.3

94.7

89.6

$1.30

$5.0

GPT-4o

87.5

73.8

84.6

86.0

83.9

76.9

47.5

74.0

-

90.6

2.5$

10.0$

Claude 3.5
Sonnet

88.9

77.8

89.3

93.7

86.8

78.3

65.0

79.3

99.4

92.8

$3.0

$15.0

* API Pricing based on publicly available data on Artificial Analysis as of 12/3/24.

Category
Benchmark

General

MMLU Chat

(0-shot, CoT)

MMLU PRO

(5-shot, CoT)

Instruction Following

IFEval

Code

HumanEval

(0-shot)

MBPP EvalPlus

(base) (0-shot)

Math

MATH

(0-sho, CoT)

Reasoning

GPQA Diamond

(0-shot, CoT)

Tool use

BFCL v2

(0-shot)

Long context

NIH/Multi-needle

Multilingual

Multilingual MGSM

(0-shot)

Pricing*

1M Input tokens

(Cheapest among providers)*

1M Output tokens

(Cheapest among providers)*

Llama 3.1 70B

86.0

66.4

87.5

80.5

86.0

67.8

48.0

77.5

97.5

86.9

$0.1

$0.4

Llama 3.3 70B

86.0

68.9

92.1

88.4

87.6

77.0

50.5

77.3

97.5

91.1

$0.1

$0.4

Amazon Nova
Pro

85.9

-

92.1

89.0

-

76.6

-

-

-

-

$0.80

$3.20

Llama 3.1 405B

88.6

73.4

88.6

89.0

88.6

73.9

49.0

81.1

98.1

91.6

$1.0

$1.8

Gemini Pro
1.5

87.1

76.1

81.9

89.0

87.8

82.9

53.5

80.3

94.7

89.6

$1.30

$5.0

GPT-4o

87.5

73.8

84.6

86.0

83.9

76.9

47.5

74.0

-

90.6

2.5$

10.0$

Claude 3.5
Sonnet

88.9

77.8

89.3

93.7

86.8

78.3

65.0

79.3

99.4

92.8

$3.0

$15.0

* API Pricing based on publicly available data on Artificial Analysis as of 12/3/24.

Horizon banner image
llama protections

Llama Protections

Making safety tools accessible to everyone.
Enabling developers, advancing safety, and building an open ecosystem.
Learn more

Resources

deep learning ai course graphic
Deep Learning AI course
Introducing Multimodal Llama 3.2 with Amit Sangani
Learn more
  Knowledge Distillation with Llama 3.1 405B graphic
Knowledge Distillation with Llama 3.1 405B
Watch this video in order to explore the use of large language models (LLMs) for fine-tuning and synthetic data generation.
Watch now
Understanding the Llama 3 Tokenizer
Understanding the Llama 3 Tokenizer
Aston Zhang, research scientist working on Llama at Meta discusses the new tokenizer in Llama 3.
Watch now
Horizon banner image

Stay up-to-date

Our latest updates delivered to your inbox

Subscribe to our newsletter to keep up with the latest Llama updates, releases and more.