Llama 3.1 405B is Meta's most advanced and capable model to date. To help you unlock its full potential, please refer to the partner guides below.
Our partner guides offer tailored support and expertise to ensure a seamless deployment process, enabling you to harness the features and capabilities of Llama 3.1 405B. Browse the following partner guides to explore their specific offerings and take the first step towards successful deployment.
Amazon Web Services (AWS) gives you access to Meta's industry-leading Llama models—letting you build and scale sophisticated generative AI applications with ease.
Databricks Mosaic AI has partnered with Meta to support the Llama 3.1 model architecture across the full suite of products:
Google Cloud Platform (GCP) is a suite of cloud computing services offered by Google. Building on top of GCP services, Model Garden on Vertex AI offers infrastructure to jumpstart your ML project, providing a single place to discover, customize, and deploy a wide range of models. You can also use Google Cloud Platform (GCP) to run Meta Llama models on your own managed infrastructure.
Together Turbo and Together Lite endpoints for Llama models enable performance, quality, and price flexibility so enterprises do not have to compromise. Together AI applies cutting-edge research and innovations such as FlashAttention-3, enabling Together Inference Engine 2.0 to deliver up to 15x throughput improvement, while lowering operational costs.