in

Llama 3.1 on Vertex AI


Today, we’re excited to announce the addition of the Llama 3.1 family of models, including a new 405B model – Meta’s most powerful and versatile model to date — to Vertex AI Model Garden. These additions continue Google Cloud’s commitment to open and flexible AI ecosystems that help you build solutions best-suited to your needs. 

Vertex AI provides a curated collection of first-party, open-source, and third-party models, many of which — including the new Llama models — can be delivered as fully-managed Model-as-a-service (MaaS) offerings. With MaaS, you can choose the foundation model that fits your requirements, access it simply via an API, tailor it with robust development tools, and deploy on our fully-managed infrastructure — all with the simplicity of a single bill and hassle-free infrastructure.

Meta’s Llama 3.1 represents a paradigm shift in open-weight models, boasting unparalleled performance and versatility in its class. This release features a family of models tailored for diverse applications:

  • Llama 3.1 405B: The largest openly available foundation model to date, Llama 3.1 405B sets a new standard among open models for flexibility, control, and innovation. This model opens an array of new possibilities, from generating synthetic data and powering complex reasoning tasks to effortlessly handling direct inference scenarios with minimal fine-tuning.

  • Llama 3.1 8B and 70B: These new versions of Llama 3 models excel at understanding language nuances, grasping context, and performing complex tasks such as translation and dialogue generation. 

You can access the new 405B model in just a few clicks using Model-as-a-Service in preview here, without any setup or infrastructure hassles. General availability begins in the coming weeks. The 8B and 70B models will also be available as MaaS in the coming weeks. All three models are available for self-service in Vertex AI Model Garden starting today, giving you the flexibility to choose your preferred infrastructure.

These models are available as pre-trained and instruction-tuned versions to support your specific needs, and they include an expanded context of 128,000 tokens, offering deeper comprehension of longer, more complex text than earlier generations. Llama 3.1 models also include multilingual support across eight languages, further broadening their reach and applicability.

Using Llama 3.1 in Google Cloud 

Google Cloud’s Vertex AI is a comprehensive AI platform for experimenting with, customizing, and deploying, and monitoring foundation models like Llama 3.1. Llama 3.1 joins over 150 curated, enterprise-ready models already available on Vertex AI Model Garden, expanding your choice and flexibility to choose the best models for your needs and budget, and to keep pace with leap-frogging innovations.

Senators probe OpenAI on safety and employment practices

Senators probe OpenAI on safety and employment practices

Meta's New Llama 3.1 AI Model Is Free, Powerful, and Risky

Meta’s New Llama 3.1 AI Model Is Free, Powerful, and Risky