April 13, 2025
Meta Launches New Llama 4 Herd AI Models

Meta Launches New Llama 4 Herd AI Models

Posted April 5, 2025 at 11:57pm by iClarified
Meta announced the release of its new AI models today, dubbed the Llama 4 herd. The company introduced two flagship models, Llama 4 Scout and Llama 4 Maverick, alongside a preview of the still-training Llama 4 Behemoth.

Llama 4 Scout, a 17 billion active parameter model with 16 experts, is designed to fit on a single NVIDIA H100 GPU using Int4 quantization. Meta claims it outperforms all previous Llama models and similarly sized competitors like Gemma 3, Gemini 2.0 Flash-Lite, and Mistral 3.1 across widely reported benchmarks. It boasts an industry-leading context window of 10 million tokens, enabling tasks such as multi-document summarization and reasoning over large codebases.

Meta Launches New Llama 4 Herd AI Models


Llama 4 Maverick, also featuring 17 billion active parameters but with 128 experts and 400 billion total parameters, is designed for top-tier multimodal performance. Meta says it surpasses GPT-4o and Gemini 2.0 Flash on several benchmarks, while achieving results comparable to the much larger DeepSeek v3 in reasoning and coding. Despite its scale, it runs on a single NVIDIA H100 host. An experimental chat version of Maverick has achieved an ELO score of 1417 on LMArena.

Powering these models is Llama 4 Behemoth, a 288 billion active parameter teacher model with 16 experts and nearly two trillion total parameters. Though still in training, Meta reports it outperforms GPT-4.5, Claude Sonnet 3.7, and Gemini 2.0 Pro on STEM-focused benchmarks like MATH-500 and GPQA Diamond. Behemoth plays a key role in distilling knowledge to Scout and Maverick, though it is not yet available for public release.

Both Scout and Maverick employ a mixture-of-experts (MoE) architecture—a first for the Llama series—activating only a subset of total parameters per token to improve efficiency. Scout has 109 billion total parameters, while Maverick scales to 400 billion. The models offer native multimodality with early fusion of text and vision tokens, backed by an enhanced MetaCLIP-based vision encoder.

Developers can download Llama 4 Scout and Maverick starting today, April 5, 2025, from llama.com and Hugging Face. Meta is also rolling out access via partners in the coming days. Users can try Meta AI powered by Llama 4 on WhatsApp, Messenger, Instagram Direct, and the Meta.AI website. More details, including technical insights and future plans for the Behemoth model, will be shared at LlamaCon on April 29.


Hit the link below for the full announcement...

Read More


Meta Launches New Llama 4 Herd AI Models
Add Comment
Would you like to be notified when someone replies or adds a new comment?
Yes (All Threads)
Yes (This Thread Only)
No
iClarified Icon
Notifications
Would you like to be notified when we post a new Apple news article or tutorial?
Yes
No
Comments
You must login or register to add a comment...
Recent. Read the latest Apple News.
RECENT
Tutorials. Help is here.
TUTORIALS
Where to Download macOS Sonoma
Where to Download macOS Ventura
AppleTV Firmware Download Locations
Where To Download iPad Firmware Files From
Where To Download iPhone Firmware Files From
Deals. Save on Apple devices and accessories.
DEALS