Close Menu
    Facebook LinkedIn YouTube WhatsApp X (Twitter) Pinterest
    Trending
    • Scandi-style tiny house combines smart storage and simple layout
    • Our Favorite Apple Watch Has Never Been Less Expensive
    • Vercel says it detected unauthorized access to its internal systems after a hacker using the ShinyHunters handle claimed a breach on BreachForums (Lawrence Abrams/BleepingComputer)
    • Today’s NYT Strands Hints, Answer and Help for April 20 #778
    • KV Cache Is Eating Your VRAM. Here’s How Google Fixed It With TurboQuant.
    • OneOdio Focus A1 Pro review
    • The 11 Best Fans to Buy Before It Gets Hot Again (2026)
    • A look at Dylan Patel’s SemiAnalysis, an AI newsletter and research firm that expects $100M+ in 2026 revenue from subscriptions and AI supply chain research (Abram Brown/The Information)
    Facebook LinkedIn WhatsApp
    Times FeaturedTimes Featured
    Sunday, April 19
    • Home
    • Founders
    • Startups
    • Technology
    • Profiles
    • Entrepreneurs
    • Leaders
    • Students
    • VC Funds
    • More
      • AI
      • Robotics
      • Industries
      • Global
    Times FeaturedTimes Featured
    Home»News»Meta’s surprise Llama 4 drop exposes the gap between AI ambition and reality
    News

    Meta’s surprise Llama 4 drop exposes the gap between AI ambition and reality

    Editor Times FeaturedBy Editor Times FeaturedApril 8, 2025No Comments2 Mins Read
    Facebook Twitter Pinterest Telegram LinkedIn Tumblr WhatsApp Email
    Share
    Facebook Twitter LinkedIn Pinterest Telegram Email WhatsApp Copy Link

    Meta constructed the Llama 4 fashions utilizing a mixture-of-experts (MoE) structure, which is a technique across the limitations of operating enormous AI fashions. Consider MoE like having a big group of specialised employees; as an alternative of everybody engaged on each process, solely the related specialists activate for a particular job.

    For instance, Llama 4 Maverick incorporates a 400 billion parameter measurement, however solely 17 billion of these parameters are energetic without delay throughout considered one of 128 consultants. Likewise, Scout options 109 billion complete parameters, however solely 17 billion are energetic without delay throughout considered one of 16 consultants. This design can cut back the computation wanted to run the mannequin, since smaller parts of neural community weights are energetic concurrently.

    Llama’s actuality test arrives shortly

    Present AI fashions have a comparatively restricted short-term reminiscence. In AI, a context window acts considerably in that style, figuring out how a lot data it could course of concurrently. AI language fashions like Llama sometimes course of that reminiscence as chunks of information referred to as tokens, which may be entire phrases or fragments of longer phrases. Massive context home windows permit AI fashions to course of longer paperwork, bigger code bases, and longer conversations.

    Regardless of Meta’s promotion of Llama 4 Scout’s 10 million token context window, builders have to this point found that utilizing even a fraction of that quantity has confirmed difficult because of reminiscence limitations. Willison reported on his weblog that third-party companies offering entry, like Groq and Fireworks, restricted Scout’s context to only 128,000 tokens. One other supplier, Collectively AI, provided 328,000 tokens.

    Proof suggests accessing bigger contexts requires immense sources. Willison pointed to Meta’s personal instance pocket book (“build_with_llama_4“), which states that operating a 1.4 million token context wants eight high-end Nvidia H100 GPUs.

    Willison documented his personal testing troubles. When he requested Llama 4 Scout by way of the OpenRouter service to summarize a protracted on-line dialogue (round 20,000 tokens), the outcome wasn’t helpful. He described the output as “full junk output,” which devolved into repetitive loops.



    Source link

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Editor Times Featured
    • Website

    Related Posts

    Vercel says it detected unauthorized access to its internal systems after a hacker using the ShinyHunters handle claimed a breach on BreachForums (Lawrence Abrams/BleepingComputer)

    April 19, 2026

    A look at Dylan Patel’s SemiAnalysis, an AI newsletter and research firm that expects $100M+ in 2026 revenue from subscriptions and AI supply chain research (Abram Brown/The Information)

    April 19, 2026

    Google is in talks with Marvell Technology to develop a memory processing unit that works alongside TPUs, and a new TPU for running AI models (Qianer Liu/The Information)

    April 19, 2026

    At the Beijing half-marathon, several humanoid robots beat human winners by 10+ minutes; a robot made by Honor beat the human world record held by Jacob Kiplimo (Reuters)

    April 19, 2026

    A look at the AI nonprofit METR, whose time-horizon metrics are used by AI researchers and Wall Street investors to track the rapid development of AI systems (Kevin Roose/New York Times)

    April 19, 2026

    Binance and Bitget to probe a rally in RaveDAO’s RAVE token, which surged 4,500% in a week, after ZachXBT alleged RAVE insiders engineered a large short squeeze (Francisco Rodrigues/CoinDesk)

    April 19, 2026

    Comments are closed.

    Editors Picks

    Scandi-style tiny house combines smart storage and simple layout

    April 19, 2026

    Our Favorite Apple Watch Has Never Been Less Expensive

    April 19, 2026

    Vercel says it detected unauthorized access to its internal systems after a hacker using the ShinyHunters handle claimed a breach on BreachForums (Lawrence Abrams/BleepingComputer)

    April 19, 2026

    Today’s NYT Strands Hints, Answer and Help for April 20 #778

    April 19, 2026
    Categories
    • Founders
    • Startups
    • Technology
    • Profiles
    • Entrepreneurs
    • Leaders
    • Students
    • VC Funds
    About Us
    About Us

    Welcome to Times Featured, an AI-driven entrepreneurship growth engine that is transforming the future of work, bridging the digital divide and encouraging younger community inclusion in the 4th Industrial Revolution, and nurturing new market leaders.

    Empowering the growth of profiles, leaders, entrepreneurs businesses, and startups on international landscape.

    Asia-Middle East-Europe-North America-Australia-Africa

    Facebook LinkedIn WhatsApp
    Featured Picks

    Minnesota court dismisses lawsuit against tribal gaming executives

    August 23, 2025

    Google’s New Chrome ‘Auto Browse’ Agent Attempts to Roam the Web Without You

    January 28, 2026

    Simple gene change grows much bigger tomatoes and eggplants

    March 7, 2025
    Categories
    • Founders
    • Startups
    • Technology
    • Profiles
    • Entrepreneurs
    • Leaders
    • Students
    • VC Funds
    Copyright © 2024 Timesfeatured.com IP Limited. All Rights.
    • Privacy Policy
    • Disclaimer
    • Terms and Conditions
    • About us
    • Contact us

    Type above and press Enter to search. Press Esc to cancel.