Close Menu
    Facebook LinkedIn YouTube WhatsApp X (Twitter) Pinterest
    Trending
    • TOP 100 Business Cards of history’s most important people
    • ‘100% Stupid’: MAGA World Is Cautiously Turning on Elon Musk
    • Today’s NYT Mini Crossword Answers for June 7
    • How I Automated My Machine Learning Workflow with Just 10 Lines of Python
    • Saudi Arabia and Egypt reportedly plan Red Sea crossing
    • Elon Musk’s Fight With Trump Threatens $48 Billion in Government Contracts
    • Millions of low-cost Android devices turn home networks into crime platforms
    • Resident Evil 9 Revealed at Summer Game Fest After Early Fake-Out
    Facebook LinkedIn WhatsApp
    Times FeaturedTimes Featured
    Saturday, June 7
    • Home
    • Founders
    • Startups
    • Technology
    • Profiles
    • Entrepreneurs
    • Leaders
    • Students
    • VC Funds
    • More
      • AI
      • Robotics
      • Industries
      • Global
    Times FeaturedTimes Featured
    Home»News»Meta’s surprise Llama 4 drop exposes the gap between AI ambition and reality
    News

    Meta’s surprise Llama 4 drop exposes the gap between AI ambition and reality

    Editor Times FeaturedBy Editor Times FeaturedApril 8, 2025No Comments2 Mins Read
    Facebook Twitter Pinterest Telegram LinkedIn Tumblr WhatsApp Email
    Share
    Facebook Twitter LinkedIn Pinterest Telegram Email WhatsApp Copy Link

    Meta constructed the Llama 4 fashions utilizing a mixture-of-experts (MoE) structure, which is a technique across the limitations of operating enormous AI fashions. Consider MoE like having a big group of specialised employees; as an alternative of everybody engaged on each process, solely the related specialists activate for a particular job.

    For instance, Llama 4 Maverick incorporates a 400 billion parameter measurement, however solely 17 billion of these parameters are energetic without delay throughout considered one of 128 consultants. Likewise, Scout options 109 billion complete parameters, however solely 17 billion are energetic without delay throughout considered one of 16 consultants. This design can cut back the computation wanted to run the mannequin, since smaller parts of neural community weights are energetic concurrently.

    Llama’s actuality test arrives shortly

    Present AI fashions have a comparatively restricted short-term reminiscence. In AI, a context window acts considerably in that style, figuring out how a lot data it could course of concurrently. AI language fashions like Llama sometimes course of that reminiscence as chunks of information referred to as tokens, which may be entire phrases or fragments of longer phrases. Massive context home windows permit AI fashions to course of longer paperwork, bigger code bases, and longer conversations.

    Regardless of Meta’s promotion of Llama 4 Scout’s 10 million token context window, builders have to this point found that utilizing even a fraction of that quantity has confirmed difficult because of reminiscence limitations. Willison reported on his weblog that third-party companies offering entry, like Groq and Fireworks, restricted Scout’s context to only 128,000 tokens. One other supplier, Collectively AI, provided 328,000 tokens.

    Proof suggests accessing bigger contexts requires immense sources. Willison pointed to Meta’s personal instance pocket book (“build_with_llama_4“), which states that operating a 1.4 million token context wants eight high-end Nvidia H100 GPUs.

    Willison documented his personal testing troubles. When he requested Llama 4 Scout by way of the OpenRouter service to summarize a protracted on-line dialogue (round 20,000 tokens), the outcome wasn’t helpful. He described the output as “full junk output,” which devolved into repetitive loops.



    Source link

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Editor Times Featured
    • Website

    Related Posts

    Millions of low-cost Android devices turn home networks into crime platforms

    June 7, 2025

    Anthropic releases custom AI chatbot for classified spy work

    June 6, 2025

    An interview with ASML CEO Christophe Fouquet, as the company navigates political instability in The Netherlands and abroad and the impacts of Trump’s trade war (Adam Satariano/New York Times)

    June 6, 2025

    “In 10 years, all bets are off”—Anthropic CEO opposes decadelong freeze on state AI laws

    June 5, 2025

    Two certificate authorities booted from the good graces of Chrome

    June 4, 2025

    Meta and Yandex are de-anonymizing Android users’ web browsing identifiers

    June 3, 2025

    Comments are closed.

    Editors Picks

    TOP 100 Business Cards of history’s most important people

    June 7, 2025

    ‘100% Stupid’: MAGA World Is Cautiously Turning on Elon Musk

    June 7, 2025

    Today’s NYT Mini Crossword Answers for June 7

    June 7, 2025

    How I Automated My Machine Learning Workflow with Just 10 Lines of Python

    June 7, 2025
    Categories
    • Founders
    • Startups
    • Technology
    • Profiles
    • Entrepreneurs
    • Leaders
    • Students
    • VC Funds
    About Us
    About Us

    Welcome to Times Featured, an AI-driven entrepreneurship growth engine that is transforming the future of work, bridging the digital divide and encouraging younger community inclusion in the 4th Industrial Revolution, and nurturing new market leaders.

    Empowering the growth of profiles, leaders, entrepreneurs businesses, and startups on international landscape.

    Asia-Middle East-Europe-North America-Australia-Africa

    Facebook LinkedIn WhatsApp
    Featured Picks

    AT&T finally has a network test drive program

    October 23, 2024

    Capitalising on the value of proprietary data

    March 7, 2025

    How multi-site manufacturers use cobot solutions to boost profitability—quickly and safely

    May 10, 2025
    Categories
    • Founders
    • Startups
    • Technology
    • Profiles
    • Entrepreneurs
    • Leaders
    • Students
    • VC Funds
    Copyright © 2024 Timesfeatured.com IP Limited. All Rights.
    • Privacy Policy
    • Disclaimer
    • Terms and Conditions
    • About us
    • Contact us

    Type above and press Enter to search. Press Esc to cancel.