    Times Featured

    DeepSeek tests “sparse attention” to slash AI processing costs

    By Editor Times Featured, October 1, 2025

    The attention bottleneck

    In AI, “attention” is a term for a software technique that determines which words in a text are most relevant to understanding each other. Those relationships map out context, and context builds meaning in language. For example, in the sentence “The bank raised interest rates,” attention helps the model establish that “bank” relates to “interest rates” in a financial context, not a riverbank context. Through attention, conceptual relationships become quantified as numbers stored in a neural network. Attention also governs how AI language models choose what information “matters most” when generating each word of their response.
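
    As a rough illustration of how attention turns word relationships into numbers, the sketch below scores the word “bank” against every word in the example sentence and converts the scores into weights that sum to 1. The two-dimensional vectors are invented for this example (they are not taken from any real model), and the sketch reuses one vector per word for both the query and the key, whereas real Transformer models learn separate projections for each.

```python
import math

def softmax(scores):
    """Turn raw scores into positive weights that sum to 1."""
    highest = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - highest) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attention_weights(query, keys):
    """Scaled dot-product scores of one query against every key, then softmax."""
    scale = math.sqrt(len(query))
    scores = [sum(q * k for q, k in zip(query, key)) / scale for key in keys]
    return softmax(scores)

# Hypothetical toy vectors (assumption): the first axis loosely means
# "finance", the second loosely means "geography".
TOY_VECTORS = {
    "The": [0.1, 0.1],
    "bank": [1.0, 0.6],
    "raised": [0.4, 0.1],
    "interest": [1.2, 0.0],
    "rates": [1.1, 0.0],
}

words = list(TOY_VECTORS)
weights = attention_weights(TOY_VECTORS["bank"], [TOY_VECTORS[w] for w in words])

for word, weight in zip(words, weights):
    print(f"bank -> {word}: {weight:.3f}")
```

    With these made-up vectors, “bank” gives more weight to “interest” and “rates” than to “The” or “raised”, which is the kind of financial-context link the paragraph above describes.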

    Calculating context with a machine is tricky, and it wasn’t practical at scale until chips like GPUs, which can calculate these relationships in parallel, reached a certain level of capability. Even so, the original Transformer architecture from 2017 checked the relationship of each word in a prompt with every other word in a kind of brute-force way. So if you fed 1,000 words of a prompt into the AI model, it resulted in 1,000 × 1,000 comparisons, or 1 million relationships to compute. With 10,000 words, that becomes 100 million relationships. The cost grows quadratically, which creates a fundamental bottleneck for processing long conversations.
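
    The arithmetic above can be checked with a few lines of code. This is only a counting sketch under the article's simplification that every word is compared with every other word (including itself); the function name is made up for illustration.

```python
def pairwise_comparisons(num_words):
    """Comparisons needed when every word is checked against every word."""
    return num_words * num_words

for num_words in (1_000, 10_000):
    print(f"{num_words:>6,} words -> {pairwise_comparisons(num_words):>12,} comparisons")

# Making the prompt 10 times longer multiplies the work by 100.
growth = pairwise_comparisons(10_000) // pairwise_comparisons(1_000)
```

    The ratio stored in `growth` is what “quadratic” means in practice: the work scales with the square of the prompt length, not with the length itself.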

    Although it’s likely that OpenAI uses some sparse attention techniques in GPT-5, long conversations still suffer performance penalties. Every time you submit a new response to ChatGPT, the AI model at its heart processes context comparisons for the entire conversation history all over again.
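
    To see why re-reading the whole history hurts, the sketch below adds up the comparison counts for a conversation that grows by a fixed number of tokens each turn. It is a simplified model, not a description of how ChatGPT works internally: it assumes full quadratic attention over the entire history on every turn and ignores any caching or sparse attention a production system might use. The function name and the per-turn token count are hypothetical.

```python
def reprocessing_cost(tokens_per_turn, num_turns):
    """Comparisons per turn if the full history is re-processed each time."""
    costs = []
    history = 0
    for _ in range(num_turns):
        history += tokens_per_turn        # the conversation keeps growing
        costs.append(history * history)   # full quadratic pass over the history
    return costs

costs = reprocessing_cost(tokens_per_turn=100, num_turns=3)
total = sum(costs)

for turn, cost in enumerate(costs, start=1):
    print(f"turn {turn}: {cost:,} comparisons")
print(f"total: {total:,} comparisons")
```

    Under these assumptions, each later turn costs more than the one before it, so the total for a long conversation climbs much faster than the number of turns.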

    Of course, the researchers behind the original Transformer model designed it for machine translation with relatively short sequences (maybe a few hundred tokens, which are chunks of data that represent words), where quadratic attention was manageable. It’s when people started scaling to thousands or tens of thousands of tokens that the quadratic cost became prohibitive.



    Copyright © 2024 Timesfeatured.com IP Limited. All Rights.