    DeepSeek tests “sparse attention” to slash AI processing costs

    By Editor Times Featured, October 1, 2025

    The attention bottleneck

    In AI, “attention” is a term for a software technique that determines which words in a text are most relevant to understanding each other. Those relationships map out context, and context builds meaning in language. For example, in the sentence “The bank raised interest rates,” attention helps the model establish that “bank” relates to “interest rates” in a financial context, not a riverbank context. Through attention, conceptual relationships become quantified as numbers stored in a neural network. Attention also governs how AI language models choose what information “matters most” when generating each word of their response.
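    As a rough illustration of how relevance between words can be turned into numbers, the sketch below computes scaled dot-product attention weights, the form used in the original Transformer, for the word “bank” against every word in a shortened version of the example sentence. The 2-D vectors are hypothetical, hand-picked values chosen only for illustration. Real models learn much larger vectors and apply separate learned projections for queries and keys, which this sketch omits.

```python
import math

def attention_weights(query, keys):
    """Softmax over scaled dot products between one query and every key."""
    d = len(query)
    scores = [sum(q * k for q, k in zip(query, key)) / math.sqrt(d) for key in keys]
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical hand-picked 2-D vectors; real models learn these values.
toy_vectors = {
    "bank": [1.0, 0.2],
    "raised": [0.1, 0.3],
    "interest": [0.9, 0.1],
    "rates": [0.8, 0.0],
}

words = list(toy_vectors)
weights = attention_weights(toy_vectors["bank"], [toy_vectors[w] for w in words])
for word, weight in zip(words, weights):
    print(f"bank -> {word}: {weight:.2f}")
```

    The weights sum to 1, and with these toy vectors “bank” gives more weight to “interest” and “rates” than to “raised”.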

    Calculating context with a machine is tricky, and it wasn’t practical at scale until chips like GPUs that can calculate these relationships in parallel reached a certain level of capability. Even so, the original Transformer architecture from 2017 checked the relationship of each word in a prompt with every other word in a kind of brute force way. So if you fed 1,000 words of a prompt into the AI model, it resulted in 1,000 x 1,000 comparisons, or 1 million relationships to compute. With 10,000 words, that becomes 100 million relationships. The cost grows quadratically, which creates a fundamental bottleneck for processing long conversations.
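    The arithmetic above can be checked in a few lines. This is a minimal sketch that only counts word-to-word comparisons. The function name is hypothetical, and the count ignores everything else a real model computes.

```python
def pairwise_comparisons(num_words: int) -> int:
    """Every word is compared with every word, so the count is n * n."""
    return num_words * num_words

for n in (1_000, 10_000):
    print(f"{n:,} words -> {pairwise_comparisons(n):,} comparisons")
# prints:
# 1,000 words -> 1,000,000 comparisons
# 10,000 words -> 100,000,000 comparisons
```

    A prompt 10 times longer requires 100 times as many comparisons, which is what quadratic growth means in practice.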

    Although it is possible that OpenAI uses some sparse attention techniques in GPT-5, long conversations still suffer performance penalties. Every time you submit a new response to ChatGPT, the AI model at its core processes context comparisons for the entire conversation history all over again.
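    To see why long conversations get expensive, the sketch below models a chat in which the full history is compared against itself on every turn. It is a simplified assumption, not a description of how ChatGPT is implemented: the turn lengths are hypothetical, and real systems may use caching or sparse attention to reduce the work.

```python
def reprocessing_cost(turn_lengths):
    """Return per-turn comparison counts when the full history is re-read each turn."""
    costs = []
    history = 0
    for length in turn_lengths:
        history += length  # the conversation keeps growing
        costs.append(history * history)  # quadratic cost over the whole history
    return costs

turns = [200, 200, 200, 200]  # hypothetical token counts added per turn
costs = reprocessing_cost(turns)
for turn_number, cost in enumerate(costs, start=1):
    print(f"turn {turn_number}: {cost:,} comparisons")
# prints:
# turn 1: 40,000 comparisons
# turn 2: 160,000 comparisons
# turn 3: 360,000 comparisons
# turn 4: 640,000 comparisons
```

    Under this model, the fourth turn costs 16 times as much as the first even though each turn adds the same amount of text.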

    Of course, the researchers behind the original Transformer model designed it for machine translation with relatively short sequences (maybe a few hundred tokens, which are chunks of data that represent words), where quadratic attention was manageable. It’s when people started scaling to thousands or tens of thousands of tokens that the quadratic cost became prohibitive.


