Close Menu
    Facebook LinkedIn YouTube WhatsApp X (Twitter) Pinterest
    Trending
    • Optimizing AI Agent Planning with Operations Research and Data Science
    • Mercedes-AMG electric GT 4-Door performance EV revealed
    • AI coworker startup Viktor raises €64.7 million Series A after hitting €12.9 million revenue run rate within 10 weeks of launch
    • The 10 Best TV Shows to Stream This Month (May 2026)
    • CFTC fights Minnesota prediction markets felony restrictions
    • Plex Is Raising Its Lifetime Subscription Price Again, to a Whopping $750
    • Can LLMs Replace Survey Respondents?
    • New copper plates slash data center energy use
    Facebook LinkedIn WhatsApp
    Times FeaturedTimes Featured
    Wednesday, May 20
    • Home
    • Founders
    • Startups
    • Technology
    • Profiles
    • Entrepreneurs
    • Leaders
    • Students
    • VC Funds
    • More
      • AI
      • Robotics
      • Industries
      • Global
    Times FeaturedTimes Featured
    Home»Artificial Intelligence»From Possible to Probable AI Models
    Artificial Intelligence

    From Possible to Probable AI Models

    Editor Times FeaturedBy Editor Times FeaturedMay 20, 2026No Comments8 Mins Read
    Facebook Twitter Pinterest Telegram LinkedIn Tumblr WhatsApp Email
    Share
    Facebook Twitter LinkedIn Pinterest Telegram Email WhatsApp Copy Link


    of years, I’ve been concerned in lots of conversations about generative AI (and also you in all probability have, too!). These conversations assorted in focus, from ones with most of the people about the usage of AI to ones with extra technical individuals concerning the accuracy of fashions. No matter who I’m conversing with, individuals are usually fascinated and interested in what fashions can do.

    Can an LLM write a purposeful kernel driver? It could possibly. Can it write a music about how a lot you like your cat? It certain can. Can a diffusion mannequin generate a photo-realistic picture of a medieval astronaut? It could possibly.

    However, does “can” imply it is going to be good? Seems what’s “potential” for many fashions generally is a surprisingly low bar.

    As somebody who has studied chance or statistics, you in all probability know that in a sufficiently massive pattern house, virtually something turns into potential. The problem will not be figuring out whether or not an end result can occur; it’s understanding how probably that end result is and whether or not we will rely upon it repeatedly.

    That proper there’s something many confuse about chance concept: whether or not it’s associated to generative AI. That distinction issues as a result of constructing a manufacturing AI system may be very completely different from constructing a demo. Demos thrive on fascinating edge circumstances. Manufacturing programs rely upon consistency.

    As AI programs turn into an more and more enormous and vital a part of workflows and decision-making, it’s value revisiting elementary concepts from chance concept and analyzing the place frequent assumptions about AI reliability start to interrupt down.

    1. Dimensionality and the Area of Potentialities

    To be honest, speaking about dependable programs is a lot simpler than constructing them. To grasp why reliability stays very tough, it helps to take a step again and take into consideration pattern areas. Let’s begin with the only of circumstances, a coin flip. For a coin toss: Ω={H,T}Omega = {H, T}. The potential outcomes are simple to visualise as a result of there’s a small house of prospects.

    Now think about a language mannequin producing a sequence of 512 tokens with a vocabulary of fifty,000 potential tokens, which provides a pattern house of measurement 5000051250000^{512}. The scale of this pattern house is sort of inconceivable to grasp, not to mention visualize (in your head or in follow).

    In such circumstances, the place we’ve got a big house, the area similar to helpful, coherent, and factually appropriate outputs can turn into surprisingly small relative to the variety of believable alternate options. In different phrases, the ocean of potential outcomes, what’s possible is a pond…

    When the mannequin returns a solution that it’s potential, however not possible, we name it a hallucination. And a hallucination, then, will not be essentially a software program bug. As a substitute, it occurs as a result of the mannequin is sampling from areas of the distribution with non-zero chance however little sensible worth.

    At first look, chances are you’ll suppose:

    “If we merely accumulate extra knowledge, hallucinations will disappear.”

    However the problem is that hallucinations naturally come up in probabilistic programs. Sampling from a distribution at all times introduces the potential of touchdown in low-probability areas.

    Picture by the creator

    2. Frequentist measurements vs Bayesian expectations

    When evaluating AI programs, there are sometimes two very completely different approaches. The primary is, roughly, a frequentist perspective: you run 1000 benchmark duties and measure efficiency. If a mannequin solves 850 accurately, we name it an 85% correct system.

    The second is a Bayesian perspective, the place you begin with expectations about how an clever system ought to behave and replace these beliefs when surprising failures occur.

    This distinction turns into vital as a result of prompts are not often impartial occasions. Suppose a mannequin solutions 9 math questions accurately. Primarily based on that, we could assume the chance of getting query ten proper is its reported accuracy.

    However language fashions usually are not a set of remoted Bernoulli trials. Their outputs rely upon earlier context, hidden representations, and the density of associated examples throughout the coaching distribution.

    Which suggests their efficiency is usually conditional moderately than static.

    3. Confidence will not be the identical as chance

    One of the generally used features in machine studying is the Softmax perform. We frequently interpret Softmax outputs as confidence scores: “If the mannequin outputs 0.90 for cat, it’s 90% certain.” However this interpretation might be deceptive.

    Okay, step again for a second: the Softmax perform states that due to the exponential time period, small variations between logits might be amplified.

    So, a mannequin can seem extremely assured not as a result of it “is aware of” one thing, however as a result of one logit occurred to be barely bigger than the others and the exponential operation amplified the distinction.

    So when ChatGPT predicts the subsequent phrase, what it’s basically doing is answering:

    “Of all potential tokens, after Softmax, which one is more than likely?”

    This creates what I consider because the “assured idiot” downside: a system confidently asserting one thing incorrect as a result of it has not realized methods to categorical uncertainty.

    Picture by the creator

    4. The Regulation of Massive Numbers and why extra knowledge doesn’t mechanically imply extra reality

    The Regulation of Massive Numbers states that as pattern sizes improve, noticed averages strategy their anticipated values. This concept usually motivates the usage of extraordinarily massive datasets to coach our fashions. In any case, if a mannequin sees sufficient examples, ultimately it ought to study the reality, proper?

    At first look, this sounds cheap, primarily as a result of that’s how we study! However there is a crucial assumption hidden within the Regulation of Massive Numbers: the underlying distribution should stay comparatively steady.

    Human data and language usually are not steady distributions. They alter repeatedly and comprise contradictions, biases, and inaccuracies. Spoken language varies from one space to a different. Even throughout the identical metropolis, individuals would use the identical language, the identical expressions, and the identical phrases in a different way.

    Because of this, the mannequin doesn’t essentially converge towards “reality.” As a substitute, it converges towards dominant patterns. So, if a false impression seems steadily sufficient within the knowledge, the mannequin could study it as a result of, statistically, it turns into probably the most possible continuation.

    5. Stochasticity will not be essentially creativity

    Many usually describe AI programs as “artistic” once they produce stunning outputs. Nonetheless, from a probabilistic perspective, one thing else could also be occurring.

    Temperature sampling modifications the chance that the mannequin selects much less possible tokens. Samples with low temperature are predictable and secure! These with excessive temperature are usually extra various and stunning, usually resulting in a better danger of hallucination.

    So, growing the temperature sampling successfully flattens the chance distribution. Which suggests lower-probability outcomes might be sampled extra steadily. What we typically interpret as creativity could as a substitute be the mannequin exploration of much less probably areas of the distribution.

    Picture by the creator

    6. Shifting from potential to dependable

    If our purpose is to construct AI programs that persistently work in actual environments, we have to transfer past asking whether or not one thing is feasible and give attention to reliability. Once more, simpler mentioned than performed. However, some helpful approaches to try this embody:

    1- Utilizing methods corresponding to Platt Scaling and Isotonic Regression to assist align confidence scores with noticed efficiency.

    2- Utilizing strategies corresponding to Bayesian neural networks or Monte Carlo Dropout to assist quantify what a mannequin doesn’t know.

    3- Utilizing exterior validation strategies to implement output construction and necessities, moderately than assuming the mannequin will naturally observe guidelines.

    Closing Ideas

    A couple of years in the past, everybody was impressed by AI programs that merely predicted the subsequent phrase. Now we’re discovering that predicting the subsequent phrase is simply a part of the issue.

    The more durable problem is predicting the appropriate phrase repeatedly and reliably. Particularly with new fashions popping up daily. With spectacular fashions and lots of guarantees of an amazing efficiency. So, subsequent time you see a powerful AI demo, I encourage you to ask (your self or the individual presenting the mannequin):

    “Is that this what the mannequin sometimes does, or is that this a very fortunate pattern?”

    In a world with practically infinite prospects, virtually something can occur. Engineering, nevertheless, is never about what can occur. It’s about what you possibly can belief to occur once more.



    Source link

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Editor Times Featured
    • Website

    Related Posts

    Optimizing AI Agent Planning with Operations Research and Data Science

    May 20, 2026

    Can LLMs Replace Survey Respondents?

    May 20, 2026

    How to Safely Run Coding Agents

    May 20, 2026

    Introduction to Lean for Programmers

    May 19, 2026

    Deploying a Multistage Multimodal Recommender System on Amazon Elastic Kubernetes Service

    May 19, 2026

    Grounding LLMs with Fresh Web Data to Reduce Hallucinations

    May 19, 2026
    Leave A Reply Cancel Reply

    Editors Picks

    Optimizing AI Agent Planning with Operations Research and Data Science

    May 20, 2026

    Mercedes-AMG electric GT 4-Door performance EV revealed

    May 20, 2026

    AI coworker startup Viktor raises €64.7 million Series A after hitting €12.9 million revenue run rate within 10 weeks of launch

    May 20, 2026

    The 10 Best TV Shows to Stream This Month (May 2026)

    May 20, 2026
    Categories
    • Founders
    • Startups
    • Technology
    • Profiles
    • Entrepreneurs
    • Leaders
    • Students
    • VC Funds
    About Us
    About Us

    Welcome to Times Featured, an AI-driven entrepreneurship growth engine that is transforming the future of work, bridging the digital divide and encouraging younger community inclusion in the 4th Industrial Revolution, and nurturing new market leaders.

    Empowering the growth of profiles, leaders, entrepreneurs businesses, and startups on international landscape.

    Asia-Middle East-Europe-North America-Australia-Africa

    Facebook LinkedIn WhatsApp
    Featured Picks

    Google unveils plans to try again with smart glasses in 2026

    December 9, 2025

    Nvidia shares sink as Chinese AI app DeepSeek spooks US markets

    February 2, 2025

    What Are Those Weird Devices Basketball Players Are Holding?

    February 21, 2025
    Categories
    • Founders
    • Startups
    • Technology
    • Profiles
    • Entrepreneurs
    • Leaders
    • Students
    • VC Funds
    Copyright © 2024 Timesfeatured.com IP Limited. All Rights.
    • Privacy Policy
    • Disclaimer
    • Terms and Conditions
    • About us
    • Contact us

    Type above and press Enter to search. Press Esc to cancel.