Close Menu
    Facebook LinkedIn YouTube WhatsApp X (Twitter) Pinterest
    Trending
    • OneOdio Focus A1 Pro review
    • The 11 Best Fans to Buy Before It Gets Hot Again (2026)
    • A look at Dylan Patel’s SemiAnalysis, an AI newsletter and research firm that expects $100M+ in 2026 revenue from subscriptions and AI supply chain research (Abram Brown/The Information)
    • ‘Euphoria’ Season 3 Release Schedule: When Does Episode 2 Come Out?
    • Francis Bacon and the Scientific Method
    • Proxy-Pointer RAG: Structure Meets Scale at 100% Accuracy with Smarter Retrieval
    • Sulfur lava exoplanet L 98-59 d defies classification
    • Hisense U7SG TV Review (2026): Better Design, Great Value
    Facebook LinkedIn WhatsApp
    Times FeaturedTimes Featured
    Sunday, April 19
    • Home
    • Founders
    • Startups
    • Technology
    • Profiles
    • Entrepreneurs
    • Leaders
    • Students
    • VC Funds
    • More
      • AI
      • Robotics
      • Industries
      • Global
    Times FeaturedTimes Featured
    Home»AI Technology News»There are more AI health tools than ever—but how well do they work?
    AI Technology News

    There are more AI health tools than ever—but how well do they work?

    Editor Times FeaturedBy Editor Times FeaturedMarch 30, 2026No Comments3 Mins Read
    Facebook Twitter Pinterest Telegram LinkedIn Tumblr WhatsApp Email
    Share
    Facebook Twitter LinkedIn Pinterest Telegram Email WhatsApp Copy Link


    Singhal, the OpenAI well being lead, notes that the corporate’s present GPT-5 collection of fashions, which had not but been launched when the unique HealthBench examine was performed, do a significantly better job of soliciting extra data than their predecessors. Nonetheless, OpenAI has reported that GPT-5.4, the present flagship, is definitely worse at searching for context than GPT-5.2, an earlier model.

    Ideally, Bean says, well being chatbots could be subjected to managed assessments with human customers, as they had been in his examine, earlier than being launched to the general public. That could be a heavy raise, significantly given how briskly the AI world strikes and the way lengthy human research can take. Bean’s personal examine used GPT-4o, which got here out nearly a yr in the past and is now outdated. 

    Earlier this month, Google launched a examine that meets Bean’s requirements. Within the examine, sufferers mentioned medical issues with the corporate’s Articulate Medical Intelligence Explorer (AMIE), a medical LLM chatbot that’s not but out there to the general public, earlier than assembly with a human doctor. General, AMIE’s diagnoses had been simply as correct as physicians’, and not one of the conversations raised main security issues for researchers. 

    Regardless of the encouraging outcomes, Google isn’t planning to launch AMIE anytime quickly. “Whereas the analysis has superior, there are important limitations that have to be addressed earlier than real-world translation of methods for analysis and therapy, together with additional analysis into fairness, equity, and security testing,” wrote Alan Karthikesalingam, a analysis scientist at Google DeepMind, in an electronic mail. Google did not too long ago reveal that Health100, a well being platform it’s constructing in partnership with CVS, will embody an AI assistant powered by its flagship Gemini fashions, although that software will presumably not be supposed for analysis or therapy.

    Rodman, who led the AMIE examine with Karthikesalingam, doesn’t suppose such in depth, multiyear research are essentially the fitting strategy for chatbots like ChatGPT Well being and Copilot Well being. “There’s a number of causes that the medical trial paradigm doesn’t at all times work in generative AI,” he says. “And that’s the place this benchmarking dialog is available in. Are there benchmarks [from] a trusted third social gathering that we will agree are significant, that the labs can maintain themselves to?”

    They key there’s “third social gathering.” Regardless of how extensively firms consider their very own merchandise, it’s powerful to belief their conclusions fully. Not solely does a third-party analysis convey impartiality, but when there are lots of third events concerned, it additionally helps shield in opposition to blind spots.

    OpenAI’s Singhal says he’s strongly in favor of exterior analysis. “We strive our greatest to help the group,” he says. “A part of why we put out HealthBench was really to provide the group and different mannequin builders an instance of what an excellent analysis appears like.” 

    Given how costly it’s to supply a high-quality analysis, he says, he’s skeptical that any particular person educational laboratory would be capable of produce what he calls “the one analysis to rule all of them.” However he does communicate extremely of efforts that educational teams have made to convey preexisting and novel evaluations collectively into complete evaluations suites—similar to Stanford’s MedHELM framework, which assessments fashions on all kinds of medical duties. At the moment, OpenAI’s GPT-5 holds the best MedHELM rating.



    Source link

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Editor Times Featured
    • Website

    Related Posts

    How robots learn: A brief, contemporary history

    April 17, 2026

    Vibe Coding Best Practices: 5 Claude Code Habits

    April 16, 2026

    Why having “humans in the loop” in an AI war is an illusion

    April 16, 2026

    Making AI operational in constrained public sector environments

    April 16, 2026

    Treating enterprise AI as an operating layer

    April 16, 2026

    Building trust in the AI era with privacy-led UX

    April 15, 2026

    Comments are closed.

    Editors Picks

    OneOdio Focus A1 Pro review

    April 19, 2026

    The 11 Best Fans to Buy Before It Gets Hot Again (2026)

    April 19, 2026

    A look at Dylan Patel’s SemiAnalysis, an AI newsletter and research firm that expects $100M+ in 2026 revenue from subscriptions and AI supply chain research (Abram Brown/The Information)

    April 19, 2026

    ‘Euphoria’ Season 3 Release Schedule: When Does Episode 2 Come Out?

    April 19, 2026
    Categories
    • Founders
    • Startups
    • Technology
    • Profiles
    • Entrepreneurs
    • Leaders
    • Students
    • VC Funds
    About Us
    About Us

    Welcome to Times Featured, an AI-driven entrepreneurship growth engine that is transforming the future of work, bridging the digital divide and encouraging younger community inclusion in the 4th Industrial Revolution, and nurturing new market leaders.

    Empowering the growth of profiles, leaders, entrepreneurs businesses, and startups on international landscape.

    Asia-Middle East-Europe-North America-Australia-Africa

    Facebook LinkedIn WhatsApp
    Featured Picks

    Epson launches AirPlay 3LCD projectors for home and office

    August 24, 2025

    London’s Cyb3r Operations raises €4.6 million led by Octopus Ventures to tackle third-party cyber risk

    January 15, 2026

    Facebook tests £9.99 monthly subscription for sharing more than two links

    December 18, 2025
    Categories
    • Founders
    • Startups
    • Technology
    • Profiles
    • Entrepreneurs
    • Leaders
    • Students
    • VC Funds
    Copyright © 2024 Timesfeatured.com IP Limited. All Rights.
    • Privacy Policy
    • Disclaimer
    • Terms and Conditions
    • About us
    • Contact us

    Type above and press Enter to search. Press Esc to cancel.