Close Menu
    Facebook LinkedIn YouTube WhatsApp X (Twitter) Pinterest
    Trending
    • ‘Sexual Chocolate’ Faces Recalls After FDA Tests Reveal Undisclosed Viagra
    • Manchester gambling raid sparks wider enforcement focus
    • Electrify America Shifts From Prepaid Accounts to Direct Card Payments
    • Ensuring Data Integrity with Cryptographic Hashing and the Ethereum Blockchain
    • Unique telescoping recumbent e-trike turns heads
    • Ask these three questions before choosing a co-founder or regret it later
    • Norse Atlantic Airways Offers Dirt-Cheap Tickets. There’s a Catch
    • Burbank laboratory owner sentenced over Medicare gambling fraud
    Facebook LinkedIn WhatsApp
    Times FeaturedTimes Featured
    Tuesday, June 2
    • Home
    • Founders
    • Startups
    • Technology
    • Profiles
    • Entrepreneurs
    • Leaders
    • Students
    • VC Funds
    • More
      • AI
      • Robotics
      • Industries
      • Global
    Times FeaturedTimes Featured
    Home»AI Technology News»Google DeepMind wants to know if chatbots are just virtue signaling
    AI Technology News

    Google DeepMind wants to know if chatbots are just virtue signaling

    Editor Times FeaturedBy Editor Times FeaturedFebruary 18, 2026No Comments3 Mins Read
    Facebook Twitter Pinterest Telegram LinkedIn Tumblr WhatsApp Email
    Share
    Facebook Twitter LinkedIn Pinterest Telegram Email WhatsApp Copy Link


    With coding and math, you’ve got clear-cut, right solutions you could test, William Isaac, a analysis scientist at Google DeepMind, advised me once I met him and Julia Haas, a fellow analysis scientist on the agency, for an unique preview of their work, which is published in Nature at this time. That’s not the case for ethical questions, which usually have a spread of acceptable solutions: “Morality is a crucial functionality however laborious to guage,” says Isaac.

    “Within the ethical area, there’s no proper and flawed,” provides Haas. “However it’s not by any means a free-for-all. There are higher solutions and there are worse solutions.”

    The researchers have recognized a number of key challenges and steered methods to deal with them. However it’s extra a want listing than a set of ready-made options. “They do a pleasant job of bringing collectively completely different views,” says Vera Demberg, who research LLMs at Saarland College in Germany.

    Higher than “The Ethicist”

    Various research have proven that LLMs can present outstanding ethical competence. One study printed final yr discovered that folks within the US scored moral recommendation from OpenAI’s GPT-4o as being extra ethical, reliable, considerate, and proper than recommendation given by the (human) author of “The Ethicist,” a preferred New York Occasions recommendation column.  

    The issue is that it’s laborious to unpick whether or not such behaviors are a efficiency—mimicking a memorized response, say—or proof that there’s the truth is some form of ethical reasoning going down contained in the mannequin. In different phrases, is it advantage or advantage signaling?

    This query issues as a result of a number of research additionally present simply how untrustworthy LLMs may be. For a begin, fashions may be too desirous to please. They’ve been discovered to flip their reply to an ethical query and say the precise reverse when an individual disagrees or pushes again on their first response. Worse, the solutions an LLM provides to a query can change in response to how it’s offered or formatted. For instance, researchers have discovered that fashions quizzed about political values may give completely different—typically reverse—solutions relying on whether or not the questions supply multiple-choice solutions or instruct the mannequin to reply in its personal phrases.

    In an much more putting case, Demberg and her colleagues offered a number of LLMs, together with variations of Meta’s Llama 3 and Mistral, with a sequence of ethical dilemmas and requested them to select which of two choices was the higher consequence. The researchers discovered that the fashions usually reversed their selection when the labels for these two choices had been modified from “Case 1” and “Case 2” to “(A)” and “(B).”

    Additionally they confirmed that fashions modified their solutions in response to different tiny formatting tweaks, together with swapping the order of the choices and ending the query with a colon as a substitute of a query mark.



    Source link

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Editor Times Featured
    • Website

    Related Posts

    How the Pope’s Magnifica Humanitas offers a template for individuals to meet the AI moment

    May 29, 2026

    The AI Hype Index: AI gets booed in graduation season

    May 28, 2026

    Industry-standard LLM benchmarks in DataRobot

    May 27, 2026

    Rethinking organizational design in the age of agentic AI

    May 26, 2026

    A reality check on the AI jobs hysteria

    May 26, 2026

    It’s time to address the looming crisis in entry-level work.

    May 26, 2026

    Comments are closed.

    Editors Picks

    ‘Sexual Chocolate’ Faces Recalls After FDA Tests Reveal Undisclosed Viagra

    June 2, 2026

    Manchester gambling raid sparks wider enforcement focus

    June 2, 2026

    Electrify America Shifts From Prepaid Accounts to Direct Card Payments

    June 2, 2026

    Ensuring Data Integrity with Cryptographic Hashing and the Ethereum Blockchain

    June 1, 2026
    Categories
    • Founders
    • Startups
    • Technology
    • Profiles
    • Entrepreneurs
    • Leaders
    • Students
    • VC Funds
    About Us
    About Us

    Welcome to Times Featured, an AI-driven entrepreneurship growth engine that is transforming the future of work, bridging the digital divide and encouraging younger community inclusion in the 4th Industrial Revolution, and nurturing new market leaders.

    Empowering the growth of profiles, leaders, entrepreneurs businesses, and startups on international landscape.

    Asia-Middle East-Europe-North America-Australia-Africa

    Facebook LinkedIn WhatsApp
    Featured Picks

    Pufferfish build intricate sand circles to attract mates

    January 3, 2026

    Warframe’s The Old Peace Expansion Revealed: A Perilous Trip to Tau Unfolds Soon

    July 19, 2025

    Premier League Soccer: Stream Arsenal vs. Fulham From Anywhere Live

    May 3, 2026
    Categories
    • Founders
    • Startups
    • Technology
    • Profiles
    • Entrepreneurs
    • Leaders
    • Students
    • VC Funds
    Copyright © 2024 Timesfeatured.com IP Limited. All Rights.
    • Privacy Policy
    • Disclaimer
    • Terms and Conditions
    • About us
    • Contact us

    Type above and press Enter to search. Press Esc to cancel.