Close Menu
    Facebook LinkedIn YouTube WhatsApp X (Twitter) Pinterest
    Trending
    • Inheritance: A Software Engineering Concept Data Scientists Must Know To Succeed
    • Maryland startup InventWood to launch Superwood stronger than steel
    • EU-Startups Podcast | Episode 118: Ash Arora, Partner at LocalGlobe
    • Why 3D-Printing an Untraceable Ghost Gun Is Easier Than Ever
    • New Claude 4 AI model refactored code for 7 hours straight
    • Marvel Rivals’ Sharknado Team-Up Ability Cements the Game’s Fun Direction
    • Truck Platooning: The Near Future of Freight
    • Multiple Linear Regression Analysis | Towards Data Science
    Facebook LinkedIn WhatsApp
    Times FeaturedTimes Featured
    Friday, May 23
    • Home
    • Founders
    • Startups
    • Technology
    • Profiles
    • Entrepreneurs
    • Leaders
    • Students
    • VC Funds
    • More
      • AI
      • Robotics
      • Industries
      • Global
    Times FeaturedTimes Featured
    Home»News»Eerily realistic AI voice demo sparks amazement and discomfort online
    News

    Eerily realistic AI voice demo sparks amazement and discomfort online

    Editor Times FeaturedBy Editor Times FeaturedMarch 5, 2025No Comments2 Mins Read
    Facebook Twitter Pinterest Telegram LinkedIn Tumblr WhatsApp Email
    Share
    Facebook Twitter LinkedIn Pinterest Telegram Email WhatsApp Copy Link

    An instance argument with Sesame’s CSM created by Gavin Purcell.

    An instance argument with Sesame’s CSM created by Gavin Purcell.

    Gavin Purcell, co-host of the AI for Humans podcast, posted an example video on Reddit the place the human pretends to be an embezzler and argues with a boss. It is so dynamic that it is tough to inform who the human is and which one is the AI mannequin. Judging by our personal demo, it is completely able to what you see within the video.

    “Close to-human high quality”

    Beneath the hood, Sesame’s CSM achieves its realism by utilizing two AI fashions working collectively (a spine and a decoder) based mostly on Meta’s Llama structure that processes interleaved textual content and audio. Sesame skilled three AI mannequin sizes, with the biggest utilizing 8.3 billion parameters (an 8 billion spine mannequin plus a 300 million parameter decoder) on roughly 1 million hours of primarily English audio.

    Sesame’s CSM does not comply with the standard two-stage method utilized by many earlier text-to-speech programs. As an alternative of producing semantic tokens (high-level speech representations) and acoustic particulars (fine-grained audio options) in two separate phases, Sesame’s CSM integrates right into a single-stage, multimodal transformer-based mannequin, collectively processing interleaved textual content and audio tokens to provide speech. OpenAI’s voice mannequin makes use of the same multimodal method.

    In blind assessments with out conversational context, human evaluators confirmed no clear desire between CSM-generated speech and actual human recordings, suggesting the mannequin achieves near-human high quality for remoted speech samples. Nevertheless, when supplied with conversational context, evaluators nonetheless persistently most popular actual human speech, indicating a niche stays in totally contextual speech technology.

    Sesame co-founder Brendan Iribe acknowledged present limitations in a touch upon Hacker Information, noting that the system is “nonetheless too keen and sometimes inappropriate in its tone, prosody and pacing” and has points with interruptions, timing, and dialog movement. “In the present day, we’re firmly within the valley, however we’re optimistic we are able to climb out,” he wrote.



    Source link

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Editor Times Featured
    • Website

    Related Posts

    New Claude 4 AI model refactored code for 7 hours straight

    May 23, 2025

    Destructive malware available in NPM repo went unnoticed for 2 years

    May 23, 2025

    VMware cloud partners demand “firm regulatory action” on Broadcom

    May 22, 2025

    Authorities carry out global takedown of infostealer used by cybercriminals

    May 22, 2025

    Apple legend Jony Ive takes control of OpenAI’s design future

    May 22, 2025

    “Microsoft has simply given us no other option,” Signal says as it blocks Windows Recall

    May 21, 2025

    Comments are closed.

    Editors Picks

    Inheritance: A Software Engineering Concept Data Scientists Must Know To Succeed

    May 23, 2025

    Maryland startup InventWood to launch Superwood stronger than steel

    May 23, 2025

    EU-Startups Podcast | Episode 118: Ash Arora, Partner at LocalGlobe

    May 23, 2025

    Why 3D-Printing an Untraceable Ghost Gun Is Easier Than Ever

    May 23, 2025
    Categories
    • Founders
    • Startups
    • Technology
    • Profiles
    • Entrepreneurs
    • Leaders
    • Students
    • VC Funds
    About Us
    About Us

    Welcome to Times Featured, an AI-driven entrepreneurship growth engine that is transforming the future of work, bridging the digital divide and encouraging younger community inclusion in the 4th Industrial Revolution, and nurturing new market leaders.

    Empowering the growth of profiles, leaders, entrepreneurs businesses, and startups on international landscape.

    Asia-Middle East-Europe-North America-Australia-Africa

    Facebook LinkedIn WhatsApp
    Featured Picks

    Dynasty Warriors maker reveals unmade sequel and Star Wars dreams

    February 2, 2025

    16 Best Crossplay Games for Consoles and PC (2025): Xbox, PlayStation, Switch, Mobile

    February 21, 2025

    OpenAI says Chinese rivals using its work for their AI apps

    February 1, 2025
    Categories
    • Founders
    • Startups
    • Technology
    • Profiles
    • Entrepreneurs
    • Leaders
    • Students
    • VC Funds
    Copyright © 2024 Timesfeatured.com IP Limited. All Rights.
    • Privacy Policy
    • Disclaimer
    • Terms and Conditions
    • About us
    • Contact us

    Type above and press Enter to search. Press Esc to cancel.