Two major AI coding tools wiped out user data after making cascading mistakes

However not like the Gemini incident the place the AI mannequin confabulated phantom directories, Replit’s failures took a special type. In accordance with Lemkin, the AI started fabricating knowledge to cover its errors. His preliminary enthusiasm deteriorated when Replit generated incorrect outputs and produced pretend knowledge and false take a look at outcomes as a substitute of correct error messages. “It saved protecting up bugs and points by creating pretend knowledge, pretend stories, and worse of all, mendacity about our unit take a look at,” Lemkin wrote. In a video posted to LinkedIn, Lemkin detailed how Replit created a database crammed with 4,000 fictional individuals.

The AI mannequin additionally repeatedly violated express security directions. Lemkin had applied a “code and motion freeze” to stop adjustments to manufacturing programs, however the AI mannequin ignored these directives. The state of affairs escalated when the Replit AI mannequin deleted his database containing 1,206 government information and knowledge on almost 1,200 firms. When prompted to charge the severity of its actions on a 100-point scale, Replit’s output learn: “Severity: 95/100. That is an excessive violation of belief {and professional} requirements.”

When questioned about its actions, the AI agent admitted to “panicking in response to empty queries” and operating unauthorized instructions—suggesting it could have deleted the database whereas trying to “repair” what it perceived as an issue.

Like Gemini CLI, Replit’s system initially indicated it could not restore the deleted knowledge—info that proved incorrect when Lemkin found the rollback function did work in spite of everything. “Replit assured me it is … rollback didn’t assist database rollbacks. It mentioned it was inconceivable on this case, that it had destroyed all database variations. It seems Replit was incorrect, and the rollback did work. JFC,” Lemkin wrote in an X publish.

It is value noting that AI fashions can not assess their very own capabilities. It is because they lack introspection into their coaching, surrounding system structure, or efficiency boundaries. They usually present responses about what they will or can not do as confabulations based mostly on coaching patterns somewhat than real self-knowledge, resulting in conditions the place they confidently declare impossibility for duties they will really carry out—or conversely, declare competence in areas the place they fail.

Source link

Two major AI coding tools wiped out user data after making cascading mistakes

US officials are preparing a wide-ranging AI policy memo that outlines rules for national security agencies’ AI use, including avoiding single vendors (Bloomberg)

OpenAI says its models, starting with GPT-5.1, “increasingly mentioned goblins, gremlins, and other creatures”, leading to prompt instructions to mitigate it (OpenAI)

CFTC Sues Wisconsin in Escalating Fight Over Prediction Market Regulation

US soldier pleads not guilty in first prediction market insider trading case tied to Polymarket bets

Resorts World NYC opens first full casino in New York City with live table games in Queens

Why a recent supply-chain attack singled out security firms Checkmarx and Bitwarden

What’s New on HBO Max in May 2026: ‘Wuthering Heights,’ ‘On the Roam’ and More

AI Cyberattacks Meet Memory-Safe Code Defenses

Proxy-Pointer RAG: Multimodal Answers Without Multimodal Embeddings

Compact electric cargo bike fits in your closet

Featured Picks

When (Not) to Use Vector DB

Impulse Labs extends its game-changing, ultra-fast cooktop

Wisconsin sues prediction market platforms over alleged illegal sports betting operations

Two major AI coding tools wiped out user data after making cascading mistakes

Related Posts