New Claude 4 AI model refactored code for 7 hours straight

On Thursday, Anthropic launched Claude Opus 4 and Claude Sonnet 4, marking the company’s return to larger model releases after primarily focusing on mid-range Sonnet variants since June of last year. The new models represent what the company calls its most capable coding models yet, with Opus 4 designed for complex, long-running tasks that can operate autonomously for hours.

Alex Albert, Anthropic’s head of Claude Relations, told Ars Technica that the company chose to revive the Opus line because of growing demand for agentic AI applications. “Across all the companies out there that are building things, there’s a really large wave of these agentic applications arising, and a very high demand and premium being placed on intelligence,” Albert said. “I believe Opus is going to fit that groove perfectly.”

Before we go further, a brief refresher on Claude’s three AI model “size” names (introduced in March 2024) is probably warranted. Haiku, Sonnet, and Opus offer a tradeoff between price (in the API), speed, and capability.

Haiku models are the smallest, least expensive to run, and least capable in terms of what you might call “context depth” (considering conceptual relationships in the prompt) and encoded knowledge. Owing to the small size in parameter count, Haiku models retain fewer concrete facts and thus tend to confabulate more frequently (plausibly answering questions based on lack of data) than larger models, but they are much faster at basic tasks than larger models. Sonnet is traditionally a mid-range model that hits a balance between cost and capability, and Opus models have always been the largest and slowest to run. However, Opus models process context more deeply and are hypothetically better suited to running deep logical tasks.

A screenshot of the Claude web interface with Opus 4 and Sonnet 4 options shown.

Credit: Anthropic

There is no Claude 4 Haiku just yet, but the new Sonnet and Opus models can reportedly handle tasks that previous versions could not. In our interview with Albert, he described testing scenarios where Opus 4 worked coherently for up to 24 hours on tasks like playing Pokémon, while coding refactoring tasks in Claude Code ran for seven hours without interruption. Earlier Claude models typically lasted only one to two hours before losing coherence, Albert said, meaning that the models could only produce useful self-referencing outputs for that long before beginning to output too many errors.
