Sixteen Claude AI agents working together created a new C compiler

Amid a push toward AI agents, with both Anthropic and OpenAI shipping multi-agent tools this week, Anthropic is more than ready to show off some of its more daring AI coding experiments. But as usual with claims of AI-related achievement, you’ll find some key caveats ahead.

On Thursday, Anthropic researcher Nicholas Carlini published a blog post describing how he set 16 instances of the company’s Claude Opus 4.6 AI model loose on a shared codebase with minimal supervision, tasking them with building a C compiler from scratch.

Over two weeks and nearly 2,000 Claude Code sessions costing about $20,000 in API fees, the AI model agents reportedly produced a 100,000-line Rust-based compiler capable of building a bootable Linux 6.9 kernel on x86, ARM, and RISC-V architectures.

Carlini, a research scientist on Anthropic’s Safeguards team who previously spent seven years at Google Brain and DeepMind, used a new feature launched with Claude Opus 4.6 called “agent teams.” In practice, each Claude instance ran inside its own Docker container, cloning a shared Git repository, claiming tasks by writing lock files, then pushing completed code back upstream. No orchestration agent directed traffic. Each instance independently identified whatever problem seemed most obvious to work on next and started solving it. When merge conflicts arose, the AI model instances resolved them on their own.
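The report does not spell out how the lock files work, so the following is only a rough sketch of the general idea of claiming a task by creating a lock file. It assumes a local directory stands in for the shared repository, and the names `try_claim`, `release`, and the `.lock` suffix are hypothetical, not taken from Carlini's setup. In the real system, coordination would go through Git pushes rather than a local filesystem.

```python
import os


def try_claim(lock_dir: str, task_id: str, agent_id: str) -> bool:
    """Try to claim a task by creating its lock file (hypothetical helper).

    Returns True if this agent created the lock, False if it already existed.
    """
    path = os.path.join(lock_dir, f"{task_id}.lock")
    try:
        # O_CREAT | O_EXCL makes creation fail if the file already exists,
        # so at most one claimant can succeed for a given task.
        fd = os.open(path, os.O_CREAT | os.O_EXCL | os.O_WRONLY)
    except FileExistsError:
        return False
    with os.fdopen(fd, "w") as lock_file:
        # Record who holds the task so other agents can see the owner.
        lock_file.write(agent_id)
    return True


def release(lock_dir: str, task_id: str) -> None:
    """Release a task by deleting its lock file (hypothetical helper)."""
    os.remove(os.path.join(lock_dir, f"{task_id}.lock"))
```

With a scheme like this, a second agent that tries to claim the same task simply gets `False` back and moves on to another problem, which is one way to get by without a central orchestrator.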

The resulting compiler, which Anthropic has released on GitHub, can compile a range of major open source projects, including PostgreSQL, SQLite, Redis, FFmpeg, and QEMU. It achieved a 99 percent pass rate on the GCC torture test suite and, in what Carlini called “the developer’s ultimate litmus test,” compiled and ran Doom.

It’s worth noting that a C compiler is a near-ideal task for semi-autonomous AI model coding: The specification is decades old and well-defined, comprehensive test suites already exist, and there’s a known-good reference compiler to check against. Most real-world software projects have none of these advantages. The hard part of most development isn’t writing code that passes tests; it’s figuring out what the tests should be in the first place.
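To illustrate why a known-good reference helps, here is a minimal sketch of differential testing: run the same inputs through a candidate and a trusted reference, and flag any disagreement. Everything here is a toy assumption for illustration. The function names are hypothetical, and the "compilers" are stand-in Python functions modeling C integer division, not the actual test harness used in the project.

```python
from typing import Any, Callable, Iterable, List, Tuple


def differential_test(
    candidate: Callable[[Any], Any],
    reference: Callable[[Any], Any],
    inputs: Iterable[Any],
) -> List[Tuple[Any, Any, Any]]:
    """Return (input, expected, actual) for every input where the two disagree."""
    mismatches = []
    for x in inputs:
        expected = reference(x)  # the trusted answer
        actual = candidate(x)    # the implementation under test
        if actual != expected:
            mismatches.append((x, expected, actual))
    return mismatches


def c_style_div(pair: Tuple[int, int]) -> int:
    """Toy reference: C integer division truncates toward zero (small ints only)."""
    a, b = pair
    return int(a / b)


def buggy_div(pair: Tuple[int, int]) -> int:
    """Toy candidate with a bug: Python's // rounds toward negative infinity."""
    a, b = pair
    return a // b


if __name__ == "__main__":
    print(differential_test(buggy_div, c_style_div, [(7, 2), (-7, 2)]))
```

The point of the sketch is that the reference supplies the expected answer automatically, so nobody has to decide by hand what the correct output is. Projects without such a reference lack that shortcut.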
