Cutting-edge Chinese “reasoning” model rivals OpenAI o1—and it’s free to download

In contrast to typical LLMs, these SR fashions take further time to supply responses, and this further time usually will increase efficiency on duties involving math, physics, and science. And this newest open mannequin is popping heads for apparently shortly catching as much as OpenAI.

For instance, DeepSeek reports that R1 outperformed OpenAI’s o1 on a number of benchmarks and checks, together with AIME (a mathematical reasoning check), MATH-500 (a group of phrase issues), and SWE-bench Verified (a programming evaluation instrument). As we normally point out, AI benchmarks have to be taken with a grain of salt, and these outcomes have but to be independently verified.

A chart of DeepSeek R1 benchmark outcomes, created by DeepSeek.

Credit score:

DeepSeek

TechCrunch reports that three Chinese language labs—DeepSeek, Alibaba, and Moonshot AI’s Kimi—have now launched fashions they are saying match o1’s capabilities, with DeepSeek first previewing R1 in November.

However the brand new DeepSeek mannequin comes with a catch if run within the cloud-hosted version—being Chinese language in origin, R1 won’t generate responses about sure subjects like Tiananmen Sq. or Taiwan’s autonomy, because it should “embody core socialist values,” based on Chinese language Web rules. This filtering comes from a further moderation layer that is not a problem if the mannequin is run domestically outdoors of China.

Even with the potential censorship, Dean Ball, an AI researcher at George Mason College, wrote on X, “The spectacular efficiency of DeepSeek’s distilled fashions (smaller variations of r1) signifies that very succesful reasoners will proceed to proliferate extensively and be runnable on native {hardware}, removed from the eyes of any top-down management regime.”

Source link

Cutting-edge Chinese “reasoning” model rivals OpenAI o1—and it’s free to download

“In 10 years, all bets are off”—Anthropic CEO opposes decadelong freeze on state AI laws

Two certificate authorities booted from the good graces of Chrome

Meta and Yandex are de-anonymizing Android users’ web browsing identifiers

AI pioneer Yoshua Bengio launches LawZero, a nonprofit focused on safer AI; LawZero has raised $30M in donations, including from Skype co-founder Jaan Tallinn (Cristina Criddle/Financial Times)

Aerones, which makes robots that can service wind turbines in about half the time of humans, raised $62M led by Activate Capital and S2G Investments (Virginia Furness/Reuters)

Broadcom ends business with VMware’s lowest-tier channel partners

Spanish startup Voltrac raises €2 million to launch autonomous tractor platform for agriculture and frontline logistics

The Elon Musk and Donald Trump Breakup Has Started

“In 10 years, all bets are off”—Anthropic CEO opposes decadelong freeze on state AI laws

Set the Weights Down: BowFlex Adjustable Dumbbells Are Being Recalled. Here’s What to Know

Featured Picks

How to Use Spicy Chat

What is Palletizing? Steps, Types & Configurations

Gamification of Everything: Why Play Is the Future of Business

Cutting-edge Chinese “reasoning” model rivals OpenAI o1—and it’s free to download

Related Posts