SemanticScuttle - klotz.me » klotz: deepseek+reasoning

klotz: deepseek* + reasoning*

DeepSeek-R1-beating perf in a 32B package? El Reg digs its claws into Alibaba's QwQ

Alibaba's Qwen team aims to find out with its latest release, QwQ. Despite having a fraction of DeepSeek R1's claimed 671 billion parameters, Alibaba touts its comparatively compact 32-billion 'reasoning' model as outperforming R1 in select math, coding, and function-calling benchmarks.

2025-03-17 Tags: alibaba, inference, llm, qwq, deepseek, r1, reasoning by klotz

Researchers created an open rival to OpenAI’s o1 ‘reasoning’ model for under $50

AI researchers at Stanford and the University of Washington trained an AI 'reasoning' model named s1 for under $50 using cloud compute credits. The model, which performs similarly to OpenAI’s o1 and DeepSeek’s R1, is available on GitHub. It was developed using distillation from Google’s Gemini 2.0 Flash Thinking Experimental model and demonstrates strong performance on benchmarks.

2025-02-06 Tags: reasoning, llm, openai, deepseek, distillation, stanford, university of washington, google, gemini 2.0, s1 by klotz

First / Previous / Next / Last / Page 1 of 0

SemanticScuttle - klotz.me

klotz: deepseek* + reasoning*

Linked Tags

Related Tags