SemanticScuttle - klotz.me » klotz: alibaba+reasoning

klotz: alibaba* + reasoning*

Qwen 3 offers a case study in how to effectively release a model

Alibaba’s Qwen team released the Qwen 3 model family, offering a range of sizes and capabilities. The article discusses the model's features, performance, and the well-coordinated release across the LLM ecosystem, highlighting the trend of better models running on the same hardware.

2025-04-29 Tags: llm, qwen, mlx, ollama, reasoning, qwen 3, alibaba, simon willison by klotz
DeepSeek-R1-beating perf in a 32B package? El Reg digs its claws into Alibaba's QwQ

Alibaba's Qwen team aims to find out with its latest release, QwQ. Despite having a fraction of DeepSeek R1's claimed 671 billion parameters, Alibaba touts its comparatively compact 32-billion 'reasoning' model as outperforming R1 in select math, coding, and function-calling benchmarks.

2025-03-17 Tags: alibaba, inference, llm, qwq, deepseek, r1, reasoning by klotz

First / Previous / Next / Last / Page 1 of 0