Tags: alibaba* + simon willison*

2 bookmark(s)

  1. Alibaba's Qwen 2.5 LLM now supports input contexts of up to 1 million tokens using Dual Chunk Attention. Two models are released on Hugging Face, and running them at the full context length requires substantial VRAM; the post also discusses the difficulties of deploying quantized GGUF builds under tight system resource constraints (see the first sketch below).

  2. Simon Willison reviews the new Qwen2.5-Coder-32B, an openly licensed LLM from Alibaba, which performs well on a range of coding benchmarks and can run on personal hardware such as his MacBook Pro M2 (see the second sketch below).
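For item 1, a minimal sketch of loading a quantized GGUF build of one of the 1M-context Qwen2.5 models with llama-cpp-python. The local file name and the reduced 128K context window are assumptions for illustration; the full 1M-token window needs far more memory than most single machines provide.

```python
from llama_cpp import Llama

# Hypothetical local quantized GGUF of a 1M-context Qwen2.5 instruct model.
llm = Llama(
    model_path="qwen2.5-7b-instruct-1m-q4_k_m.gguf",
    n_ctx=131072,      # well below the 1M maximum; the full window needs far more RAM/VRAM
    n_gpu_layers=-1,   # offload every layer to the GPU if one is available
)

resp = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Summarize this long document: ..."}],
    max_tokens=256,
)
print(resp["choices"][0]["message"]["content"])
```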
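For item 2, a sketch of running Qwen2.5-Coder-32B locally through Ollama's Python client. It assumes Ollama is installed and `ollama pull qwen2.5-coder:32b` has already been run; this is one way to run the model on a laptop-class machine, not necessarily Willison's exact setup.

```python
import ollama

# Assumes the qwen2.5-coder:32b model has already been pulled into Ollama.
response = ollama.chat(
    model="qwen2.5-coder:32b",
    messages=[
        {"role": "user",
         "content": "Write a Python function that parses ISO 8601 timestamps."}
    ],
)
print(response["message"]["content"])
```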


