SemanticScuttle - klotz.me » Tags: ai+large language models+deepseek

Tags: ai* + large language models* + deepseek*

0 bookmark(s) - Sort by: Date ↓ / Title /

A detailed comparison of the architectures of recent large language models (LLMs) including DeepSeek-V3, OLMo 2, Gemma 3, Mistral Small 3.1, Llama 4, Qwen3, SmolLM3, and Kimi 2, focusing on key design choices and their impact on performance and efficiency.

2025-07-19 Tags: llm, large language models, deep learning, ai, architecture, deepseek, olmo, gemma, mistral, llama, qwen, smollm, kimi, moe, attention, transformers by klotz

China is reportedly keeping DeepSeek under close watch

China appears to think homegrown AI startup DeepSeek could become a notable tech success story for the country. After DeepSeek's sudden rise to fame with the release of its open 'reasoning' model, R1, the company is now operating under new, tighter government-influenced restrictions.

2025-03-16 Tags: deepseek, china, ai, government restrictions, investor screening, llm by klotz

Scientists flock to DeepSeek: how they’re using the blockbuster AI model

Scientists are exploring the capabilities of the DeepSeek-R1 AI model, released by a Chinese firm. This open and cost-effective model performs comparably to industry leaders in solving mathematical and scientific problems. Researchers are leveraging its accessibility to create custom models for specific disciplines, although it still struggles with some tasks.

2025-01-30 Tags: deepseek, ai, machine learning, deepseek-r1, nature, llm, science by klotz

First / Previous / Next / Last / Page 1 of 0

SemanticScuttle - klotz.me

Tags: ai* + large language models* + deepseek*

Linked Tags

Related Tags