Tags: small language models* + llm*


  1. The article presents rStar-Math, a method demonstrating that small language models (SLMs) can rival or surpass the math reasoning capability of much larger models such as OpenAI's o1, without distillation from superior models. rStar-Math employs Monte Carlo Tree Search (MCTS) for 'deep thinking': a math policy SLM searches over reasoning steps, guided by an SLM-based process reward model. It introduces three innovations: a code-augmented CoT data synthesis method for training the policy SLM, a process reward model training method that avoids naive step-level score annotation (yielding a process preference model), and a self-evolution recipe in which the policy SLM and process preference model are iteratively improved. Through rounds of self-evolution with millions of synthesized solutions for 747k math problems, rStar-Math achieves state-of-the-art math reasoning, with large gains on benchmarks such as MATH and AIME. (A toy MCTS sketch follows this list.)
  2. This article explores NuExtract, a family of small language models (SLMs) for extracting structured data from text. The author, Fabio Matricardi, uses NuExtract to process candidate CVs into a database and highlights its benefits for privacy protection and for running on modest hardware. (A usage sketch follows this list.)
  3. Explores recent trends in LLM research, including multi-modal LLMs, open-source LLMs, domain-specific LLMs, LLM agents, smaller LLMs, and non-Transformer LLMs, with examples such as OpenAI's Sora, LLM360, BioGPT, StarCoder, and Mamba.
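
To make the rStar-Math entry concrete, here is a minimal, self-contained sketch of MCTS over reasoning steps, where a policy model proposes candidate next steps and a process reward model scores partial solutions. The `propose_steps` and `ppm_score` functions are hypothetical placeholders standing in for the paper's trained SLMs; this illustrates only the search loop, not the actual method.

```python
import math
import random

# Hypothetical stand-ins for the paper's models: a policy SLM that
# proposes candidate next reasoning steps, and a process preference
# model (PPM) that scores a partial solution's promise in [0, 1].
def propose_steps(partial_solution, k=3):
    return [f"{partial_solution} -> step{random.randint(0, 99)}" for _ in range(k)]

def ppm_score(partial_solution):
    return random.random()  # placeholder; a real PPM is a trained SLM

class Node:
    def __init__(self, state, parent=None):
        self.state = state      # reasoning trace so far
        self.parent = parent
        self.children = []
        self.visits = 0
        self.value = 0.0        # accumulated PPM reward

    def uct(self, c=1.4):
        if self.visits == 0:
            return float("inf")
        return self.value / self.visits + c * math.sqrt(
            math.log(self.parent.visits) / self.visits)

def mcts(root_state, iterations=100, max_depth=8):
    root = Node(root_state)
    for _ in range(iterations):
        # Selection: descend via UCT until reaching a leaf.
        node, depth = root, 0
        while node.children:
            node = max(node.children, key=Node.uct)
            depth += 1
        # Expansion: the policy proposes candidate next steps.
        if depth < max_depth:
            for step in propose_steps(node.state):
                node.children.append(Node(step, parent=node))
            node = random.choice(node.children)
        # Evaluation: score the partial trace with the PPM
        # (instead of rolling out to a terminal answer).
        reward = ppm_score(node.state)
        # Backpropagation: propagate the reward up to the root.
        while node is not None:
            node.visits += 1
            node.value += reward
            node = node.parent
    # Return the most-visited first step.
    return max(root.children, key=lambda n: n.visits).state

print(mcts("problem"))
```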
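And for the NuExtract entry, a minimal sketch of structured extraction from a CV, assuming the `numind/NuExtract` checkpoint and the `<|input|>`/`<|output|>` template-prompt format described on its Hugging Face model card; the field names in the JSON template are illustrative, so verify both against the current model card.

```python
import json
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_ID = "numind/NuExtract"  # assumed checkpoint name

tokenizer = AutoTokenizer.from_pretrained(MODEL_ID, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(MODEL_ID, trust_remote_code=True)

# A JSON template names the fields to pull from the text;
# empty values mark the slots the model should fill.
template = json.dumps({
    "name": "",
    "email": "",
    "skills": [],
    "last_position": ""
}, indent=4)

cv_text = (
    "Jane Doe, jane@example.com. Senior data engineer at Acme Corp "
    "(2019-2024). Skills: Python, Spark, SQL."
)

# Assumed prompt layout: template and text wrapped in NuExtract's markers.
prompt = f"<|input|>\n### Template:\n{template}\n### Text:\n{cv_text}\n<|output|>\n"

inputs = tokenizer(prompt, return_tensors="pt")
output = model.generate(**inputs, max_new_tokens=200)
decoded = tokenizer.decode(output[0], skip_special_tokens=True)
# Everything after the output marker is the filled-in JSON.
print(decoded.split("<|output|>")[-1].strip())
```

Because extraction runs entirely on a locally loaded SLM, no CV data leaves the machine, which is the privacy benefit the article emphasizes.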
