SemanticScuttle - klotz.me » Tags: machine learning+ai+reinforcement learning

Tags: machine learning* + ai* + reinforcement learning*

0 bookmark(s) - Sort by: Date ↓ / Title /

Apple study shows LLMs also benefit from the oldest productivity trick in the book

An Apple study shows that large language models (LLMs) can improve performance by using a checklist-based reinforcement learning scheme, similar to a simple productivity trick of checking one's work.

2025-08-26 Tags: apple, llm, ai, machine learning, productivity, rlcf, reinforcement learning, checklists, artificial intelligence by klotz
AI has grown beyond human knowledge, says Google's DeepMind unit

DeepMind researchers propose a new 'streams' approach to AI development, focusing on experiential learning and autonomous interaction with the world, moving beyond the limitations of current large language models and potentially surpassing human intelligence.

2025-04-18 Tags: ai, deepmind, reinforcement learning, streams, llm, alphazero, experiential learning, agents by klotz
ByteDance Research Releases DAPO: A Fully Open-Sourced LLM Reinforcement Learning System at Scale

ByteDance Research has released DAPO (Dynamic Sampling Policy Optimization), an open-source reinforcement learning system for LLMs, aiming to improve reasoning abilities and address reproducibility issues. DAPO includes innovations like Clip-Higher, Dynamic Sampling, Token-level Policy Gradient Loss, and Overlong Reward Shaping, achieving a score of 50 on the AIME 2024 benchmark with the Qwen2.5-32B model.

2025-03-21 Tags: llm, reinforcement learning, dapo, open source, bytedance, ai, machine learning, reasoning, aime, qwen2.5 by klotz
Control What You Can: Reinforcement Learning with Task Planning!

2020-04-09 Tags: machine learning, reinforcement learning, ai, planning by klotz
MIT AI: Reinforcement Learning, Planning, and Robotics (Leslie Kaelbling) - YouTube

2019-03-12 Tags: leslie kaelbling, mit, ai, reinforcement learning, planning, robotics, video, machine learning, youtube by klotz
Key papers in deep reinforcement learning

2018-11-14 Tags: papers, list, ai, machine learning, deep learning, reinforcement learning by klotz
Why AGI is Achievable in Five Years

2018-10-25 Tags: agi, ai, machine learning, deep learning, epistemology, carlos perez, reinforcement learning by klotz

Top of the page

First / Previous / Next / Last / Page 1 of 0

About - Propulsed by SemanticScuttle

SemanticScuttle - klotz.me

Tags: machine learning* + ai* + reinforcement learning*

Linked Tags

Related Tags