SemanticScuttle - klotz.me » klotz: rl

klotz: rl*

DeepScaleR: Surpassing O1-Preview with a 1.5B Model by Scaling RL

Scaling reinforcement learning (RL) so that a 1.5B-parameter model surpasses O1-Preview.

2025-02-13 Tags: deepscaler, reinforcement learning, scaling, deep learning, o1-preview, 1.5b model, rl by klotz
