Tags: code interpreter*

  1. Details the development and release of DeepCoder-14B-Preview, a 14B-parameter code reasoning model trained with reinforcement learning to reach performance comparable to o3-mini, along with the dataset, code, and system optimizations used to create it.

  2. A technical blog post on setting up JupyterLab and integrating it with OpenWebUI's code interpreter feature, so that the LLM can generate and execute code for tasks such as exploratory data analysis.

  3. This article details a method for training large language models (LLMs) for code generation using a secure, local WebAssembly-based code interpreter and reinforcement learning with Group Relative Policy Optimization (GRPO). It covers the setup, training process, evaluation, and potential next steps.

SemanticScuttle - klotz.me: tagged with "code interpreter"