This article details a method for training large language models (LLMs) for code generation with reinforcement learning, using Group Relative Policy Optimization (GRPO) and a secure, local WebAssembly-based code interpreter to score the generated code. It covers the setup, training process, evaluation, and potential next steps.
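As a rough illustration of that loop, the sketch below wires an execution-based reward into TRL's GRPOTrainer. The `run_in_sandbox` helper is a trivial stand-in for the article's WebAssembly interpreter, and the model and dataset ids are placeholders rather than the article's exact setup.

```python
from datasets import load_dataset
from trl import GRPOConfig, GRPOTrainer

def run_in_sandbox(code: str) -> bool:
    # Trivial stand-in: only checks that the completion parses as Python.
    # The article replaces this with execution inside a secure, local
    # WebAssembly interpreter.
    try:
        compile(code, "<generated>", "exec")
        return True
    except SyntaxError:
        return False

def correctness_reward(completions, **kwargs):
    # GRPO compares rewards within each group of sampled completions,
    # so a simple 1.0 / 0.0 pass-fail signal is enough to start with.
    return [1.0 if run_in_sandbox(c) else 0.0 for c in completions]

dataset = load_dataset("trl-lib/tldr", split="train")  # placeholder prompt dataset

trainer = GRPOTrainer(
    model="Qwen/Qwen2.5-0.5B-Instruct",  # placeholder model id
    reward_funcs=correctness_reward,
    args=GRPOConfig(output_dir="grpo-code-interpreter"),
    train_dataset=dataset,
)
trainer.train()
```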
This tutorial demonstrates how to fine-tune the Llama-2 7B Chat model for Python code generation on the Alpaca-14k dataset, using QLoRA, gradient checkpointing, and TRL's SFTTrainer.
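A minimal sketch of that QLoRA recipe follows, assuming recent versions of transformers, peft, and trl; the dataset id stands in for the tutorial's Alpaca-14k data, and the hyperparameters are illustrative, not the tutorial's exact values.

```python
import torch
from datasets import load_dataset
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig
from trl import SFTConfig, SFTTrainer

# 4-bit NF4 quantization: the "Q" in QLoRA.
bnb = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)
model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-chat-hf",
    quantization_config=bnb,
    device_map="auto",
)

# Stand-in for the tutorial's Alpaca-14k dataset (pre-formatted "text" column).
dataset = load_dataset("tatsu-lab/alpaca", split="train")

trainer = SFTTrainer(
    model=model,
    train_dataset=dataset,
    peft_config=LoraConfig(r=16, lora_alpha=32, task_type="CAUSAL_LM"),
    args=SFTConfig(
        output_dir="llama2-7b-code-qlora",
        dataset_text_field="text",
        gradient_checkpointing=True,   # recompute activations to save memory
        per_device_train_batch_size=1,
        gradient_accumulation_steps=8,
    ),
)
trainer.train()
```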
The article by Krishan Walia provides a beginner-friendly guide to fine-tuning the DeepSeek R1 model in Python, showing how developers can turn a general-purpose reasoning model into a specialized, domain-specific language model.
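In the same spirit, here is a hedged sketch of a parameter-efficient LoRA setup on a small DeepSeek-R1 distilled checkpoint; the model id and LoRA hyperparameters are assumptions chosen for illustration, not the article's exact recipe.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, get_peft_model

# Small distilled R1 variant, chosen here so the example fits on one GPU.
model_id = "deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

# Attach low-rank adapters so only a small fraction of weights is trained,
# which is what makes domain specialization cheap on modest hardware.
model = get_peft_model(
    model, LoraConfig(r=8, lora_alpha=16, task_type="CAUSAL_LM")
)
model.print_trainable_parameters()
# From here, train on domain-specific data with trl's SFTTrainer or
# transformers' Trainer to specialize the model.
```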