SemanticScuttle - klotz.me » klotz: lora+llm+nlp

klotz: lora* + llm* + nlp*

Fine-Tune Llama 3.1 Ultra-Efficiently with Unsloth

This article provides a comprehensive guide on fine-tuning the Llama 3.1 language model using Unsloth for efficient parameter-efficient training. It covers concepts like supervised fine-tuning, LoRA, QLoRA, and practical steps for training on a high-quality dataset.

2024-07-30 Tags: llama 3.1, fine-tuning, unsloth, lora, qlora, parameter-efficient training, llm, nlp by klotz
A Step-by-Step Guide to Representation Finetuning LLAMA3

"The paper introduces a technique called LoReFT (Low-rank Linear Subspace ReFT). Similar to LoRA (Low Rank Adaptation), it uses low-rank approximations to intervene on hidden representations. It shows that linear subspaces contain rich semantics that can be manipulated to steer model behaviors."

2024-05-26 Tags: linear subspace, lora, representation, fine tuning, reft, stanford, nlp, python, llm by klotz

First / Previous / Next / Last / Page 1 of 0