SemanticScuttle - klotz.me » Tags: lora+fine tuning

Tags: lora* + fine tuning*

0 bookmark(s) - Sort by: Date / Title ↑ /

A Step-by-Step Guide to Representation Finetuning LLAMA3

"The paper introduces a technique called LoReFT (Low-rank Linear Subspace ReFT). Similar to LoRA (Low Rank Adaptation), it uses low-rank approximations to intervene on hidden representations. It shows that linear subspaces contain rich semantics that can be manipulated to steer model behaviors."

2024-05-26 Tags: linear subspace, lora, representation, fine tuning, reft, stanford, nlp, python, llm by klotz
Easily Train a Specialized LLM: PEFT, LoRA, QLoRA, LLaMA-Adapter, and More

2023-12-10 Tags: llm, lora, qlora, peft, fine tuning by klotz
Finetune LLMs on your own consumer hardware using tools from PyTorch and Hugging Face ecosystem | PyTorch

efficient method for fine-tuning LLM using LoRA and QLoRA, making it possible to train them even on consumer hardware

2024-01-12 Tags: llm, fine tuning, qlora, lora, peft, pytorch, hugging face, fine-tuning, llms by klotz

First / Previous / Next / Last / Page 1 of 0