SemanticScuttle - klotz.me » Tags: fine tuning+lora+llm+python+stanford

A Step-by-Step Guide to Representation Finetuning LLAMA3

"The paper introduces a technique called LoReFT (Low-rank Linear Subspace ReFT). Similar to LoRA (Low Rank Adaptation), it uses low-rank approximations to intervene on hidden representations. It shows that linear subspaces contain rich semantics that can be manipulated to steer model behaviors."

2024-05-26 Tags: linear subspace, lora, representation, fine tuning, reft, stanford, nlp, python, llm by klotz

SemanticScuttle - klotz.me

Tags: fine tuning* + lora* + llm* + python* + stanford*

Linked Tags

Related Tags