SemanticScuttle - klotz.me » klotz: llm+lora

klotz: llm* + lora*

mistral-finetune - GitHub

A lightweight codebase that enables memory-efficient and performant finetuning of Mistral's models. It is based on LoRA, a training paradigm in which most weights are frozen and only 1-2% additional weights, in the form of low-rank matrix perturbations, are trained.

2024-06-06 Tags: github, mistral, lora, python, machine learning, fine tuning, llm by klotz
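
The low-rank idea in the description above can be sketched in plain Python. This is a minimal illustration, not code from mistral-finetune: the class `LoRALinear`, the helper `lora_fraction`, the `alpha / rank` scaling, and the example sizes (4096 wide, rank 32) are all assumptions chosen for the example.

```python
import random


def matvec(m, v):
    """Multiply a matrix (list of rows) by a vector (list of floats)."""
    return [sum(a * b for a, b in zip(row, v)) for row in m]


def lora_fraction(d_in, d_out, rank):
    """Trainable adapter weights as a fraction of the frozen weight matrix."""
    return rank * (d_in + d_out) / (d_in * d_out)


class LoRALinear:
    """Frozen weight W plus a trainable low-rank update B @ A (illustrative)."""

    def __init__(self, d_in, d_out, rank, alpha=1.0, seed=0):
        rng = random.Random(seed)
        # Frozen weight: random values standing in for pretrained weights.
        self.W = [[rng.gauss(0, 0.02) for _ in range(d_in)] for _ in range(d_out)]
        # Trainable factors. B starts at zero, so the adapter initially
        # leaves the layer's output unchanged.
        self.A = [[rng.gauss(0, 0.02) for _ in range(d_in)] for _ in range(rank)]
        self.B = [[0.0] * rank for _ in range(d_out)]
        self.scale = alpha / rank

    def forward(self, x):
        base = matvec(self.W, x)                    # frozen path
        delta = matvec(self.B, matvec(self.A, x))   # low-rank path
        return [b + self.scale * d for b, d in zip(base, delta)]


# A 4096 x 4096 layer with a rank-32 adapter adds about 1.6% extra weights.
print(lora_fraction(4096, 4096, 32))
```

Only `A` and `B` would receive gradient updates in training; `W` stays fixed, which is where the memory saving comes from.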

A Step-by-Step Guide to Representation Finetuning LLAMA3

"The paper introduces a technique called LoReFT (Low-rank Linear Subspace ReFT). Similar to LoRA (Low Rank Adaptation), it uses low-rank approximations to intervene on hidden representations. It shows that linear subspaces contain rich semantics that can be manipulated to steer model behaviors."

2024-05-26 Tags: linear subspace, lora, representation, fine tuning, reft, stanford, nlp, python, llm by klotz
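
As a rough sketch of what "intervening on hidden representations" in a low-rank subspace can look like, the snippet below edits a hidden vector `h` only inside the subspace spanned by the rows of `R`, using the form h + Rᵀ(Wh + b − Rh). The function name `loreft`, the tiny dimensions, and the use of identity rows for `R` are assumptions for illustration; this is not the guide's code or the reference library's API.

```python
def matvec(m, v):
    """Multiply a matrix (list of rows) by a vector (list of floats)."""
    return [sum(a * b for a, b in zip(row, v)) for row in m]


def loreft(h, R, W, b):
    """Return h + R^T (W h + b - R h).

    R is an r x d matrix assumed to have orthonormal rows; W (r x d) and
    b (length r) define the target values inside that r-dimensional subspace.
    """
    projected = matvec(R, h)                                   # R h
    target = [wh + bi for wh, bi in zip(matvec(W, h), b)]      # W h + b
    diff = [t - p for t, p in zip(target, projected)]
    R_T = [list(col) for col in zip(*R)]                       # d x r
    return [hi + di for hi, di in zip(h, matvec(R_T, diff))]


# Hypothetical example: d = 4, r = 2, R picks out the first two coordinates.
R = [[1.0, 0.0, 0.0, 0.0],
     [0.0, 1.0, 0.0, 0.0]]
W = [[0.5, 0.0, 0.0, 0.0],
     [0.0, 0.0, 1.0, 0.0]]
b = [0.1, -0.2]
h = [2.0, 3.0, 4.0, 5.0]

edited = loreft(h, R, W, b)
print(edited)
```

With orthonormal rows in `R`, the edited vector satisfies R·edited = Wh + b, while the components of `h` outside the subspace are left as they were.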

MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning

This paper proposes MoRA, a new method for parameter-efficient fine-tuning of large language models (LLMs). MoRA employs a square matrix to achieve high-rank updating while keeping the same number of trainable parameters as LoRA. The paper suggests that low-rank updating, as implemented in LoRA, may limit the ability of LLMs to effectively learn and memorize new knowledge. MoRA outperforms LoRA on memory-intensive tasks and achieves comparable performance on other tasks.

2024-05-26 Tags: llm, parameter-efficient fine-tuning, mora, high-rank updating, lora, instruction tuning, mathematical reasoning, continual pretraining, memory, pretraining, sebastian raschka, microsoft research by klotz
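
The "same number of trainable parameters" claim in the summary above can be checked with a little arithmetic. The sketch below assumes the square matrix's side is the largest integer whose square fits within LoRA's parameter budget; the function names and the 4096-wide, rank-8 example are illustrative assumptions, and the way MoRA maps inputs into and out of the square matrix is not shown.

```python
import math


def lora_params(d_out, d_in, rank):
    """Trainable parameters in a LoRA adapter: A is rank x d_in, B is d_out x rank."""
    return rank * (d_in + d_out)


def square_side_for_budget(d_out, d_in, rank):
    """Largest side s such that an s x s matrix fits in the LoRA budget."""
    return math.isqrt(lora_params(d_out, d_in, rank))


budget = lora_params(4096, 4096, 8)
side = square_side_for_budget(4096, 4096, 8)

# Same budget, but the square matrix can reach rank `side` instead of 8.
print(budget, side, side * side)
```

For this example the budget is 65536 parameters, which a 256 x 256 square matrix uses exactly, so the update can have rank up to 256 where the LoRA adapter is limited to rank 8.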

Fine-Tuning LLM Models Course | freeCodeCamp.org

This article announces a comprehensive course on fine-tuning large language models (LLMs) offered on the freeCodeCamp.org YouTube channel. The course, developed by Krish Naik, covers topics such as QLoRA, LoRA, quantization with Llama 2, gradient, and Google's Gemma model, among others. The course aims to help learners deepen their understanding of machine learning and artificial intelligence.

2024-05-24 Tags: freecodecamp, course, fine-tuning, llm, qlora, lora, quantization, llama2 by klotz

14 Free Large Language Models Fine-Tuning Notebooks

- 14 free Colab notebooks providing hands-on experience in fine-tuning large language models (LLMs).
- The notebooks range from efficient training methods such as LoRA, and Hugging Face tooling, to specific models such as Llama, Guanaco, and Falcon.
- They also include notebooks such as PEFT Finetune, Bloom-560m-tagger, and Meta_OPT-6–1b_Model.