efficient method for fine-tuning LLM using LoRA and QLoRA, making it possible to train them even on consumer hardware
Details:
IS-LM 3B is StableLM 3B 4E1T(Licensed under CC BY-SA 4.0.) instruction tuned on DataForge Economics for 3 epochs with QLoRA(2305.14314).