Tags: llama* + github*

0 bookmark(s) - Sort by: Date ↓ / Title /

  1. Transformer Lab is an open-source application for advanced LLM engineering, allowing users to interact, train, fine-tune, and evaluate large language models on their own computer. It supports various models, hardware, and inference engines and includes features like RAG, dataset building, and a REST API.

    2025-04-11 Tags: , , , , , , , , by klotz
  2. 2023-06-25 Tags: , , , , , , , by klotz
  3. llama-cpp-python offers a web server which aims to act as a drop-in replacement for the OpenAI API. This allows you to use llama.cpp compatible models with any OpenAI compatible client (language libraries, services, etc).

    2023-06-09 Tags: , , , , , , by klotz
  4. 2023-06-05 Tags: , , , , by klotz
  5. obtain the original LLaMA model weights and place them in ./models

    ls ./models 65B 30B 13B 7B tokenizer_checklist.chk tokenizer.model

    install Python dependencies

    python3 -m pip install -r requirements.txt

    convert the 7B model to ggml FP16 format

    python3 convert.py models/7B/

    quantize the model to 4-bits (using q4_0 method)

    ./quantize ./models/7B/ggml-model-f16.bin ./models/7B/ggml-model-q4_0.bin q4_0

    run the inference

    ./main -m ./models/7B/ggml-model-q4_0.bin -n 128

    2023-06-05 Tags: , , , , by klotz

Top of the page

First / Previous / Next / Last / Page 1 of 0 SemanticScuttle - klotz.me: tagged with "llama+github"

About - Propulsed by SemanticScuttle