klotz: llama-2*

  1. A comparison of frameworks, models, and costs for deploying Llama models locally and privately.

    - Four tools were analyzed: HuggingFace, vLLM, Ollama, and llama.cpp.
    - HuggingFace has a wide range of models but struggles with quantized models.
    - vLLM is experimental and lacks full support for quantized models.
    - Ollama is user-friendly but has some customization limitations.
    - llama.cpp is preferred for its performance and customization options.
    - The analysis focused on llama.cpp and Ollama, comparing speed and power consumption across different quantizations.
    2024-11-03 by klotz
  2. "This is one of the best 13B models I've tested. (for programming, math, logic, etc) speechless-llama2-hermes-orca-platypus-wizardlm-13b"
    2023-10-02 by klotz
  3. 2023-09-01 by klotz
  4. 2023-08-28 by klotz
  5. 2023-08-26 by klotz
  6. 2023-08-25 by klotz
  7. 2023-08-25 by klotz
  8. 2023-08-19 by klotz
  9. 2023-08-03 by klotz
  10. 2023-07-25 by klotz
