SemanticScuttle - klotz.me » Tags: llama.cpp

Show HN: Grammar Generator App for Llama.cpp | Hacker News This bookmark is certified by an admin user.

2024-02-13 Tags: gbnf, llama.cpp, text extraction, functions, json, github by klotz

Example prompt using the new grammar-guided generation functionality from llama.cpp This bookmark is certified by an admin user.

2024-02-13 Tags: gbnf, llama.cpp, text extraction, functions, json, github by klotz

localllm/llm-tool at main · GoogleCloudPlatform/localllm This bookmark is certified by an admin user.

llm-tool provides a command-line utility for running large language models locally. It includes scripts for pulling models from the internet, starting them, and managing them using various commands such as 'run', 'ps', 'kill', 'rm', and 'pull'. Additionally, it offers a Python script named 'querylocal.py' for querying these models. The repository also come

2024-02-08 Tags: llm, localllama, self-hosted, google, gcp, foss, llama.cpp, github by klotz

GoogleCloudPlatform/localllm: Run LLMs locally on Cloud Workstations This bookmark is certified by an admin user.

- create a custom base image for a Cloud Workstation environment using a Dockerfile
. Uses:

Quantized models from

2024-02-08 Tags: llm, google, locallama, github, foss, gguf, huggingface, llama.cpp by klotz

LLM Large Language Model Toolkit: Google This bookmark is certified by an admin user.

The "LLM" toolkit offers a versatile command-line utility and Python library that allows users to work efficiently with large language models. Users can execute prompts directly from their terminals, store the outcomes in SQLite databases, generate embeddings, and perform various other tasks. In this extensive tutorial, topics covered include setup, usage, OpenAI models, alternative models, embeddings, plugins, model aliases, Python APIs, prompt templates, logging, related tools, CLI references, contributing, and change logs.

2024-02-08 Tags: llm, cli, google, llama.cpp by klotz

The Most Simple Way to Set Up ChatGPT Locally This bookmark is certified by an admin user.

2024-01-18 Tags: llm, quantization, llama.cpp, self-hosted, tutorial by klotz

Democratizing LLMs: 4-bit Quantization for Optimal LLM Inference This bookmark is certified by an admin user.

A deep dive into model quantization with GGUF and llama.cpp and model evaluation with LlamaIndex

2024-01-15 Tags: llm, gguf, georgi gerganov, llama.cpp, llamaindex, huggingface, rag by klotz

Llama2 Models - Hugging Face This bookmark is certified by an admin user.

2023-07-19 Tags: llama.cpp, llama, llama2, facebook, meta, hugging face, thebloke, models, llm by klotz

A direct comparison between llama.cpp, AutoGPTQ, ExLlama, and transformers perplexities - LLM blog This bookmark is certified by an admin user.

2023-07-16 Tags: llm, comparison, benchmark, perplexity, llama.cpp, exllama, autogptq, linux by klotz

jlonge4/local_llama: This repo is to showcase how you can run a model locally and offline, free of OpenAI dependencies. This bookmark is certified by an admin user.

2023-06-25 Tags: llama.cpp, llama, pdf, chat, self-hosted, github, jlonge, hacks by klotz

SemanticScuttle - klotz.me

Tags: llama.cpp*

Linked Tags

Related Tags