Mistral.rs is a fast LLM inference platform that supports inference on a variety of devices, quantization, and easy integration via an OpenAI-compatible HTTP server and Python bindings. It supports the latest Llama and Phi models, along with X-LoRA and LoRA adapters. The project aims to be the fastest LLM inference platform available.
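Because the server is OpenAI-compatible, it can be queried with a plain chat-completions request. The sketch below builds such a payload and posts it; the base URL, port, and model name are assumptions for illustration, not values taken from the project.

```python
import json
import urllib.request


def build_chat_request(model: str, prompt: str) -> dict:
    # Standard OpenAI chat-completions payload shape, which an
    # OpenAI-compatible server is expected to accept.
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "max_tokens": 64,
    }


def query(base_url: str, payload: dict) -> dict:
    # POST the payload to the chat-completions endpoint.
    req = urllib.request.Request(
        f"{base_url}/v1/chat/completions",
        data=json.dumps(payload).encode(),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)


if __name__ == "__main__":
    # Hypothetical local address and model name; requires a running server.
    payload = build_chat_request("mistral", "Hello!")
    print(query("http://localhost:1234", payload))
```

The same payload works with any OpenAI-compatible client library by pointing its base URL at the local server.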
Quantized models from