SemanticScuttle - klotz.me » klotz: huggingface

klotz: huggingface*

A comparison of frameworks, models, and costs for deploying Llama models locally and privately.

- Four tools were analyzed: HuggingFace, vLLM, Ollama, and llama.cpp.
- HuggingFace has a wide range of models but struggles with quantized models.
- vLLM is experimental and lacks full support for quantized models.
- Ollama is user-friendly but has some customization limitations.
- llama.cpp is preferred for its performance and customization options.
- The analysis focused on llama.cpp and Ollama, comparing speed and power consumption across different quantizations.

2024-11-03 Tags: llm, self-hosted, huggingface, vllm, ollama, llama-2 by klotz

Microsoft AI Releases OmniParser Model on HuggingFace: A Compact Screen Parsing Module that can Convert UI Screenshots into Structured Elements

Microsoft has released the OmniParser model on HuggingFace, a vision-based tool designed to parse UI screenshots into structured elements, enhancing intelligent GUI automation across platforms without relying on additional contextual data.

2024-10-26 Tags: microsoft, omniparser, huggingface, gui, automation, vision, user interfaces, llm by klotz

Ollama just made it easier to use AI on your laptop — with no internet required

Ollama now supports HuggingFace GGUF models, making it easier to run AI models locally without an internet connection. The GGUF format makes it practical to run these models on modest consumer hardware.
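As a minimal sketch of what this looks like in practice: Ollama accepts a Hub reference of the form `hf.co/{username}/{repository}`, optionally followed by `:{quantization}`. The helper name `ollama_hf_command`, the example repository, and the `Q4_K_M` tag below are illustrative assumptions, not taken from the bookmarked article; the chosen quantization must actually exist in the repository.

```python
def ollama_hf_command(repo_id, quant=None):
    """Build the argv for `ollama run` against a GGUF repo on the Hugging Face Hub.

    repo_id is "{username}/{repository}"; quant is an optional quantization
    tag (hypothetical example: "Q4_K_M") that must exist in that repo.
    """
    ref = f"hf.co/{repo_id}"
    if quant:
        ref += f":{quant}"
    return ["ollama", "run", ref]


# Example repository chosen for illustration only.
cmd = ollama_hf_command("bartowski/Llama-3.2-1B-Instruct-GGUF", quant="Q4_K_M")
print(" ".join(cmd))
```

The printed command can be pasted into a terminal, or the list passed to `subprocess.run` on a machine where Ollama is installed.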

2024-10-24 Tags: ollama, huggingface, gguf, llm, localllama by klotz

The Impact of Hyperparameters on Large Language Model Inference Performance: An Evaluation of vLLM and HuggingFace Pipelines

This paper analyzes the performance of 20 large language models (LLMs) using two inference libraries: vLLM and HuggingFace Pipelines. The study investigates how hyperparameters influence inference performance and reveals that throughput landscapes are irregular, highlighting the importance of hyperparameter optimization.
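A rough sketch of the kind of sweep such a study implies: measure throughput (tokens per second) for every combination in a small hyperparameter grid, then compare. Everything here is an assumption for illustration; `fake_generate` is a stand-in for a real vLLM or HuggingFace pipeline call, and the two hyperparameters (batch size and maximum new tokens) are not necessarily the grid used in the paper.

```python
import itertools
import time


def measure_throughput(generate, batch_size, max_new_tokens, repeats=3):
    """Return the best observed tokens/second over several timed runs."""
    best = 0.0
    for _ in range(repeats):
        start = time.perf_counter()
        n_tokens = generate(batch_size, max_new_tokens)
        elapsed = max(time.perf_counter() - start, 1e-9)  # avoid division by zero
        best = max(best, n_tokens / elapsed)
    return best


def sweep(generate, batch_sizes, token_limits):
    """Measure throughput for every (batch_size, max_new_tokens) pair."""
    return {
        (bs, mt): measure_throughput(generate, bs, mt)
        for bs, mt in itertools.product(batch_sizes, token_limits)
    }


def fake_generate(batch_size, max_new_tokens):
    """Hypothetical stand-in for an inference call; returns tokens produced."""
    total = batch_size * max_new_tokens
    sum(range(total * 100))  # burn some CPU so the timing is non-trivial
    return total


results = sweep(fake_generate, batch_sizes=[1, 2, 4], token_limits=[16, 64])
best_config = max(results, key=results.get)
print("best (batch_size, max_new_tokens):", best_config)
```

Because the landscape is reported to be irregular, an exhaustive grid like this is safer than assuming throughput changes smoothly with any one setting, at the cost of more measurement runs.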

2024-08-07 Tags: llm, hyperparameter, huggingface by klotz

Whisper WebGPU - a Hugging Face Space by Xenova

2024-06-09 Tags: whisper, speech recognition, webgpu, browser, llm, huggingface, self-hosted by klotz

HuggingFace Releases

HuggingFace has released FineWeb, a new large-scale dataset for pretraining large language models (LLMs). It contains 15 trillion tokens and occupies 44TB of disk space. The data is drawn from CommonCrawl and goes through rigorous deduplication and quality filtering, making it a valuable resource for researchers.

2024-06-04 Tags: huggingface, fineweb, dataset, llm, commoncrawl by klotz

Training and Finetuning Embedding Models with Sentence Transformers v3

This article explains how to use the Sentence Transformers library to finetune and train embedding models for a variety of applications, such as retrieval augmented generation, semantic search, and semantic textual similarity. It covers the training components, dataset format, loss function, training arguments, evaluators, and trainer.

2024-05-28 Tags: sentence transformers, finetune, embedding, models, similarity, llm, huggingface by klotz

abacusai/Smaug-Llama-3-70B-Instruct

This model was built using a new Smaug recipe for improving performance on real-world multi-turn conversations, applied to meta-llama/Meta-Llama-3-70B-Instruct.

The model substantially outperforms Llama-3-70B-Instruct and is on par with GPT-4-Turbo on MT-Bench.

2024-05-21 Tags: llm, llama-3, 70b, instruct, smaug, chat, huggingface by klotz

HuggingFace Transformers Installation

python -c "from transformers import pipeline; print(pipeline('sentiment-analysis')('I love you'))"

2024-02-22 Tags: huggingface, transformers, installation, llm, python by klotz

blog/chat-templates.md at main · huggingface/blog

2024-02-22 Tags: llm, chat, template, huggingface, prompt by klotz
