SemanticScuttle - klotz.me » Tags: huggingface+llm

HuggingFace Releases This bookmark is certified by an admin user.

HuggingFace has released FineWeb, a new large-scale dataset consisting of 15 trillion tokens and 44TB of disk space designed for pretraining large language models (LLMs). The dataset, which leverages data from CommonCrawl, undergoes rigorous deduplication and quality filtering processes, making it a valuable tool for researchers.

2024-06-04 Tags: huggingface, fineweb, dataset, llm, commoncrawl by klotz

Training and Finetuning Embedding Models with Sentence Transformers v3 This bookmark is certified by an admin user.

This article explains how to use the Sentence Transformers library to finetune and train embedding models for a variety of applications, such as retrieval augmented generation, semantic search, and semantic textual similarity. It covers the training components, dataset format, loss function, training arguments, evaluators, and trainer.

2024-05-28 Tags: sentence transformers, finetune, embedding, models, similarity, llm, huggingface by klotz

abacusai/Smaug-Llama-3-70B-Instruct This bookmark is certified by an admin user.

This model was built using a new Smaug recipe for improving performance on real world multi-turn conversations applied to meta-llama/Meta-Llama-3-70B-Instruct.

The model outperforms Llama-3-70B-Instruct substantially, and is on par with GPT-4-Turbo, on MT-Bench (see below).

2024-05-21 Tags: llm, llama-3, 70b, instruct, smaug, chat, huggingface by klotz

HuggingFace Transformers Installation This bookmark is certified by an admin user.

python -c "from transformers import pipeline; print(pipeline('sentiment-analysis')('I love you'))"

2024-02-22 Tags: huggingface, transformers, installation, llm, python by klotz

blog/chat-templates.md at main · huggingface/blog This bookmark is certified by an admin user.

2024-02-22 Tags: llm, chat, template, huggingface, prompt by klotz

LoneStriker/Everyone-Coder-4x7b-Base-5.0bpw-h6-exl2 · Hugging Face This bookmark is certified by an admin user.

Not Mixtral MoE but Merge-kit MoE

EveryoneLLM series of models are a new Mixtral type model created using experts that were finetuned by the community, for the community. This is the first model to release in the series and it is a coding specific model. EveryoneLLM, which will be a more generalized model, will be released in the near future after more work is done to fine tune the process of merging Mistral models into a larger Mixtral models with greater success.

The goal of the EveryoneLLM series of models is to be a replacement or an alternative to Mixtral-8x7b that is more suitable for general and specific use, as well as easier to fine tune. Since Mistralai is being secretive about the "secret sause" that makes Mixtral-Instruct such an effective fine tune of the Mixtral-base model, I've decided its time for the community to directly compete with Mistralai on our own.

2024-02-09 Tags: llm, huggingface, everyone, coder, mistral, moe, mixtral, quantization, lonestriker by klotz

GoogleCloudPlatform/localllm: Run LLMs locally on Cloud Workstations This bookmark is certified by an admin user.

- create a custom base image for a Cloud Workstation environment using a Dockerfile
. Uses:

Quantized models from

2024-02-08 Tags: llm, google, locallama, github, foss, gguf, huggingface, llama.cpp by klotz

Preference Tuning LLMs with Direct Preference Optimization Methods This bookmark is certified by an admin user.

2024-01-18 Tags: llm, dpo, fine tuning, huggingface by klotz

Can Ai Code Results - a Hugging Face Space by mike-ravkine This bookmark is certified by an admin user.

2024-01-17 Tags: llm, automatic programming, leaderboard, huggingface, mike ravkine, python, javascript by klotz

Democratizing LLMs: 4-bit Quantization for Optimal LLM Inference This bookmark is certified by an admin user.

A deep dive into model quantization with GGUF and llama.cpp and model evaluation with LlamaIndex

2024-01-15 Tags: llm, gguf, georgi gerganov, llama.cpp, llamaindex, huggingface, rag by klotz

SemanticScuttle - klotz.me

Tags: huggingface* + llm*

Linked Tags

Related Tags