SemanticScuttle - klotz.me » klotz: benchmark

klotz: benchmark*

A benchmark of large language models, sorted by size (on disk) for each score. Highlighted entries are on the Pareto frontier.

2024-09-03 Tags: llm, benchmark, oobabooga by klotz

StructuredRAG Released by Weaviate: A Comprehensive Benchmark to Evaluate Large Language Models’ Ability to Generate Reliable JSON Outputs for Complex AI Systems

Weaviate introduces StructuredRAG, a benchmark to evaluate LLMs' ability to generate reliable JSON outputs. The study finds that while LLMs perform well on simpler tasks, they struggle with more complex outputs.

2024-08-27 Tags: llm, json, weaviate, benchmark by klotz

vLLM Benchmark

This repository contains scripts for benchmarking the performance of large language models (LLMs) served using vLLM.

2024-08-24 Tags: vllm, benchmark, llm, performance, backprop.co by klotz

Benchmarks show even an old Nvidia RTX 3090 is enough to serve LLMs to thousands

A startup called Backprop has demonstrated that a single Nvidia RTX 3090 GPU, released in 2020, can handle serving a modest large language model (LLM) like Llama 3.1 8B to over 100 concurrent users with acceptable throughput. This suggests that expensive enterprise GPUs may not be necessary for scaling LLMs to a few thousand users.

2024-08-24 Tags: nvidia, rtx 3090, llm, gpu, performance, benchmark, llama 3.1 8b, vllm, production engineering, backprop.co by klotz

Artificial Analysis

Independent analysis of AI language models and API providers. Understand the AI landscape and choose the best model and API provider for your use-case.

2024-07-14 Tags: large language models, benchmark by klotz

Honey, I shrunk the LLM! A beginner's guide to quantization

This article explores the concept of quantization in large language models (LLMs) and its benefits, including reducing memory usage and improving performance. It also discusses various quantization methods and their effects on model quality.

2024-07-14 Tags: llm, quantization, gpu, benchmark by klotz

txtai-text-classify.py

A Github Gist containing a Python script for text classification using the TxTail API

2024-07-13 Tags: gist, python, txtail, text classification, github, benchmark, llm, gpt, bert by klotz

A Benchmark and Taxonomy of Categorical Encoders

- provides list of 55 categorical encoders, explains how to use the code as a supplement to Category Encoders Python module.
- categorizes encoders into families, explains how to reuse code from the benchmark to include your encoder or dataset in comparison

2024-04-01 Tags: categorical encoders, benchmark, taxonomy, distance-based, machine learning by klotz

Llama 2 is about as factually accurate as GPT-4 for summaries and is 30X cheaper | Anyscale

2023-08-25 Tags: llama-2, gpt, benchmark, llm openai by klotz

A direct comparison between llama.cpp, AutoGPTQ, ExLlama, and transformers perplexities - LLM blog

2023-07-16 Tags: llm, comparison, benchmark, perplexity, llama.cpp, exllama, autogptq, linux by klotz

First / Previous / Next / Last / Page 1 of 0

SemanticScuttle - klotz.me

klotz: benchmark*

Linked Tags

Related Tags