klotz: benchmark* + llm*


  1. A benchmark of large language models, sorted by size (on disk) for each score. Highlighted entries are on the Pareto frontier.
    2024-09-03 by klotz
  2. Weaviate introduces StructuredRAG, a benchmark to evaluate LLMs' ability to generate reliable JSON outputs. The study finds that while LLMs perform well on simpler tasks, they struggle with more complex outputs.
    2024-08-27 by klotz
  3. This repository contains scripts for benchmarking the performance of large language models (LLMs) served using vLLM.
    2024-08-24 by klotz
  4. A startup called Backprop has demonstrated that a single Nvidia RTX 3090 GPU, released in 2020, can handle serving a modest large language model (LLM) like Llama 3.1 8B to over 100 concurrent users with acceptable throughput. This suggests that expensive enterprise GPUs may not be necessary for scaling LLMs to a few thousand users.
  5. Independent analysis of AI language models and API providers. Understand the AI landscape and choose the best model and API provider for your use case.
    2024-07-14 by klotz
  6. This article explores the concept of quantization in large language models (LLMs) and its benefits, including reducing memory usage and improving performance. It also discusses various quantization methods and their effects on model quality.
    2024-07-14 by klotz
  7. A GitHub Gist containing a Python script for text classification using the TxTail API.
  8. Compare the performance of different LLMs that can be deployed locally on consumer hardware. The expected good responses and scores are generated by GPT-4.
    2023-06-09 by klotz
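
The first bookmark highlights entries on the Pareto frontier of size versus score. As a rough illustration of what that means, here is a minimal sketch: an entry is kept only if no other entry is at most as large and scores at least as well. The `Entry` type, the `pareto_frontier` function, and the sample figures are hypothetical and are not taken from the linked benchmark.

```python
from typing import List, NamedTuple


class Entry(NamedTuple):
    """One benchmark row (hypothetical schema)."""
    name: str
    size_gb: float  # size on disk; smaller is better
    score: float    # benchmark score; higher is better


def pareto_frontier(entries: List[Entry]) -> List[Entry]:
    """Return entries not dominated by a smaller-or-equal, better-or-equal entry."""
    frontier = []
    best_score = float("-inf")
    # Walk from smallest to largest; on equal size, visit the higher score first.
    for entry in sorted(entries, key=lambda e: (e.size_gb, -e.score)):
        # Keep an entry only if it beats every entry that is no larger than it.
        if entry.score > best_score:
            frontier.append(entry)
            best_score = entry.score
    return frontier


# Made-up sample data, for illustration only.
models = [
    Entry("a", 4.0, 60.0),
    Entry("b", 8.0, 58.0),
    Entry("c", 8.0, 70.0),
    Entry("d", 16.0, 72.0),
    Entry("e", 40.0, 71.0),
]

for entry in pareto_frontier(models):
    print(entry.name, entry.size_gb, entry.score)
```

With this sample data, "b" and "e" drop out because a model of equal or smaller size already scores higher.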


