SemanticScuttle - klotz.me

Tags: llama*

0 bookmark(s) - Sort by: Date ↓ / Title /

Transformer Lab: Experiment with Large Language Models

Transformer Lab is an open-source application for advanced LLM engineering, allowing users to interact, train, fine-tune, and evaluate large language models on their own computer. It supports various models, hardware, and inference engines and includes features like RAG, dataset building, and a REST API.

2025-04-11 Tags: electron, transformers, llama, lora, mlx, llms, rlhf, llm, github by klotz

Search/ReSearch: Asking questions of images with AI?

An analysis of how well different AI systems perform in describing images and answering questions about them. The article compares ChatGPT, Gemini, Llama, and Claude using four images: a hand, a bottle of wine, a piece of pastry, and a flower.

2025-03-01 Tags: vlm, image description, chatgpt, gemini, llama, claude, image, dan russell by klotz

xplore-terminallm - main.py

A script utilizing OpenAI's Llama models to interact within a terminal environment, allowing the models to execute Python code and communicate based on predefined prompts.

2024-12-09 Tags: openai, llama, python, code, agents, chaotic neutral by klotz

NotebookLlama: An Open Source version of NotebookLM

A guided series of tutorials/notebooks to build a PDF to Podcast workflow using Llama models for text processing, transcript writing, dramatization, and text-to-speech conversion.

2024-10-28 Tags: notebookllama, pdf, llama, text processing, foss, facebook by klotz

mistral.rs: Running Llama Vision on Mac M2

Simon Willison explains how to use the mistral.rs library in Rust to run the Llama Vision model on a Mac M2 laptop. He provides a detailed example and discusses the memory usage and GPU utilization.

2024-10-19 Tags: mistral.rs, llama, vision, rust, simon willison, llm, cli, inference by klotz

Gemma vs. Llama vs. Mistral: Exploring Smaller AI Models

This article compares the performance of smaller language models Gemma, Llama 3, and Mistral on reading comprehension tasks. The author highlights the trend of smaller, more accessible models and discusses Apple's recent foray into the field with its own proprietary model.

2024-08-07 Tags: llm, gemma, llama, mistral by klotz

Achieving Faster Open-Source Llama3 Serving with SGLang Runtime (vs. TensorRT-LLM, vLLM)

This blog post benchmarks and compares the performance of SGLang, TensorRT-LLM, and vLLM for serving large language models (LLMs). SGLang demonstrates superior or competitive performance in offline and online scenarios, often outperforming vLLM and matching or exceeding TensorRT-LLM.

2024-07-27 Tags: sglang, tensorrt-llm, vllm, llama, llm by klotz

How to log output of running models and performance monitoring

A discussion post on Reddit's LocalLLaMA subreddit about logging the output of running models and monitoring performance, specifically for debugging errors, warnings, and performance analysis. The post also mentions the need for flags to output logs as flat files, GPU metrics (GPU utilization, RAM usage, TensorCore usage, etc.) for troubleshooting and analytics.

2024-06-12 Tags: llama, python, logging, performance, monitoring, gpu, metrics, debugging, nvidia, analytics, product lion engineering, llms by klotz

A Beginner-Friendly Introduction to LLMs

This article provides a beginner-friendly introduction to Large Language Models (LLMs) and explains the key concepts in a clear and organized way.

2024-05-10 Tags: llm, introduction, bert, palm, gpt, llama by klotz

An explanation of why you should not include a trailing whitespace at the end of your prompts (ChatGPT, Llama, etc.) : r/LocalLLaMA

2024-04-19 Tags: llm, reddit, llama, whitespace, prompt engineering by klotz

First / Previous / Next / Last / Page 1 of 0

SemanticScuttle - klotz.me

Tags: llama*

Linked Tags

Related Tags