klotz: llm* + python*


  1. Starlette 1.0 has been released, and Simon Willison explores its new features by leveraging Claude’s skill‑building capabilities. He demonstrates how Claude can clone the Starlette repository, generate a comprehensive skill document with code examples, and even create a fully functional task‑management app complete with database, API endpoints, and Jinja2 templates—all generated and tested by Claude itself. The article highlights the practical benefits of integrating an LLM as a coding agent, showcases the new lifespan mechanism, and reflects on the growing popularity of Starlette as the foundation of FastAPI.
  2. This project, `autoresearch-opencode`, is an autonomous experiment loop designed for use with OpenCode. It is a port of `pi-autoresearch`, reimplemented as a pure skill: it needs no MCP server and relies solely on instructions the agent follows with its built-in tools. The skill lets users automate optimization tasks, as demonstrated by an example that optimizes the BogoSort algorithm, achieving a 7,802x speedup by using Python's `bisect` module for sorted-state detection.
    The system maintains state using a JSONL file, enabling resume/pause functionality and detailed experiment tracking. It provides a dashboard for monitoring progress and ensures data integrity through atomic writes and validation checks.
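The JSONL-with-atomic-writes pattern described above can be sketched roughly as follows; the function name and record shape are illustrative, not taken from the repository:

```python
import json
import os
import tempfile

def append_experiment(path: str, record: dict) -> None:
    """Append one experiment record to a JSONL state file atomically.

    The whole file is rewritten to a temp file in the same directory and
    swapped in with os.replace, so a crash mid-write can never leave a
    truncated state file behind.
    """
    records = []
    if os.path.exists(path):
        with open(path) as f:
            records = [json.loads(line) for line in f if line.strip()]
    records.append(record)
    fd, tmp = tempfile.mkstemp(dir=os.path.dirname(path) or ".")
    with os.fdopen(fd, "w") as f:
        for r in records:
            f.write(json.dumps(r) + "\n")
    os.replace(tmp, path)  # atomic swap: readers see old or new, never partial
```

Because each line is an independent JSON object, resuming is just replaying the file, which is what makes the pause/resume and tracking features cheap to build.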
  3. agentic_TRACE is a framework designed to build LLM-powered data analysis agents that prioritize data integrity and auditability. It addresses the risks associated with directly feeding data to LLMs, such as fabrication, inaccurate calculations, and context window limitations. The core principle is to separate the LLM's orchestration role from the actual data processing, which is handled by deterministic tools.
    This approach ensures prompts remain concise, minimizes hallucination risks, and provides a complete audit trail of data transformations. The framework is domain-agnostic, allowing users to extend it with custom tools and data sources for specific applications. A working example, focusing on stock market analysis, demonstrates its capabilities.
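The separation the framework describes can be sketched as below; the tool names and audit-log shape are invented for illustration, with deterministic Python functions doing the arithmetic while every call is recorded:

```python
import statistics

AUDIT_LOG = []  # complete record of every data transformation

TOOLS = {
    "mean": statistics.mean,
    "max": max,
}

def run_tool(name: str, values: list) -> float:
    """Execute a deterministic tool and record the call in the audit trail.

    The LLM's only job is to choose a tool name and arguments; the raw
    numbers never pass through the model, so it cannot fabricate results.
    """
    result = TOOLS[name](values)
    AUDIT_LOG.append({"tool": name, "input": list(values), "output": result})
    return result

# The LLM would emit something like {"tool": "mean", "values": [...]}:
run_tool("mean", [1.0, 2.0, 3.0])
```

Extending the framework then amounts to registering more deterministic functions in the tool table.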
  4. This article details building end-to-end observability for LLM applications using FastAPI and OpenTelemetry. It emphasizes a code-first approach, manually designing traces, spans, and semantic attributes to capture the full lifecycle of LLM-powered requests. The guide advocates for a structured approach to tracing RAG workflows, focusing on clear span boundaries, safe metadata capture (hashing prompts/responses), token usage tracking, and integration with observability backends like Jaeger, Grafana Tempo, or specialized LLM platforms. It highlights the importance of understanding LLM behavior beyond traditional infrastructure metrics.
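The safe-metadata idea (hashing prompts/responses before attaching them to spans) can be sketched with the standard library alone; the span attribute name below is an assumption, not one from the article:

```python
import hashlib

def prompt_fingerprint(text: str) -> str:
    """Return a short, stable hash of a prompt or response.

    Recording this on a span instead of the raw text keeps sensitive
    content out of the tracing backend while still letting identical
    prompts be correlated across traces.
    """
    return hashlib.sha256(text.encode("utf-8")).hexdigest()[:16]

# e.g. span.set_attribute("llm.prompt.hash", prompt_fingerprint(prompt))
```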
  5. The article details “autoresearch,” a project by Karpathy where an AI agent autonomously experiments with training a small language model (nanochat) to improve its performance. The agent modifies the `train.py` file, trains for a fixed 5-minute period, and evaluates the results, repeating this process to iteratively refine the model. The project aims to demonstrate autonomous AI research, focusing on a simplified, single-GPU setup with a clear metric (validation bits per byte).

    * **Autonomous Research:** The core concept of AI-driven experimentation.
    * **nanochat:** The small language model used for training.
    * **Fixed Time Budget:** Each experiment runs for exactly 5 minutes.
    * **program.md:** The file containing instructions for the AI agent.
    * **Single-File Modification:** The agent only edits `train.py`.
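The loop outlined above can be reduced to a toy sketch; `run_training` and `evaluate` stand in for the real workflow (editing `train.py`, training for the fixed budget, measuring validation bits per byte) and only the fixed-budget, keep-the-best structure is the point:

```python
import time

def experiment_loop(run_training, evaluate, budget_s: float, rounds: int) -> float:
    """Run repeated timed experiments, keeping the best score seen.

    Stand-in for the autoresearch loop: each round gets a hard time
    budget, then the result is evaluated against a single clear metric.
    """
    best = float("inf")  # validation bits per byte: lower is better
    for _ in range(rounds):
        deadline = time.monotonic() + budget_s
        run_training(deadline)       # training must stop itself by the deadline
        best = min(best, evaluate())
    return best
```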
  6. This article explores five Python decorators for optimizing LLM-based applications. The decorators draw on libraries such as functools, diskcache, tenacity, ratelimit, and magentic to address common challenges: caching, network resilience, rate limiting, and structured output binding. Code examples show how each decorator can be implemented to improve the performance and reliability of LLM applications.
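As one small illustration of the caching pattern, `functools.lru_cache` from the standard library can memoize repeated prompts; the `ask_llm` function below is a stand-in, not code from the article:

```python
import functools

@functools.lru_cache(maxsize=256)
def ask_llm(prompt: str) -> str:
    # Stand-in for an expensive model call; repeated identical prompts
    # are answered from the in-process cache instead.
    return f"answer to: {prompt}"

ask_llm("What is RAG?")
ask_llm("What is RAG?")            # second call is served from cache
print(ask_llm.cache_info().hits)   # 1
```

The same shape applies to the other decorators: each wraps the model call once and adds one concern (retries, rate limits, output parsing) without touching the call sites.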
  7. This article details how to use Ollama to run large language models locally, protecting sensitive data by keeping it on your machine. It covers installation, usage with Python, LangChain, and LangGraph, and provides a practical example with FinanceGPT, while also discussing the tradeoffs of using local LLMs.
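A minimal sketch of talking to a local Ollama server over its HTTP API (default port 11434) looks like the following; the request shape matches Ollama's `/api/chat` endpoint, but the model name is just an example:

```python
import json
import urllib.request

def build_chat_request(model: str, prompt: str) -> dict:
    # Request body for Ollama's /api/chat endpoint.
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "stream": False,  # ask for one JSON reply instead of a stream
    }

payload = build_chat_request("llama3.2", "Summarize this quarter's figures.")

# With a local Ollama server running, the call would be:
# req = urllib.request.Request(
#     "http://localhost:11434/api/chat",
#     data=json.dumps(payload).encode("utf-8"),
#     headers={"Content-Type": "application/json"},
# )
# reply = json.load(urllib.request.urlopen(req))["message"]["content"]
```

Because the request never leaves the machine, the sensitive prompt content stays local, which is the privacy tradeoff the article is built around.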
  8. Learn how to equip your Microsoft Agent Framework agents with portable, reusable skill packages that provide domain expertise on demand using Agent Skills. This article covers what Agent Skills are, progressive disclosure, creating skills, connecting skills to an agent (with .NET and Python examples), use cases, and security considerations.
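As a rough sketch of what a skill package might contain, assuming the Markdown-with-frontmatter layout used by the open Agent Skills format (the skill name, description, and body below are invented):

```markdown
---
name: expense-reports
description: Summarizes quarterly expenses from CSV exports on request.
---

# Expense Reports

Step-by-step instructions the agent loads only when this skill is
invoked; progressive disclosure keeps them out of the base prompt
until they are needed.
```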
  9. Alibaba has released CoPaw, an open-source framework designed to provide a standardized workstation for deploying and managing personal AI agents. It addresses the shift from LLM inference to autonomous agentic systems, focusing on the environment in which models operate. CoPaw utilizes AgentScope, AgentScope Runtime, and ReMe to handle agent logic, execution, and persistent memory, enabling long-term experience and multi-channel connectivity.
  10. This course takes you from Python fundamentals to AI Agent development, covering core Python, NumPy, Pandas, SQL, Flask, FastAPI, LLMs, and open-source models via HuggingFace.


SemanticScuttle - klotz.me: Tags: llm + python
