A discussion post on Reddit's LocalLLaMA subreddit about logging the output of running models and monitoring their performance, specifically for debugging errors and warnings and for performance analysis. The post also asks for flags to write logs to flat files and to capture GPU metrics (GPU utilization, VRAM usage, Tensor Core usage, etc.) for troubleshooting and analytics. A sketch of such a logger follows below.
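A minimal sketch of the kind of flat-file GPU logger the post asks for, assuming the `pynvml` NVIDIA bindings (from the `nvidia-ml-py` package). Note that NVML does not expose Tensor Core usage directly, so only overall utilization and memory are sampled; the file name and sampling interval are arbitrary choices for illustration:

```python
# Periodically append GPU utilization and VRAM usage to a flat log file.
# Assumes the pynvml bindings; device 0 is the GPU to monitor.
import time
from datetime import datetime

import pynvml

pynvml.nvmlInit()
handle = pynvml.nvmlDeviceGetHandleByIndex(0)  # first GPU (e.g. the RTX 3090)

with open("gpu_metrics.log", "a") as log:  # placeholder file name
    try:
        while True:
            util = pynvml.nvmlDeviceGetUtilizationRates(handle)  # .gpu / .memory in %
            mem = pynvml.nvmlDeviceGetMemoryInfo(handle)         # bytes
            log.write(
                f"{datetime.now().isoformat()} "
                f"gpu_util={util.gpu}% mem_util={util.memory}% "
                f"vram_used={mem.used // 2**20}MiB vram_total={mem.total // 2**20}MiB\n"
            )
            log.flush()       # keep the flat file current for tailing
            time.sleep(5)     # arbitrary 5-second sampling interval
    finally:
        pynvml.nvmlShutdown()
```

Writing one timestamped line per sample keeps the file trivially parseable for later analytics with standard text tools.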
How to get oobabooga/text-generation-webui running on Windows or Linux with LLaMA-30B in 4-bit mode via GPTQ-for-LLaMa on an RTX 3090, from start to finish.
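Once the webui and GPTQ-for-LLaMa are installed and the quantized weights are in place, the server is launched with 4-bit flags. A minimal launcher sketch follows; the flag names (`--wbits`, `--groupsize`, `--model_type`) reflect the GPTQ-for-LLaMa era of the webui and may differ in newer versions, and the model directory name is a placeholder:

```python
# Launch text-generation-webui's server.py in 4-bit GPTQ mode.
# Run from the text-generation-webui checkout directory.
import subprocess
import sys

# Placeholder: folder name under text-generation-webui/models/
MODEL_DIR = "llama-30b-4bit-128g"

subprocess.run(
    [
        sys.executable, "server.py",
        "--model", MODEL_DIR,
        "--wbits", "4",        # 4-bit GPTQ weights
        "--groupsize", "128",  # common group size for LLaMA GPTQ quantizations
        "--model_type", "llama",
    ],
    check=True,  # raise if the server exits with an error
)
```

A 4-bit LLaMA-30B quantization fits in the RTX 3090's 24 GB of VRAM, which is the point of the GPTQ route on this card.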