SemanticScuttle - klotz.me » klotz: metrics

klotz: metrics*

What Is OpenTelemetry? The Ultimate Guide

OpenTelemetry is not just an observability platform, it's a set of best practices and standards that can be integrated into platform engineering or DevOps.

2024-08-26 Tags: opentelemetry, observability, telemetry data, golden signals, metrics, logs, out traces, platform engineering, production engineering by klotz

Metrics to Evaluate a Classification Machine Learning Model

This article explores various metrics used to evaluate the performance of classification machine learning models, including precision, recall, F1-score, accuracy, and alert rate. It explains how these metrics are calculated and provides insights into their application in real-world scenarios, particularly in fraud detection.

2024-08-01 Tags: machine learning, classification, metrics, evaluation, precision, recall, f1-score, accuracy, alert rate, fraud detection, llm by klotz

It’s Time to Finally Memorize Those Dang Classification Metrics!

This article discusses the importance of understanding and memorizing classification metrics in machine learning. The author shares their own experience and strategies for memorizing metrics such as accuracy, precision, recall, F1 score, and ROC AUC.

2024-06-24 Tags: classification, metrics, machine learning, data science, precision, recall, accuracy, roc, auc by klotz

Analysing Interactions with Friedman’s H-stat and Python

The article explains how to apply Friedman's h-statistic to understand if complex machine learning models use interactions to make predictions. It uses the artemis package and interprets the pairwise, overall, and unnormalised metrics.

2024-06-21 Tags: statistics, metrics, friedman_s h-statistic, machine learning, interactions, artemis, pairwise by klotz

How to log output of running models and performance monitoring

A discussion post on Reddit's LocalLLaMA subreddit about logging the output of running models and monitoring performance, specifically for debugging errors, warnings, and performance analysis. The post also mentions the need for flags to output logs as flat files, GPU metrics (GPU utilization, RAM usage, TensorCore usage, etc.) for troubleshooting and analytics.

2024-06-12 Tags: llama, python, logging, performance, monitoring, gpu, metrics, debugging, nvidia, analytics, product lion engineering, llms by klotz

Metrics, Traces, Logs — And Now, OpenTelemetry Profile Data

With the addition of profiling to OpenTelemetry, we expect continuous production profiling to hit the mainstream.

2024-06-01 Tags: metrics, traces, logs, opentelemetry, profiling, observability, ebpf, production engineering by klotz

Observability, Telemetry, and Monitoring: Learn About the Differences

This article explains the differences between observability, telemetry, and monitoring, and how they work together to help teams understand and improve their software systems. It also discusses the benefits of using OpenTelemetry, a standard for creating and collecting telemetry for software systems, and Honeycomb's observability platform.

2024-05-29 Tags: observability, telemetry, monitoring, honeycomb, opentelemetry, logs, metrics, traces, slo, data, semi-structured data, structured data, semantic, production engineering by klotz

OpenTelemetry Is No ‘Magic Button’ for Observability

OpenTelemetry offers a standardized process for observability, but its functionality is a work in progress. Its usefulness depends on the observability tools and platforms used in conjunction with OpenTelemetry.

2024-05-24 Tags: opentelemetry, observability, devops, metrics, logs, traces, grafana, honeycomb, datadog, dynatrace, splunk, interoperability, vendor lock-in, production engineering by klotz

Langfuse - Open Source LLM Engineering Platform

Langfuse is an open-source LLM engineering platform that offers tracing, prompt management, evaluation, datasets, metrics, and playground for debugging and improving LLM applications. It is backed by several renowned companies and has won multiple awards. Langfuse is built with security in mind, with SOC 2 Type II and ISO 27001 certifications and GDPR compliance.

2024-05-23 Tags: lamgfuse, llm, prompt engineering, evaluation, datasets, metrics, observability by klotz

Steady the Course: Navigating the Evaluation of LLM-based Applications

Why evaluating LLM apps matters and how to get started

2023-11-10 Tags: llm, application, evaluation, metrics by klotz

First / Previous / Next / Last / Page 1 of 0

SemanticScuttle - klotz.me

klotz: metrics*

Linked Tags

Related Tags