This paper addresses the misalignment between traditional IR evaluation metrics and the requirements of modern Retrieval-Augmented Generation (RAG) systems. It proposes a novel annotation schema and the UDCG metric to better evaluate retrieval quality when the consumer of the retrieved documents is an LLM rather than a human reader.
This blog post demonstrates how to create a reusable retrieval evaluation dataset by using an LLM to judge query-document pairs. It walks through the full process: building a small human-labeled dataset, checking that the LLM judge's labels agree with human preferences, and then scaling the judge out to a large set of queries and documents.
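The workflow the blog post describes can be sketched as follows. This is a minimal, illustrative outline, not the post's actual code: the function and variable names (`judge_relevance`, `SEED_LABELS`) are hypothetical, and the LLM call is stubbed with keyword overlap so the sketch runs without an API key; in practice that stub would be replaced by a grading prompt sent to an LLM.

```python
import re


def tokens(text: str) -> set[str]:
    """Lowercased word tokens, for the stub judge below."""
    return set(re.findall(r"\w+", text.lower()))


def judge_relevance(query: str, document: str) -> int:
    """Placeholder for an LLM judge returning a 0/1 relevance label.
    A real implementation would prompt an LLM and parse its answer;
    here keyword overlap stands in so the sketch is runnable."""
    return 1 if tokens(query) & tokens(document) else 0


# Step 1: a small human-labeled seed set of (query, document, human_label).
SEED_LABELS = [
    ("python sort list", "Use sorted() or list.sort() to order a list.", 1),
    ("python sort list", "The Eiffel Tower is in Paris.", 0),
    ("capital of france", "Paris is the capital of France.", 1),
]

# Step 2: measure how often the judge agrees with the human labels.
agreement = sum(
    judge_relevance(q, d) == label for q, d, label in SEED_LABELS
) / len(SEED_LABELS)
print(f"agreement with humans: {agreement:.0%}")

# Step 3: once agreement is acceptable, label a large pool of pairs
# to produce the reusable evaluation dataset.
pool = [("capital of france", "France borders Spain and Germany.")]
dataset = [(q, d, judge_relevance(q, d)) for q, d in pool]
```

The key design point from the post is step 2: the LLM judge is only trusted at scale after its labels have been validated against a small set of human judgments.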