SemanticScuttle - klotz.me » Tags: information retrieval

Tags: information retrieval*

0 bookmark(s) - Sort by: Date ↓ / Title /

Towards Better Search with Domain-Aware Text Embeddings for C2C Marketplaces

This paper reports on an experiment to build a domain-aware Japanese text-embedding approach to improve the quality of search at Mercari, Japan's largest C2C marketplace.

2025-12-25 Tags: text embeddings, information retrieval, search, machine learning, llm, fine tuning by klotz

The Architecture Behind Web Search in AI Chatbots

This article explores the architecture enabling AI chatbots to perform web searches, covering retrieval-augmented generation (RAG), vector databases, and the challenges of integrating search with LLMs.

2025-12-07 Tags: chat, web search, rag, retrieval-augmented generation, vector databases, llm, information retrieval, knowledge base by klotz

Redefining Retrieval Evaluation in the Era of LLMs

This paper addresses the misalignment between traditional IR evaluation metrics and the requirements of modern Retrieval-Augmented Generation (RAG) systems. It proposes a novel annotation schema and the UDCG metric to better evaluate retrieval quality for LLM consumers.

2025-10-29 Tags: retrieval augmented generation, rag, information retrieval, llms, evaluation metrics, udcg, relevance, utility by klotz

How I Built Lightning-Fast Vector Search for Legal Documents

This article details the process of building a fast vector search system for a large legal dataset (Australian High Court decisions). It covers choosing embedding providers, performance benchmarks, using USearch and Isaacus embeddings, and the importance of API terms of service. It focuses on achieving speed and scalability while maintaining reasonable accuracy.

2025-10-21 Tags: vector search, embeddings, legal documents, usearch, isaacus, performance, scalability, nlp, information retrieval, rag by klotz

PLUM: Adapting Pre-trained Language Models for Industrial-scale Generative Recommendations

In this paper, we introduce PLUM, a framework designed to adapt pre-trained LLMs for industry-scale recommendation tasks. PLUM consists of item tokenization using Semantic IDs, continued pre-training (CPT) on domain-specific data, and task-specific fine-tuning for recommendation objectives. We conduct comprehensive experiments on large-scale internal video recommendation datasets and demonstrate substantial improvements for retrieval compared to a heavily-optimized production model.

2025-10-16 Tags: information retrieval, machine learning, recommendation systems, large language models, generative models by klotz

Answer: So what ARE LLMs good at? What are they bad at?

A blog post comparing when to use regular Google search versus LLMs for research, outlining the strengths and weaknesses of each. It details scenarios where search engines excel (facts, current events, specific sources) and where LLMs shine (analysis, synthesis, creative thinking). It also lists tasks LLMs struggle with, such as complex reasoning, real-time information, and fact verification.

2025-07-23 Tags: llms, ai, search engines, information retrieval, synthesis, analysis, factual accuracy, current events, dan russell by klotz

Answer: Can you extract and summarize a blog?

This blog post details an experiment testing the ability of LLMs (Gemini, ChatGPT, Perplexity) to accurately retrieve and summarize recent blog posts from a specific URL (searchresearch1.blogspot.com). The author found significant issues with hallucinations and inaccuracies, even in models claiming live web access, highlighting the unreliability of LLMs for even simple research tasks.

2025-04-10 Tags: llm, ai, hallucination, web access, search, gemini, chatgpt, perplexity, research, information retrieval, dan russell by klotz

Overcome Failing Document Ingestion & RAG Strategies with Agentic Knowledge Distillation

This article introduces the pyramid search approach using Agentic Knowledge Distillation to address the limitations of traditional RAG strategies in document ingestion.

The pyramid structure allows for multi-level retrieval, including atomic insights, concepts, abstracts, and recollections. This structure mimics a knowledge graph but uses natural language, making it more efficient for LLMs to interact with.

**Knowledge Distillation Process**:
- **Conversion to Markdown**: Documents are converted to Markdown for better token efficiency and processing.
- **Atomic Insights Extraction**: Each page is processed using a two-page sliding window to generate a list of insights in simple sentences.
- **Concept Distillation**: Higher-level concepts are identified from the insights to reduce noise and preserve essential information.
- **Abstract Creation**: An LLM writes a comprehensive abstract for each document, capturing dense information efficiently.
- **Recollections/Memories**: Critical information useful across all tasks is stored at the top of the pyramid.

2025-03-07 Tags: agent, knowledge distillation, rag, document, pyramid search, llm, information retrieval, scuttle, summarizer by klotz

rerankers: A Lightweight Python Library to Unify Ranking Methods

Re-ranking is integral to retrieval pipelines, but implementation methods vary. We introduce rerankers, a Python library offering a unified interface for common re-ranking approaches.

2024-09-17 Tags: re-ranking, python, information retrieval, search engine, release by klotz

Why Does Position-Based Chunking Lead to Poor Performance in RAGs? How to implement semantic chunking and gain better results.

This article explores the limitations of position-based chunking in Retrieval Augmented Generation (RAG) systems and proposes semantic chunking as a better alternative for improved performance.

2024-08-24 Tags: rag, chunking, llm, information retrieval by klotz

First / Previous / Next / Last / Page 1 of 0

SemanticScuttle - klotz.me

Tags: information retrieval*

Linked Tags

Related Tags