Tags: llm* + rag* + document*

0 bookmark(s) - Sort by: Date ↓ / Title /

  1. This article introduces the pyramid search approach using Agentic Knowledge Distillation to address the limitations of traditional RAG strategies in document ingestion.

    The pyramid structure allows for multi-level retrieval, including atomic insights, concepts, abstracts, and recollections. This structure mimics a knowledge graph but uses natural language, making it more efficient for LLMs to interact with.

    **Knowledge Distillation Process**:
    - **Conversion to Markdown**: Documents are converted to Markdown for better token efficiency and processing.
    - **Atomic Insights Extraction**: Each page is processed using a two-page sliding window to generate a list of insights in simple sentences.
    - **Concept Distillation**: Higher-level concepts are identified from the insights to reduce noise and preserve essential information.
    - **Abstract Creation**: An LLM writes a comprehensive abstract for each document, capturing dense information efficiently.
    - **Recollections/Memories**: Critical information useful across all tasks is stored at the top of the pyramid.
  2. pip install 'ragna builtin » ' # Install ragna with all extensions
    ragna config # Initialize configuration
    ragna ui # Launch the web app
    2023-11-02 Tags: , , , , , , , by klotz
  3. Image Similarity Search
    Reverse Image Search
    Object Similarity Search
    Robust OCR Document Search
    Semantic Search
    Cross-modal Retrieval
    Probing Perceptual Similarity
    Comparing Model Representations
    Concept Interpolation
    Concept Space Traversal
    Image Similarity Search

Top of the page

First / Previous / Next / Last / Page 1 of 0 SemanticScuttle - klotz.me: tagged with "llm+rag+document"

About - Propulsed by SemanticScuttle