SemanticScuttle - klotz.me » Tags: llm+search

Tags: llm* + search*

An open-source web crawler and minimal, real-time web search CLI. Enter a query and get search results as JSON (title, url, published_date), sorted by recency.

2025-08-28 Tags: web, crawler, scraper, search, cli, json, internet, open source, python, llm by klotz

rendergit

Flatten any GitHub repository into a single static, searchable HTML page for humans or LLMs, with syntax highlighting, markdown rendering, and a clean sidebar navigation.

2025-08-20 Tags: git, html, static, code, search, llm, markdown, syntax highlighting, karpathy, rendergit, github by klotz

local-first semantic code search engine

GitHub - kantord/SeaGOAT: local-first semantic code search engine

2025-07-20 Tags: github, kantord, seagoat, semantic, code, search, llm by klotz

llm-tools-kiwix

Turn any Kiwix ZIM archive (offline Wikipedia, Stack Exchange, DevDocs, etc.) into an instant knowledge source for LLMs with a tiny CLI + Python server exposing searchable chunks, metadata and citations.

2025-07-14 Tags: datasette, llm, kiwix, wikipedia, stack exchange, devdocs, search, python, cli, mcp by klotz

Semantic Mail

Lightweight CLI agent for semantic search and question answering over your email. It downloads your inbox, generates embeddings using local (or external) LLMs, and stores everything in a vector database on your machine. Supports incremental sync for fast updates.

2025-06-24 Tags: gmail, search, chromadb, ollama, llm, email, hallux, github, yahorbarkouski by klotz

Googler’s Deposition Offers View Of Google’s Ranking Systems

A Google engineer's testimony shows how page quality is scored and confirms the existence of a popularity signal that uses Chrome data.

2025-05-17 Tags: google, search, ranking, pagerank, llm by klotz

Answer: Can you extract and summarize a blog?

This blog post details an experiment testing the ability of LLMs (Gemini, ChatGPT, Perplexity) to accurately retrieve and summarize recent blog posts from a specific URL (searchresearch1.blogspot.com). The author found significant issues with hallucinations and inaccuracies, even in models claiming live web access, highlighting the unreliability of LLMs for even simple research tasks.

2025-04-10 Tags: llm, ai, hallucination, web access, search, gemini, chatgpt, perplexity, research, information retrieval, dan russell by klotz

A Coding Implementation to Build a Document Search Agent (DocSearchAgent) with Hugging Face, ChromaDB, and Langchain

This tutorial demonstrates how to build a powerful document search engine using Hugging Face embeddings, Chroma DB, and Langchain for semantic search capabilities.

2025-03-21 Tags: document, search, hugging face, chromadb, langchain, vector database, embedding, agents, llm by klotz

Qodo’s open code embedding model sets new enterprise standard, beating OpenAI, Salesforce

Qodo releases Qodo-Embed-1-1.5B, an open-source code embedding model that outperforms competitors from OpenAI and Salesforce, enhancing code search, retrieval, and understanding for enterprise development teams.

2025-03-04 Tags: qodo, code, embedding, llm, search, retrieval, software engineering by klotz

SearchResearch (2/6/2025): SearchResearch, Search, and Deep Research

- "Deep Research" is a new trend in AI-driven research using large language models for multi-step investigations.
- The article compares Deep Research systems, highlighting capabilities and limitations like generating tangential content and handling nonsensical queries.
- Systems covered include Gemini Advanced 1.5 Pro, OpenAI’s Deep Research, Perplexity’s Deep Research Mode, and You.com’s Research Feature.

2025-02-07 Tags: deep research, llm, search, research, dan russell by klotz
