discrawl mirrors Discord guild data into a local SQLite database, allowing you to search, inspect, and query server history independently of Discord. It’s a bot-token crawler – no user-token hacks – and keeps your data local. It discovers accessible guilds, syncs channels, threads, members, and message history, maintains FTS5 search indexes for fast text search (including small attachments), records mentions, and tails Gateway events for live updates with repair syncs. It provides read-only SQL access for analysis and supports multi-guild schemas with a simple single-guild default. Search defaults to all guilds, while sync and tail default to a configured default guild or fan out to all discovered guilds if none is set.
This tutorial demonstrates how to combine LLM embeddings, TF-IDF vectors, and metadata features into a single Scikit-learn pipeline for document retrieval and search. It covers generating embeddings with Sentence Transformers, calculating TF-IDF, handling metadata, and building a combined retrieval system.
This article discusses how AI tools can be used to enhance the reading experience by providing instant access to information and background details, similar to using a dictionary or Wikipedia, but with the ability to ask more complex questions. The author shares personal examples of using AI while reading 'The Dark Forest' and other books to clarify plot points and gain a better understanding of the material.
"Yahoo Scout looks like a more web-friendly take on AI searchIt’s somewhere between 10 blue links and a full-blown AI assistant, and so far, it feels like the right mix."
App Finder is an independent search engine that indexes the Google Play Store, offering advanced filtering options to locate niche apps that are often buried by the Play Store's algorithm. It allows users to filter by permissions, keywords, features, ratings, update dates, and more, providing a more precise search experience.
Search the Computer History Museum's collections, including archives, physical objects, and oral histories.
MCP-native command line interface for Z.AI capabilities: vision analysis, web search, web reader, and GitHub repo exploration.
A review of the SearchResearch blog's 2025 posts, highlighting a shift towards AI-augmented research methods, testing AI tools, and emphasizing the importance of verification and critical thinking in online research.
This paper reports on an experiment to build a domain-aware Japanese text-embedding approach to improve the quality of search at Mercari, Japan's largest C2C marketplace.
This article details a method for finding books on your shelves using Gemini's text recognition capabilities. The author describes how taking pictures of bookshelves and using AI to scan them can help locate lost books and even provide insights into reading habits.