This tutorial demonstrates how to combine LLM embeddings, TF-IDF vectors, and metadata features into a single Scikit-learn pipeline for document retrieval and search. It covers generating embeddings with Sentence Transformers, calculating TF-IDF, handling metadata, and building a combined retrieval system.
This article discusses how AI tools can be used to enhance the reading experience by providing instant access to information and background details, similar to using a dictionary or Wikipedia, but with the ability to ask more complex questions. The author shares personal examples of using AI while reading 'The Dark Forest' and other books to clarify plot points and gain a better understanding of the material.
"Yahoo Scout looks like a more web-friendly take on AI search. It’s somewhere between 10 blue links and a full-blown AI assistant, and so far, it feels like the right mix."
App Finder is an independent search engine that indexes the Google Play Store, offering advanced filtering options to locate niche apps that are often buried by the Play Store's algorithm. It allows users to filter by permissions, keywords, features, ratings, update dates, and more, providing a more precise search experience.
Search the Computer History Museum's collections, including archives, physical objects, and oral histories.
MCP-native command line interface for Z.AI capabilities: vision analysis, web search, web reader, and GitHub repo exploration.
A review of the SearchResearch blog's 2025 posts, highlighting a shift towards AI-augmented research methods, testing AI tools, and emphasizing the importance of verification and critical thinking in online research.
This paper reports on an experiment to build a domain-aware Japanese text-embedding approach to improve the quality of search at Mercari, Japan's largest C2C marketplace.
This article details a method for finding books on your shelves using Gemini's text recognition capabilities. The author describes how taking pictures of bookshelves and using AI to scan them can help locate lost books and even provide insights into reading habits.
A new protocol is emerging to give site owners control over how AI companies use their content, potentially integrated into robots.txt. The IETF AI Preferences Working Group is defining standardized rules for AI access and usage.
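For context on the last item, here is a minimal sketch of how robots.txt already lets a site owner treat one crawler differently from the rest. It uses only Python's standard `urllib.robotparser` and does not implement the AI Preferences vocabulary, whose syntax the summary above does not describe. The user-agent name `ExampleAIBot`, the `example.com` URLs, and the helper `can_fetch` are hypothetical and chosen purely for illustration.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: block one (made-up) AI crawler entirely,
# and keep every other crawler out of /private/ only.
ROBOTS_TXT = """\
User-agent: ExampleAIBot
Disallow: /

User-agent: *
Disallow: /private/
"""


def can_fetch(user_agent: str, url: str, robots_txt: str = ROBOTS_TXT) -> bool:
    """Return True if robots_txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() accepts an iterable of lines, so no network access is needed.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# The hypothetical AI crawler is denied everywhere.
ai_allowed = can_fetch("ExampleAIBot", "https://example.com/articles/1")
# Any other crawler falls through to the wildcard rules.
other_allowed = can_fetch("OtherBot", "https://example.com/articles/1")
other_private = can_fetch("OtherBot", "https://example.com/private/x")

print(ai_allowed, other_allowed, other_private)
```

Rules of this kind only say whether a crawler may fetch a URL. Expressing how fetched content may then be used is the gap the working group's standardized rules are meant to address.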