An open source, minimal, real-time web search CLI. Enter a query and get search results as JSON (title, url, published_date), sorted by recency.
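To make the output shape concrete, here is a minimal sketch of handling results with those three fields and ordering them newest first. The sample records, the ISO date format, and the `sort_by_recency` helper are assumptions for illustration, not the tool's actual output or code.

```python
import json
from datetime import datetime

# Hypothetical sample shaped like the described output: title, url, published_date.
SAMPLE = """[
  {"title": "A", "url": "https://example.com/a", "published_date": "2024-01-05"},
  {"title": "B", "url": "https://example.com/b", "published_date": "2024-03-01"},
  {"title": "C", "url": "https://example.com/c", "published_date": null}
]"""


def sort_by_recency(results):
    """Return results newest first; entries without a parseable date go last."""
    def key(item):
        raw = item.get("published_date")
        try:
            # Assumes ISO 8601 dates, all in the same (naive) form.
            return (1, datetime.fromisoformat(raw))
        except (TypeError, ValueError):
            return (0, datetime.min)
    return sorted(results, key=key, reverse=True)


if __name__ == "__main__":
    for row in sort_by_recency(json.loads(SAMPLE)):
        print(row["published_date"], row["title"], row["url"])
```

Undated entries are placed at the end rather than dropped, so nothing returned by a search is silently lost.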
An introduction to the Requests-HTML library, covering its installation, core features, basic and advanced usages, and practical application scenarios for web scraping and parsing.
This project provides an LLM Websearch Agent that uses a local SearXNG server for search, along with Python scripts and a bash script that prompt an LLM to summarize the search results.
Scraperr is a self-hosted web application for scraping data from web pages using XPath. It supports queuing URLs and managing scrape elements, and provides job management, user login, and integration with AI services.
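As a rough illustration of XPath-driven extraction in general, the sketch below pulls named fields out of each matching element using only the Python standard library. It is not Scraperr's code: the `scrape` helper, the sample page, and the field names are hypothetical, and `xml.etree.ElementTree` supports only a subset of XPath and requires well-formed markup, so real pages would typically need a fuller HTML parser.

```python
import xml.etree.ElementTree as ET

# Hypothetical, well-formed sample page.
PAGE = """<html><body>
<div class="product"><h2>Widget</h2><span class="price">9.99</span></div>
<div class="product"><h2>Gadget</h2><span class="price">19.50</span></div>
</body></html>"""


def scrape(markup, row_xpath, fields):
    """Extract one dict per element matching row_xpath.

    fields maps an output name to an XPath evaluated relative to each row.
    """
    root = ET.fromstring(markup)
    rows = []
    for node in root.findall(row_xpath):
        rows.append({name: node.findtext(xp) for name, xp in fields.items()})
    return rows


if __name__ == "__main__":
    result = scrape(
        PAGE,
        ".//div[@class='product']",
        {"name": "h2", "price": "span[@class='price']"},
    )
    print(result)
```

Keeping the row selector separate from the per-field selectors mirrors the idea of managing a set of named scrape elements per URL.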
Parsera is a new tool for web scraping that leverages large language models (LLMs) to make the process more straightforward and efficient. It focuses on minimizing token usage for faster processing and lower costs.
Parsera is a simple and fast Python library for scraping websites using Large Language Models (LLMs). It's designed to be lightweight and minimize token usage for speed and cost efficiency.
Mesop is a Python-based UI framework for rapidly building web apps such as demos and internal tools. It is easy to get started with, supports fast iteration, and is flexible and composable.