discrawl mirrors Discord guild data into a local SQLite database so you can search, inspect, and query server history independently of Discord. It is a bot-token crawler (no user-token hacks) and keeps your data local. It discovers accessible guilds; syncs channels, threads, members, and message history; maintains FTS5 indexes for fast text search (including small attachments); records mentions; and tails Gateway events for live updates, with repair syncs. It provides read-only SQL access for analysis and supports multi-guild schemas with a simple single-guild default. Search defaults to all guilds, while sync and tail default to the configured default guild, or fan out to all discovered guilds if none is set.
This post details how to restore a GitHub repository from the Software Heritage archive, using the now-unavailable 'sqlite-s3vfs' repository as the example.
The author details their transition from Pocket to Karakeep, a self-hosted, open-source alternative for saving and reading articles later. They discuss the benefits of owning your data and the features of Karakeep, including RSS integration and AI-powered tagging.
A discussion of YaCy, a personal search engine used as a bookmarking service, and alternatives such as Pinboard, Historio, and Zotero.