MinerU is a tool that converts PDFs into machine-readable formats (e.g., markdown, JSON), allowing for easy extraction into any format.
Ship measurable improvements in your GenAI systems with Opik, your open-source LLM observability and agent optimization platform. Trusted by over 150,000 developers and thousands of companies.
NVIDIA AI releases Nemotron-Elastic-12B, a 12B parameter reasoning model that embeds nested 9B and 6B variants in the same parameter space, allowing for multiple model sizes from a single training job.
LLM Council works together to answer your hardest questions. A local web app that uses OpenRouter to send queries to multiple LLMs, have them review/rank each other's work, and finally a Chairman LLM produces the final response.
Trail of Bits announces the open-sourcing of Buttercup, their AI-driven Cyber Reasoning System (CRS) developed for DARPA’s AI Cyber Challenge (AIxCC). The article details how Buttercup works, including its four main components (Orchestration/UI, Vulnerability discovery, Contextual analysis, and Patch generation), provides instructions for getting started, and outlines future development plans.
A detailed blog post discussing OpenAI's newly released open-weight GPT models, including performance benchmarks, initial testing on various hardware (Mac laptops, Cerebras), and comparisons to other open-source models. It covers aspects like reasoning capabilities, tool calling, and the new OpenAI Harmony prompt format.
This article details 10 open-source AI tools for developers, covering their benefits, features, and use cases. It emphasizes transparency, offline capabilities, and community support as key advantages of open-source AI.
| **Tool Name** | **Description** | **Key Features** | **What I Like About It** |
|---|---|---|---|
| **Talkd.ai** | Prototyping AI Agents | No-code, JSON/YAML config, API integration | Fast prototyping, no backend needed |
| **Marimo** | Python Notebooks for Apps | Reactive cells, version control, UI widgets | Stable, shareable, version-controlled apps |
| **Unsloth AI** | LLM Fine-Tuning | Memory-optimized training, supports Llama 3 | Accessible fine-tuning on modest hardware |
| **HackingBuddyGPT** | AI for Ethical Hacking | Offline operation, recon tools, payload generation | Offline security, privacy |
| **Giskard** | AI Testing & Debugging | Test case creation, continuous monitoring | Engineering discipline for AI quality |
| **OpenWebUI** | Self-Hosted ChatGPT UI | Local LLMs, plugin support, persistent memory | Privacy, local control |
| **Axolotl** | LLM Fine-Tuning | YAML config, supports QLORA/PEFT/LORA | Simplified fine-tuning, reproducibility |
| **FastRAG** | RAG Pipeline | Local operation, fast query times | Quick, lightweight RAG setup |
| **Nav2** | Robot Navigation Framework | Real-time obstacle detection, multi-robot coordination | Flexible, modern ROS 2 integration |
| **MindsDB** | Machine Learning in Database | SQL-based training/inference, supports various DBs | Easy integration with existing SQL workflows |
This GitHub repository contains a collection of example files demonstrating various use cases and configurations for the llamafiles tools, including examples:
* **System Administration:** Scripts and configurations for Ubuntu, Raspberry Pi 5, and macOS.
* **LLM Interaction:** Examples of prompts and interactions with LLMs like Mixtral and Dolphin.
* **Text Processing:** Scripts for summarizing text, extracting information, and formatting output.
* **Development Tools:** Examples related to Git, Emacs, and other development tools.
* **Hardware Monitoring:** Scripts for monitoring GPU and NVMe drive status.
IBM announces Granite 3.3, featuring a new speech-to-text model (Granite Speech 3.3 8B), enhanced reasoning capabilities in Granite 3.3 8B Instruct, and RAG-focused LoRA adapters for Granite 3.2. The release also includes activated LoRAs (aLoRAs) for improved efficiency and all models are open source.
A list of 11 open source AI projects designed to help developers streamline their work, from training models to improving productivity and data management.
| Project Name | Description |
|----------------------|-----------------------------------------------------------------------------|
| Upscayl | Increases image resolution for enhanced detail, ideal for digital artwork. |
| Nyro | Automates mundane tasks like taking screenshots and resizing windows. |
| Geppetto | Enhances Slack documentation with help from LLMs and can request art from Dall-E. |
| E2B sandboxes | Allows LLMs to use web browsers, GitHub, and command-line tools for tasks like cloud management. |
| Dataline | Generates SQL commands to extract data and create data science reports locally. |
| Swirl Connect | Links standard databases with LLMs and RAG search indices for easier data access. |
| DSPy | Offers a systematic approach to LLM training by connecting modules and optimizers. |
| Guardrails | Integrates controls into generative AI pipelines to refine AI-generated answers and reduce errors. |
| Unsloth | Optimizes training of open-source models for faster and more accurate results. |
| Wren AI for SQL | Translates natural language questions into SQL queries, simplifying data retrieval. |
| AnythingLLM | Organizes digital documents and allows querying with any LLM or RAG system. |