The M.2 Max is an AI inference acceleration card powered by the Metis AIPU, designed to enable Large Language Models (LLMs) and Vision Language Models (VLMs) on power-constrained edge and embedded devices. It offers high memory performance in a small footprint and supports complex computer vision tasks using parallel or cascaded models.
Key features include:
- Memory capacities up to 16 GB with various cooling options.
- Support for standard and extended operating temperature ranges.
- Hardware Root-of-Trust for secure boot and firmware integrity.
- Integration via the Voyager SDK and advanced quantization tools.
- Compatibility with PCIe Gen. 3.0 x4, Intel, AMD, and Arm64 processors across Linux and Windows environments.
A technical guide to running lightweight OCR models (LightOnOCR, GLM-OCR, Deepseek-OCR) on low-end hardware using llama.cpp. Includes implementation details for CLI, REST APIs, and performance optimization.
Topics Covered:
- llama.cpp OCR integration
- Low-spec hardware optimization
- CLI & REST API setup
- Quantization & Prompting
- Hallucination mitigation
This is an open, unconventional textbook covering mathematics, computing, and artificial intelligence from foundational principles. It's designed for practitioners seeking a deep understanding, moving beyond exam preparation and focusing on real-world application. The author, drawing from years of experience in AI/ML, has compiled notes that prioritize intuition, context, and clear explanations, avoiding dense notation and outdated material.
The compendium covers a broad range of topics, from vectors and matrices to machine learning, computer vision, and multimodal learning, with future chapters planned for areas like data structures and AI inference.
Sipeed’s MaixCAM2 is a powerful, open-source AI camera designed for makers, offering significant performance improvements over Raspberry Pi and OpenMV solutions. It features the Axera Tech AX630 AI SoC with up to 12.8 TOPS and supports training-free vision models and vision-language models.
Introduction to the OSOYOO V4.0 Robot Car for Raspberry Pi, highlighting its advanced features and capabilities for complex robotic projects compared to Arduino-based kits.
Moondream transforms the humble Raspberry Pi into a context-aware visual interpreter, capable of answering nuanced questions about images in plain English. This guide explores its potential for home automation, security analysis, and more.
This book covers foundational topics within computer vision, with an image processing and machine learning perspective. It aims to build the reader’s intuition through visualizations and is intended for undergraduate and graduate students, as well as experienced practitioners.
Creativity and a Jetson Orin Nano Super can help hobbyists build accessible robots that can reason and interact with the world. The article discusses building a robot using accessible hardware like Arduino and Raspberry Pi, eventually upgrading to more capable hardware like the Jetson Orin Nano Super to run a large language model (LLM) onboard.
Learn how to use Python and OpenCV to perform face detection and recognition. This tutorial also covers concepts like bounding boxes, intersection over union (IoU), and grayscale conversion.
• A beginner's guide to understanding Hugging Face Transformers, a library that provides access to thousands of pre-trained transformer models for natural language processing, computer vision, and more.
• The guide covers the basics of Hugging Face Transformers, including what it is, how it works, and how to use it with a simple example of running Microsoft's Phi-2 LLM in a notebook
• The guide is designed for non-technical individuals who want to understand open-source machine learning without prior knowledge of Python or machine learning.