SemanticScuttle - klotz.me » Tags: hugging face

Tags: hugging face*

0 bookmark(s) - Sort by: Date ↓ / Title /

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

Google releases Gemma 3, a new iteration of their Gemma family of models. It ranges from 1B to 27B parameters, supports up to 128k tokens, accepts images and text, and supports 140+ languages. This article details its technical enhancements (longer context, multimodality, multilinguality) and provides information on inference with Hugging Face transformers, on-device deployment, and evaluation.

2025-04-03 Tags: gemma 3, llm, hugging face, llama.cpp, google by klotz

Primer LLM Embedding

This Space demonstrates a simple method for embedding text using a LLM (Large Language Model) via the Hugging Face Inference API. It showcases how to convert text into numerical vector representations, useful for semantic search and similarity comparisons.

2025-03-28 Tags: llm, embedding, hugging face, inference, api, semantic search, vector representation, text embedding by klotz

A Coding Implementation to Build a Document Search Agent (DocSearchAgent) with Hugging Face, ChromaDB, and Langchain

This tutorial demonstrates how to build a powerful document search engine using Hugging Face embeddings, Chroma DB, and Langchain for semantic search capabilities.

2025-03-21 Tags: document, search, hugging face, chromadb, langchain, vector database, embedding, agents, llm by klotz

LinkedIn cofounder Reid Hoffman, Hugging Face CEO Clement Delangue sign open letter calling for AI ‘public goods’

Reid Hoffman and Clement Delangue are among the signatories of a new open letter calling for the creation of public data sets and incentives to develop 'small' AI models. The letter aims to encourage collaboration among governments, tech companies, and civil society groups to harness the benefits of AI while mitigating its risks.

2025-02-10 Tags: ai, public goods, reid hoffman, clement delangue, linkedin, hugging face, aol by klotz

Qwen2.5-VL-3B-Instruct

Qwen2.5-VL-3B-Instruct is the latest addition to the Qwen family of vision-language models by Hugging Face, featuring enhanced capabilities in understanding visual content and generating structured outputs. It is designed to directly interact with tools and use computer and phone functions as a visual agent. Qwen2.5-VL can comprehend videos up to an hour long and localize objects within images using bounding boxes or points. It is available in three sizes: 3, 7, and 72 billion parameters.

2025-02-08 Tags: qwen2.5-vl, vlm, hugging face, image, video, llm, qwen by klotz

Hugging Face Clones OpenAI’s Deep Research in 24 Hours

Hugging Face researchers developed an open-source AI research agent called 'Open Deep Research' in 24 hours, aiming to match OpenAI's Deep Research. The project demonstrates the potential of agent frameworks to enhance AI model capabilities, achieving 55.15% accuracy on the GAIA benchmark. The initiative highlights the rapid development and collaborative nature of open-source AI projects.

2025-02-06 Tags: hugging face, openai, deep research, agent, benchmark, machine learning, llm by klotz

Open-R1: a fully open reproduction of DeepSeek-R1

Hugging Face's initiative to replicate DeepSeek-R1, focusing on developing datasets and sharing training pipelines for reasoning models.

The article introduces Hugging Face's Open-R1 project, a community-driven initiative to reconstruct and expand upon DeepSeek-R1, a cutting-edge reasoning language model. DeepSeek-R1, which emerged as a significant breakthrough, utilizes pure reinforcement learning to enhance a base model's reasoning capabilities without human supervision. However, DeepSeek did not release the datasets, training code, or detailed hyperparameters used to create the model, leaving key aspects of its development opaque.

The Open-R1 project aims to address these gaps by systematically replicating and improving upon DeepSeek-R1's methodology. The initiative involves three main steps:

Replicating the Reasoning Dataset: Creating a reasoning dataset by distilling knowledge from DeepSeek-R1.
Reconstructing the Reinforcement Learning Pipeline: Developing a pure RL pipeline, including large-scale datasets for math, reasoning, and coding.
Demonstrating Multi-Stage Training: Showing how to transition from a base model to supervised fine-tuning (SFT) and then to RL, providing a comprehensive training framework.

2025-01-28 Tags: open-r1, deepseek-r1, hugging face, reinforcement learning, llm, open source by klotz

Qwen2.5-1M: Deploy Your Own Qwen with Context Length up to 1M Tokens

Alibaba's Qwen 2.5 LLM now supports input token limits up to 1 million using Dual Chunk Attention. Two models are released on Hugging Face, requiring significant VRAM for full capacity. Challenges in deployment with quantized GGUF versions and system resource constraints are discussed.

2025-01-28 Tags: qwen2.5-1m, alibaba, hugging face, gguf, llm, simon willison by klotz

Introducing smolagents, a simple library to build agents

smolagents is a simple library that enables agentic capabilities for language models, allowing them to interact with external tools and perform tasks based on real-world data.

2024-12-31 Tags: smolagents, agents, llm, code, hugging face by klotz

Hugging Face Just Released SmolAgents: A Smol Library that Enables to Run Powerful AI Agents in a Few Lines of Code

Hugging Face's SmolAgents simplifies the creation of intelligent agents by allowing developers to build them with just a few lines of code using powerful pretrained models.

2024-12-31 Tags: hugging face, smolagents, agents, llm by klotz

First / Previous / Next / Last / Page 1 of 0

SemanticScuttle - klotz.me

Tags: hugging face*

Linked Tags

Related Tags