SemanticScuttle - klotz.me » Tags: transformers

Tags: transformers*

0 bookmark(s) - Sort by: Date ↓ / Title /

The attention mechanism in Large Language Models (LLMs) helps derive the meaning of a word from its context. This involves encoding words as multi-dimensional vectors, calculating query and key vectors, and using attention weights to adjust the embedding based on contextual relevance.

2025-03-07 Tags: attention, llm, machine-learning, neural networks, nlp, transformers by klotz

SmolVLM2: Bringing Video Understanding to Every Device

SmolVLM2 represents a shift in video understanding technology by introducing efficient models that can run on various devices, from phones to servers. The release includes models of three sizes (2.2B, 500M, and 256M) with Python and Swift API support. These models offer video understanding capabilities with reduced memory consumption, supported by a suite of demo applications for practical use.

2025-02-21 Tags: smolvlm2, video understanding, python, machine learning, video, transformers, mlx, vlm, llm by klotz

A Complete Introduction to Using BERT Models

This article provides a comprehensive guide on the basics of BERT (Bidirectional Encoder Representations from Transformers) models. It covers the architecture, use cases, and practical implementations, helping readers understand how to leverage BERT for natural language processing tasks.

2025-02-07 Tags: bert, natural language processing, machine learning, transformers, text classification, sentiment analysis, llm by klotz

Deep Dive into Self-Attention by Hand✍︎

Explore the intricacies of the attention mechanism responsible for fueling the transformers.

2025-02-04 Tags: transformers, self-attention, neural networks, llm, machine learning by klotz

How to Build a Text Classification Model using Hugging Face Transformers

A detailed guide on creating a text classification model with Hugging Face's transformer models, including setup, training, and evaluation steps.

2024-12-17 Tags: text classification, hugging face, transformers, machine learning, nlp by klotz

BERT — Intuitively and Exhaustively Explained

This article explains BERT, a language model designed to understand text rather than generate it. It discusses the transformer architecture BERT is based on and provides a step-by-step guide to building and training a BERT model for sentiment analysis.

2024-08-24 Tags: bert, embedding, transformers, natural language processing, sentiment analysis, machine learning by klotz

How to Fine-Tune BERT for Sentiment Analysis with Hugging Face Transformers

This tutorial covers fine-tuning BERT for sentiment analysis using Hugging Face Transformers. Learn to prepare data, set up environment, train and evaluate the model, and make predictions.

2024-06-06 Tags: bert, sentiment analysis, hugging face, transformers, natural language processing, machine learning, pytorch, data science by klotz

New Trends in LLM Architecture

Discusses the trends in Large Language Models (LLMs) architecture, including the rise of more GPU, more weights, more tokens, energy-efficient implementations, the role of LLM routers, and the need for better evaluation metrics, faster fine-tuning, and self-tuning.

2024-06-01 Tags: llm, machine learning, deep learning, transformers, self-tuning, evaluation by klotz

Understanding Long RoPE in LLMs

This article explains the Long RoPE methodology used to expand the context lengths in LLMs without significant performance degradation. It discusses the importance of context length in LLMs and the limitations of previous positional encoding methods. The article then introduces Rotational Positional Encoding (RoPE) and its limitations, and explains how Long RoPE extends RoPE to larger contexts.

2024-05-15 Tags: llm, rope, context, positional encoding, transformers, rotational positional encoding by klotz

HuggingFace Transformers Installation

python -c "from transformers import pipeline; print(pipeline('sentiment-analysis')('I love you'))"

2024-02-22 Tags: huggingface, transformers, installation, llm, python by klotz

First / Previous / Next / Last / Page 1 of 0

SemanticScuttle - klotz.me

Tags: transformers*

Linked Tags

Related Tags