Tags: text*


  1. This tutorial covers fine-tuning BERT for sentiment analysis using Hugging Face Transformers. Learn to prepare data, set up environment, train and evaluate the model, and make predictions.
  2. A research team introduces Super Tiny Language Models (STLMs) to address the resource-intensive nature of large language models, providing high performance with significantly reduced parameter counts.
  3. This article is part of a series titled ‘LLMs from Scratch’, a complete guide to understanding and building Large Language Models (LLMs). In this article, we discuss the self-attention mechanism and how it is used by transformers to create rich and context-aware transformer embeddings.

    The Self-Attention mechanism is used to add context to learned embeddings, which are vectors representing each word in the input sequence. The process involves the following steps:

    1. Learned Embeddings: These are the initial vector representations of words, learned during the training phase. The weight matrix that holds these learned embeddings forms the first layer of the Transformer architecture.

    2. Positional Encoding: This step adds positional information to the learned embeddings. Because transformers process all words in parallel, they have no inherent notion of word order; positional encodings supply that ordering information to the model.

    3. Self-Attention: The core of the Self-Attention mechanism is to update the learned embeddings with context from the surrounding words in the input sequence. This mechanism determines which words provide context to other words, and this contextual information is used to produce the final contextualized embeddings.
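The three steps above can be sketched in a few lines of NumPy. This is a minimal illustration, not the article's code: the embeddings are random stand-ins for learned ones, and the Q/K/V projection matrices are omitted (Q = K = V = x) to keep the sketch short.

```python
import numpy as np

def positional_encoding(seq_len, d_model):
    """Sinusoidal positional encodings, as in the original Transformer."""
    pos = np.arange(seq_len)[:, None]
    i = np.arange(d_model)[None, :]
    angle = pos / np.power(10000.0, (2 * (i // 2)) / d_model)
    enc = np.zeros((seq_len, d_model))
    enc[:, 0::2] = np.sin(angle[:, 0::2])  # even dimensions
    enc[:, 1::2] = np.cos(angle[:, 1::2])  # odd dimensions
    return enc

def self_attention(x):
    """Single-head scaled dot-product self-attention (projections omitted)."""
    d_k = x.shape[-1]
    scores = x @ x.T / np.sqrt(d_k)                 # word-to-word similarity
    scores -= scores.max(axis=-1, keepdims=True)    # numerical stability
    weights = np.exp(scores)
    weights /= weights.sum(axis=-1, keepdims=True)  # softmax over each row
    return weights @ x                              # context-weighted mix

rng = np.random.default_rng(0)
seq_len, d_model = 4, 8
embeddings = rng.normal(size=(seq_len, d_model))        # step 1: learned embeddings (random here)
x = embeddings + positional_encoding(seq_len, d_model)  # step 2: add positional information
contextual = self_attention(x)                          # step 3: contextualized embeddings
```

Each row of `contextual` is a mixture of all input embeddings, weighted by how relevant the attention scores judge each surrounding word to be.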
  4. In this article, we explore various aspects of BERT, including the landscape at the time of its creation, a detailed breakdown of the model architecture, and the construction of a task-agnostic fine-tuning pipeline, demonstrated with sentiment analysis. Despite being one of the earliest LLMs, BERT remains relevant today and continues to find applications in both research and industry.
  5. This article discusses Retrieval-Augmented Generation (RAG) models, a new approach that addresses the limitations of traditional models in knowledge-intensive Natural Language Processing (NLP) tasks. RAG models combine parametric memory from pre-trained seq2seq models with non-parametric memory from a dense vector index of Wikipedia, enabling dynamic knowledge access and integration.
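The retrieve-then-generate pattern the paper describes can be sketched with a toy dense index standing in for the Wikipedia vector index and a stub in place of the pre-trained seq2seq generator. All names, the hashing "encoder", and the toy documents are illustrative assumptions, not the paper's implementation.

```python
import zlib
import numpy as np

documents = [
    "Paris is the capital of France.",
    "The Transformer was introduced in 2017.",
    "RAG combines retrieval with generation.",
]

def embed(text, dim=16):
    """Stand-in for a dense encoder: deterministic word hashing."""
    vec = np.zeros(dim)
    for word in text.lower().split():
        rng = np.random.default_rng(zlib.crc32(word.strip(".,?!").encode()))
        vec += rng.normal(size=dim)
    norm = np.linalg.norm(vec)
    return vec / norm if norm else vec

index = np.stack([embed(d) for d in documents])  # non-parametric memory

def retrieve(query, k=1):
    scores = index @ embed(query)                # dense inner-product search
    return [documents[i] for i in np.argsort(scores)[::-1][:k]]

def generate(query, passages):
    # Stub for the parametric seq2seq model, which would condition
    # on the query concatenated with the retrieved passages.
    return f"Answer({query!r}, given {len(passages)} passage(s))"

passages = retrieve("What is the capital of France?")
answer = generate("What is the capital of France?", passages)
```

The two memories are visible in the sketch: the index array is the non-parametric store that can be swapped or updated without retraining, while `generate` stands in for the parametric seq2seq weights.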
  6. This article explains how to use the Sentence Transformers library to finetune and train embedding models for a variety of applications, such as retrieval augmented generation, semantic search, and semantic textual similarity. It covers the training components, dataset format, loss function, training arguments, evaluators, and trainer.
  7. "The paper introduces a technique called LoReFT (Low-rank Linear Subspace ReFT). Similar to LoRA (Low Rank Adaptation), it uses low-rank approximations to intervene on hidden representations. It shows that linear subspaces contain rich semantics that can be manipulated to steer model behaviors."
  8. A GitHub repository containing a library for Emacs Lisp that allows you to put text data into boxes and align them horizontally, applying margins, padding, and borders. The library also supports wrapping any text you want into a box, preserving its properties. The API provides functions for inserting boxes, rows, and columns, as well as retrieving box and column information.
    2024-05-25 by klotz
  9. Learn about function calling in Large Language Models (LLMs), along with a list of commercial and open-source LLMs suitable for function calling.
    2024-05-21 by klotz
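The basic function-calling loop can be sketched as follows: the application registers a tool schema, the model (stubbed here with a hard-coded response) returns a function name plus JSON-encoded arguments, and the application dispatches the call. The schema shape follows the common OpenAI-style convention; all names are illustrative.

```python
import json

def get_weather(city: str) -> str:
    """A tool the model may request; returns canned data for the sketch."""
    return f"Sunny in {city}"

TOOLS = {"get_weather": get_weather}

# Schema advertised to the model so it knows the tool's name and arguments.
tool_schema = {
    "name": "get_weather",
    "description": "Get the current weather for a city",
    "parameters": {
        "type": "object",
        "properties": {"city": {"type": "string"}},
        "required": ["city"],
    },
}

# Stubbed model output: real LLM APIs return something of this shape
# when the model decides a tool call is needed.
model_response = {"name": "get_weather",
                  "arguments": json.dumps({"city": "Lisbon"})}

def dispatch(response):
    """Look up the requested tool and invoke it with the decoded arguments."""
    func = TOOLS[response["name"]]
    kwargs = json.loads(response["arguments"])
    return func(**kwargs)

result = dispatch(model_response)
```

In a full loop, `result` would be sent back to the model as a tool message so it can compose the final answer.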
  10. A surprising experiment to show that the devil is in the details.

SemanticScuttle - klotz.me: tagged with "text"
