SemanticScuttle - klotz.me » Tags: nlp+text+deep learning+machine learning

Tags: nlp* + text* + deep learning* + machine learning*

0 bookmark(s) - Sort by: Date ↓ / Title /

BEAL: A Bayesian Deep Active Learning Method for Efficient Deep Multi-Label Text Classification

BEAL is a deep active learning method that uses Bayesian deep learning with dropout to infer the model’s posterior predictive distribution and introduces an expected confidence-based acquisition function to select uncertain samples. Experiments show that BEAL outperforms other active learning methods, requiring fewer labeled samples for efficient training.

2024-11-18 Tags: beal, bayesian, deep learning, active learning, multi-label, text, classification, bert, machine learning by klotz

Contextual Transformer Embeddings Using Self-Attention Explained with Diagrams and Python Code

This article is part of a series titled ‘LLMs from Scratch’, a complete guide to understanding and building Large Language Models (LLMs). In this article, we discuss the self-attention mechanism and how it is used by transformers to create rich and context-aware transformer embeddings.

The Self-Attention mechanism is used to add context to learned embeddings, which are vectors representing each word in the input sequence. The process involves the following steps:

1. Learned Embeddings: These are the initial vector representations of words, learned during the training phase. The weights matrix, storing the learned embeddings, is stored in the first linear layer of the Transformer architecture.

2. Positional Encoding: This step adds positional information to the learned embeddings. Positional information helps the model understand the order of the words in the input sequence, as transformers process all words in parallel, and without this information, they would lose the order of the words.

3. Self-Attention: The core of the Self-Attention mechanism is to update the learned embeddings with context from the surrounding words in the input sequence. This mechanism determines which words provide context to other words, and this contextual information is used to produce the final contextualized embeddings.

2024-06-01 Tags: transformer, attention, self-attention, embeddings, nlp, deep learning, llm, machine learning by klotz

Towards Generative AI for Model Architecture

With deep learning, the ROI for having clean and high quality data is immense, and this is realized in every phase of training. For context, the era right before BERT in the text classification world was one where you wanted an abundance of data, even at the expense of quality. It was more important to have representation via examples than for the examples to be perfect. This is because many Al systems did not use pre-trained embeddings (or they weren't any good, anyway) that could be leveraged by a model to apply practical generalizability. In 2018, BERT was a breakthrough for down-stream text tasks,

2023-11-11 Tags: deep learning, llm, generative, embeddings, bert by klotz

Dense vector embeddings

2022-12-24 Tags: embedding, bert, word2vec, deep learning by klotz

Word embeddings | Text | TensorFlow

2022-11-10 Tags: embedding, doc2vec, tensorflow, word2vec, classification, neural network by klotz

Part E: Text Classification with an Embedding Layer in a Feed-Forward Network - Deep Learning Tutorials with Keras - Medium

2022-11-10 Tags: classification, doc2vec, embedding, neural network by klotz

A formal mathematical model of transformer architecture

2022-07-30 Tags: large language models, gpt, bert, nlp, deep learning by klotz

openai codex exploration and review of the platform api

2022-07-11 Tags: codex, openai, nlp, deep learning, gpt-3 by klotz

Feature Extraction with BERT for Text Classification | by Marcello Politi | Jun, 2022 | Towards Data Science