Tags: neural network + machine learning + deep learning

  1. The article delves into how large language models (LLMs) store facts, focusing on the role of multi-layer perceptrons (MLPs) in this process. It explains the mechanics of MLPs, including matrix multiplication, bias addition, and the Rectified Linear Unit (ReLU) function, using the example of encoding the fact that Michael Jordan plays basketball. The article also discusses the concept of superposition, which allows models to store a vast number of features by utilizing nearly perpendicular directions in high-dimensional spaces.
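
    As a rough illustration of the mechanics described above, here is a minimal NumPy sketch of one MLP block (up-projection, bias addition, ReLU, down-projection); the dimensions and random weights are toy values invented here, not the article's actual model.

        import numpy as np

        def mlp_block(x, W_up, b_up, W_down, b_down):
            """One transformer MLP block: project up, apply ReLU, project back down."""
            h = np.maximum(0.0, W_up @ x + b_up)   # matrix multiply + bias + ReLU
            return W_down @ h + b_down

        rng = np.random.default_rng(0)
        d_model, d_hidden = 8, 32                  # toy sizes, chosen arbitrarily
        x = rng.normal(size=d_model)               # e.g. an embedding carrying "Michael Jordan"
        W_up = rng.normal(size=(d_hidden, d_model))
        W_down = rng.normal(size=(d_model, d_hidden))
        print(mlp_block(x, W_up, rng.normal(size=d_hidden), W_down, rng.normal(size=d_model)))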

  2. The self-attention mechanism captures interactions between words within input and output sequences. It involves computing key, query, and value vectors, followed by matrix multiplications and a softmax transformation to produce an attention matrix.
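
    A NumPy sketch of that computation, following the standard scaled dot-product formulation (the scaling by sqrt(d_k) and the weight shapes are conventional assumptions, not details taken from this article):

        import numpy as np

        def self_attention(X, W_q, W_k, W_v):
            """Scaled dot-product self-attention over a sequence X of shape (seq_len, d_model)."""
            Q, K, V = X @ W_q, X @ W_k, X @ W_v             # query, key, value vectors
            scores = Q @ K.T / np.sqrt(K.shape[-1])
            weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
            weights /= weights.sum(axis=-1, keepdims=True)  # row-wise softmax -> attention matrix
            return weights @ V

        rng = np.random.default_rng(0)
        seq_len, d_model = 4, 8
        X = rng.normal(size=(seq_len, d_model))
        Wq, Wk, Wv = (rng.normal(size=(d_model, d_model)) for _ in range(3))
        print(self_attention(X, Wq, Wk, Wv).shape)          # (4, 8)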

  3. Explore the intricacies of the attention mechanism that powers transformers.

  4. Researchers at the University of California, San Diego have developed a mathematical formula that explains how neural networks learn to detect relevant patterns in data, offering insight into the mechanisms behind neural network learning and a path toward more efficient machine learning.

  5. A detailed explanation of the Transformer model, a key architecture in modern deep learning for tasks like neural machine translation, focusing on components like self-attention, encoder and decoder stacks, positional encoding, and training.
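
    One of those components, the sinusoidal positional encoding from the original "Attention Is All You Need" paper, fits in a few lines; that the linked explanation uses exactly this variant is an assumption:

        import numpy as np

        def positional_encoding(seq_len, d_model):
            """PE[pos, 2i] = sin(pos / 10000^(2i/d_model)); PE[pos, 2i+1] = cos(same angle)."""
            pos = np.arange(seq_len)[:, None]
            i = np.arange(0, d_model, 2)[None, :]
            angles = pos / np.power(10000.0, i / d_model)
            pe = np.zeros((seq_len, d_model))
            pe[:, 0::2] = np.sin(angles)   # even dimensions
            pe[:, 1::2] = np.cos(angles)   # odd dimensions
            return pe

        print(positional_encoding(4, 8).round(2))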

  6. A detailed overview of the architecture, Python implementation, and future of autoencoders, focusing on their use for feature extraction and dimensionality reduction in unsupervised learning.
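
    The article's own implementation is not reproduced here; as a stand-in, this is a minimal Keras sketch of a dense autoencoder, with layer sizes picked arbitrarily for illustration:

        import tensorflow as tf
        from tensorflow.keras import layers, models

        def build_autoencoder(input_dim=784, latent_dim=32):
            """Encoder compresses inputs to latent_dim; decoder reconstructs them."""
            inputs = tf.keras.Input(shape=(input_dim,))
            latent = layers.Dense(latent_dim, activation="relu")(inputs)     # bottleneck
            outputs = layers.Dense(input_dim, activation="sigmoid")(latent)
            autoencoder = models.Model(inputs, outputs)
            encoder = models.Model(inputs, latent)   # reusable for feature extraction
            autoencoder.compile(optimizer="adam", loss="mse")
            return autoencoder, encoder

    Training with autoencoder.fit(X, X, ...) teaches the network to reconstruct its input; encoder.predict(X) then yields the reduced-dimension features.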

  7. Researchers have mapped the complete connectome of the fruit fly brain, detailing all 139,255 neurons and their connections. This advance offers insights into how the brain processes information.

  8. This article introduces the Bayesian Neural Field (BayesNF), a method combining deep neural networks with hierarchical Bayesian inference for scalable and flexible analysis of spatiotemporal data, such as environmental monitoring and cloud demand forecasting.

  9. "We present a systematic review of some of the popular machine learning based email spam filtering approaches."

    "Our review covers survey of the important concepts, attempts, efficiency, and the research trend in spam filtering."

  10. An illustrated and intuitive guide to the inner workings of LSTMs, an improvement on Recurrent Neural Networks (RNNs) that struggle to retain information over long sequences.
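
    A NumPy sketch of a single LSTM cell step under the standard gate formulation (forget, input, and output gates plus a candidate state); the variable names and sizes here are assumptions, not the guide's:

        import numpy as np

        def sigmoid(z):
            return 1.0 / (1.0 + np.exp(-z))

        def lstm_step(x, h_prev, c_prev, W, b):
            """One time step; W maps [h_prev; x] to the four stacked gate pre-activations."""
            z = W @ np.concatenate([h_prev, x]) + b
            f, i, o, g = np.split(z, 4)
            f, i, o = sigmoid(f), sigmoid(i), sigmoid(o)   # forget, input, output gates
            c = f * c_prev + i * np.tanh(g)                # cell state carries long-range memory
            h = o * np.tanh(c)                             # hidden state / output
            return h, c

        rng = np.random.default_rng(0)
        n_in, n_hid = 3, 5
        W = rng.normal(size=(4 * n_hid, n_hid + n_in))
        h, c = lstm_step(rng.normal(size=n_in), np.zeros(n_hid), np.zeros(n_hid), W, np.zeros(4 * n_hid))
        print(h.shape, c.shape)   # (5,) (5,)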
