A deep dive into LLM inference, covering tokenization, the transformer architecture, KV caching, and optimization techniques for efficient text generation.
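As a quick illustration of the KV-caching idea the article covers: during autoregressive decoding, each step projects only the newest token and appends its key/value rows to a cache, so keys and values for the prefix are never recomputed. Below is a minimal single-head sketch in NumPy; all names and shapes are illustrative, not the article's code.

```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

class KVCache:
    """Grows K and V by one row per decode step, so past tokens
    are never re-projected (illustrative, single attention head)."""
    def __init__(self, d_model):
        self.K = np.empty((0, d_model))
        self.V = np.empty((0, d_model))

    def append(self, k, v):
        self.K = np.vstack([self.K, k])
        self.V = np.vstack([self.V, v])

def decode_step(x, Wq, Wk, Wv, cache):
    # Project only the newest token; the cache covers the prefix.
    q, k, v = x @ Wq, x @ Wk, x @ Wv
    cache.append(k, v)
    scores = q @ cache.K.T / np.sqrt(x.shape[-1])
    return softmax(scores) @ cache.V  # attention output for this step

rng = np.random.default_rng(0)
d = 16
Wq, Wk, Wv = (rng.normal(size=(d, d)) for _ in range(3))
cache = KVCache(d)
for _ in range(5):                  # five decode steps
    x = rng.normal(size=(1, d))     # embedding of the new token
    out = decode_step(x, Wq, Wk, Wv, cache)
print(cache.K.shape)  # (5, 16): one cached key per generated token
```

The cache trades memory for compute: attention cost per step grows linearly with sequence length instead of re-running the whole prefix quadratically.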
An exploration of simple circuit models that illustrate how superposition arises in transformers, introducing toy examples and analyzing their behavior.
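For flavor, here is a minimal sketch of the kind of toy model this line of work studies (an assumed setup, not the article's exact code): n sparse features are squeezed through an m-dimensional bottleneck with m < n and reconstructed as ReLU(WᵀWx + b). Under sufficient sparsity, the trained W packs more than m features into the hidden space, which is the superposition phenomenon.

```python
import torch

# Toy bottleneck model: n sparse features, hidden width m < n.
n_features, m_hidden, sparsity = 20, 5, 0.05
W = torch.nn.Parameter(torch.randn(m_hidden, n_features) * 0.1)
b = torch.nn.Parameter(torch.zeros(n_features))
opt = torch.optim.Adam([W, b], lr=1e-2)

for step in range(2000):
    # Each feature is active (uniform in [0, 1]) with low probability.
    active = (torch.rand(256, n_features) < sparsity).float()
    x = active * torch.rand(256, n_features)
    x_hat = torch.relu(x @ W.T @ W + b)  # reconstruct through bottleneck
    loss = ((x - x_hat) ** 2).mean()
    opt.zero_grad()
    loss.backward()
    opt.step()

# Columns of W with large norm are "represented" features; off-diagonal
# structure in W^T W shows features sharing directions, i.e. superposition.
print((W.norm(dim=0) > 0.5).sum().item(), "features represented in", m_hidden, "dims")
```

With low enough sparsity, the count printed at the end typically exceeds the hidden dimension, which is the core observation the toy models are built to exhibit.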
The core mechanics of deep learning, and how to think the PyTorch way. This guide provides a whirlwind tour of PyTorch’s methodologies and design principles, covering tensors, automatic differentiation, and training custom neural networks.
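A minimal sketch of the two ideas at the guide's core, tensors with automatic differentiation and a hand-written training loop (illustrative only, not code from the guide):

```python
import torch
from torch import nn

# Autograd: PyTorch records operations on tensors that require
# gradients and differentiates them on .backward().
x = torch.tensor([2.0], requires_grad=True)
y = x ** 3 + 2 * x
y.backward()
print(x.grad)  # dy/dx = 3x^2 + 2 = 14 at x = 2

# A custom network is a Module; training is the usual
# forward / backward / step loop.
model = nn.Sequential(nn.Linear(4, 16), nn.ReLU(), nn.Linear(16, 1))
opt = torch.optim.SGD(model.parameters(), lr=0.1)
loss_fn = nn.MSELoss()

inputs, targets = torch.randn(64, 4), torch.randn(64, 1)
for epoch in range(100):
    opt.zero_grad()
    loss = loss_fn(model(inputs), targets)
    loss.backward()   # autograd populates .grad on every parameter
    opt.step()        # optimizer updates parameters in place
```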
A unified memory stack that functions as a memristor as well as a ferroelectric capacitor is reported, enabling both energy-efficient inference and learning at the edge.
DeepMind introduces Ithaca, a deep neural network that can restore damaged ancient Greek inscriptions, identify their original location, and help establish their creation date; it was developed in collaboration with historians to advance the understanding of ancient history.
This article discusses the history of AI, the split between neural networks and symbolic AI, and the recent vindication of neurosymbolic AI through advances such as o3 and Grok 4. It argues that combining the strengths of both approaches is crucial for achieving true AI and highlights the resistance to neurosymbolic AI from some leaders in the deep learning field.
This tutorial introduces the essential topics of the PyTorch deep learning library in about one hour. It covers tensors, training neural networks, and training models on multiple GPUs.
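On the multi-GPU point, here is a hedged sketch using torch.nn.DataParallel, the simplest batch-splitting wrapper (the tutorial may well use DistributedDataParallel, the recommended and faster approach; this is only an illustration):

```python
import torch
from torch import nn

# DataParallel replicates the model on each visible GPU and splits
# every batch across them; gradients are gathered back automatically.
model = nn.Sequential(nn.Linear(32, 64), nn.ReLU(), nn.Linear(64, 2))
if torch.cuda.device_count() > 1:
    model = nn.DataParallel(model)   # one replica per GPU
device = "cuda" if torch.cuda.is_available() else "cpu"
model.to(device)

batch = torch.randn(128, 32, device=device)
logits = model(batch)   # each GPU processes a slice of the batch
print(logits.shape)     # torch.Size([128, 2])
```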
This book covers foundational topics in computer vision from an image processing and machine learning perspective. It aims to build the reader’s intuition through visualizations and is intended for undergraduate and graduate students, as well as experienced practitioners.
Newsweek interview with Yann LeCun, Meta's chief AI scientist, detailing his skepticism of current LLMs and his focus on Joint Embedding Predictive Architecture (JEPA) as the future of AI, emphasizing world modeling and planning capabilities.
AlexNet, a groundbreaking neural network developed in 2012 by Alex Krizhevsky, Ilya Sutskever, and Geoffrey Hinton, has been released in source code form by the Computer History Museum in collaboration with Google. This model significantly advanced the field of AI by demonstrating a massive leap in image recognition capabilities.