A detailed explanation of the Transformer model, a key architecture in modern deep learning for tasks like neural machine translation, focusing on components like self-attention, encoder and decoder stacks, positional encoding, and training.
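As a quick orientation to two of the components listed above, here is a minimal NumPy sketch (my own illustration, not code from the article being described) of sinusoidal positional encoding and single-head scaled dot-product self-attention, following the standard formulation from Vaswani et al. (2017); a real Transformer adds multi-head projections, masking, residual connections, and layer normalization on top of this.

```python
# Minimal sketch of two Transformer building blocks: sinusoidal positional
# encoding and single-head scaled dot-product self-attention.
import numpy as np

def positional_encoding(seq_len: int, d_model: int) -> np.ndarray:
    """Sinusoidal position signals added to token embeddings so the model can use order."""
    positions = np.arange(seq_len)[:, None]                  # (seq_len, 1)
    dims = np.arange(d_model)[None, :]                       # (1, d_model)
    angle_rates = 1.0 / np.power(10000, (2 * (dims // 2)) / d_model)
    angles = positions * angle_rates                         # (seq_len, d_model)
    pe = np.zeros((seq_len, d_model))
    pe[:, 0::2] = np.sin(angles[:, 0::2])                    # even dimensions: sine
    pe[:, 1::2] = np.cos(angles[:, 1::2])                    # odd dimensions: cosine
    return pe

def self_attention(x: np.ndarray, w_q: np.ndarray, w_k: np.ndarray, w_v: np.ndarray) -> np.ndarray:
    """Single-head attention: softmax(Q K^T / sqrt(d_k)) V."""
    q, k, v = x @ w_q, x @ w_k, x @ w_v                      # project inputs to queries/keys/values
    d_k = q.shape[-1]
    scores = q @ k.T / np.sqrt(d_k)                          # (seq_len, seq_len) similarity scores
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)           # softmax over the key dimension
    return weights @ v                                       # each position is a weighted sum of values

# Toy usage: 4 tokens, model dimension 8 (shapes and weights are illustrative only).
seq_len, d_model = 4, 8
rng = np.random.default_rng(0)
x = rng.normal(size=(seq_len, d_model)) + positional_encoding(seq_len, d_model)
w_q, w_k, w_v = (rng.normal(size=(d_model, d_model)) for _ in range(3))
print(self_attention(x, w_q, w_k, w_v).shape)                # -> (4, 8)
```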
Combined with the growing trend of multimodality, that is, models that combine language, vision, and other kinds of capabilities, we may see AI models operate more like a committee of specialized components than a monolithic block. This approach has clear conceptual similarities to ideas described by Marvin Minsky and Seymour Papert in the early days of AI, most notably the Society of Mind.