Tags: gpt* + llm*

0 bookmark(s) - Sort by: Date ↓ / Title /

  1. The article delves into how large language models (LLMs) store facts, focusing on the role of multi-layer perceptrons (MLPs) in this process. It explains the mechanics of MLPs, including matrix multiplication, bias addition, and the Rectified Linear Unit (ReLU) function, using the example of encoding the fact that Michael Jordan plays basketball. The article also discusses the concept of superposition, which allows models to store a vast number of features by utilizing nearly perpendicular directions in high-dimensional spaces.

  2. An explanation of the differences between encoder- and decoder-style large language model (LLM) architectures, including their roles in tasks such as classification, text generation, and translation.

    2024-12-28 Tags: , , , , , , , , , by klotz
  3. A Github Gist containing a Python script for text classification using the TxTail API

  4. This article provides a beginner-friendly introduction to Large Language Models (LLMs) and explains the key concepts in a clear and organized way.

    2024-05-10 Tags: , , , , , by klotz
  5. Exploring the architecture of OpenAI’s Generative Pre-trained Transformers.

    2023-12-10 Tags: , , by klotz
  6. 2023-10-31 Tags: , , , , , by klotz
  7. training GPT to query with few-shot prompting.

    2023-10-05 Tags: , , , , , , by klotz
  8. 2023-08-14 Tags: , , , , by klotz

Top of the page

First / Previous / Next / Last / Page 1 of 0 SemanticScuttle - klotz.me: tagged with "gpt+llm"

About - Propulsed by SemanticScuttle