Gemma Scope is a research tool for interpreting the inner workings of the Gemma 2 generative AI models, letting researchers examine what happens in individual model layers as a request is processed.
This paper explores the structure of the feature point cloud discovered by sparse autoencoders in large language models, investigating it at three scales: atomic, brain, and galaxy. At the atomic scale, features form crystal structures built from parallelograms or trapezoids, and the quality of these crystals improves when distractor dimensions are projected out. At the brain scale, the point cloud shows modular structure, with functionally related features clustering into regions analogous to neural lobes. At the galaxy scale, the paper examines the overall shape and clustering of the point cloud.
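To make the atomic-scale idea concrete, here is a minimal sketch of a parallelogram test on toy feature vectors. All vectors and the `project_out` helper are hypothetical illustrations (not from Gemma Scope or the paper): two meaningful directions plus one "distractor" dimension that, once projected out, tightens the parallelogram formed by an analogy like (man, woman, king, queen).

```python
import numpy as np

# Hypothetical toy feature directions: "gender" and "royalty" are the
# meaningful axes; "length" plays the role of a distractor dimension.
gender  = np.array([1.0, 0.0, 0.0])
royalty = np.array([0.0, 1.0, 0.0])
length  = np.array([0.0, 0.0, 1.0])  # distractor

# Toy feature points contaminated by the distractor (values invented).
man   = 0.0 * gender + 0.0 * royalty + 3.0 * length
woman = 1.0 * gender + 0.0 * royalty + 5.0 * length
king  = 0.0 * gender + 1.0 * royalty + 4.0 * length
queen = 1.0 * gender + 1.0 * royalty + 5.0 * length

# Parallelogram test: in a perfect crystal, woman - man == queen - king.
raw_gap = np.linalg.norm((woman - man) - (queen - king))

def project_out(v, d):
    """Remove the component of v along direction d (hypothetical helper)."""
    d = d / np.linalg.norm(d)
    return v - (v @ d) * d

# Re-test after projecting out the distractor dimension.
m, w, k, q = (project_out(v, length) for v in (man, woman, king, queen))
clean_gap = np.linalg.norm((w - m) - (q - k))

print(raw_gap, clean_gap)  # the gap shrinks once the distractor is removed
```

In this toy setup the distractor alone breaks the parallelogram; projecting it out restores the analogy exactly, mirroring the paper's observation that crystal structures sharpen after removing distractor dimensions.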
DeepMind's Gemma Scope gives researchers a collection of sparse autoencoders for studying how the Gemma 2 language models work. By exposing the models' inner workings, it supports research into problems such as hallucinations and potential manipulation.