Steerling-8B is an interpretable causal diffusion language model that pairs masked diffusion language modeling with concept decomposition, supporting generation, attribution, steering, and extraction of hidden representations. Its features include block-causal attention and the decomposition of hidden states into known and unknown concepts.
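The concept-decomposition idea can be illustrated with a minimal sketch: project a hidden state onto a dictionary of known concept directions and treat the residual as the "unknown" component. This is not the actual Steerling-8B API; the function and variable names here are hypothetical, and the concept directions are random stand-ins.

```python
import numpy as np

def decompose(hidden, concept_dirs):
    """Split a hidden state (d,) into known-concept coefficients and a residual.

    concept_dirs: (k, d) matrix whose rows are unit-norm concept directions.
    Returns (coeffs, residual) such that
    hidden == concept_dirs.T @ coeffs + residual (up to float error).
    """
    # Least-squares projection onto the span of the concept directions.
    coeffs, *_ = np.linalg.lstsq(concept_dirs.T, hidden, rcond=None)
    residual = hidden - concept_dirs.T @ coeffs  # the "unknown" part
    return coeffs, residual

rng = np.random.default_rng(0)
d, k = 16, 4
dirs = rng.normal(size=(k, d))
dirs /= np.linalg.norm(dirs, axis=1, keepdims=True)  # unit-norm rows
h = rng.normal(size=d)

c, r = decompose(h, dirs)
# The known and unknown parts reconstruct the hidden state,
# and the residual is orthogonal to every concept direction.
assert np.allclose(dirs.T @ c + r, h)
assert np.allclose(dirs @ r, 0)
```

Steering in such a setup would amount to editing the coefficients `c` before recombining, while attribution reads them off directly; the model's actual mechanism may differ.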
An article on the importance of explainability in machine learning and the challenges posed by neural networks. It highlights the difficulty of understanding the decision-making process of complex models and the need for greater transparency in AI development.