matlok 's Collections
LMM

Papers - Automated Interpretability

OpenAI has a 2024 tool referring to this technique: https://github.com/openai/transformer-debugger with https://transformer-circuits.pub/2023/monosema

OSZAR »