FILTER MODE ACTIVE

#interpretability

Records found: 8

#interpretability13/11/2025

OpenAI's New Transparent LLM Lets Researchers See How AI Thinks

'OpenAI developed a weight-sparse transformer that is far more interpretable than typical LLMs, enabling researchers to trace exact internal circuits that implement simple algorithms. While much smaller and slower than state-of-the-art models, this work could illuminate how larger models reason and fail.'