Post #935

@LinghaoCh

Parallel Experiments

Views889Post view count

PostedApr 1304/13/2025, 09:37 PM

Post content

https://www.anthropic.com/research/tracing-thoughts-language-model Anthropic 这个 LLM Interpretability 的研究得到了不少有趣的结论。想要 TLDR 可以读这篇博客；有兴趣可以看看两篇对应的论文，有更多细节并且页面交互做得不错。 #llm https://transformer-circuits.pub/2025/attribution-graphs/biology.html https://transformer-circuits.pub/2025/attribution-graphs/methods.html