Post content
https://www.anthropic.com/research/tracing-thoughts-language-model Anthropic's LLM interpretability research has produced quite a few interesting findings. For a TL;DR, read this blog post; if you want to go deeper, see the two accompanying papers, which have more detail and nicely done interactive pages. #llm https://transformer-circuits.pub/2025/attribution-graphs/biology.html https://transformer-circuits.pub/2025/attribution-graphs/methods.html