Post #924

@LinghaoCh

Parallel Experiments

Views3,030Post view count

PostedMar 1103/11/2025, 07:22 PM

Post content

前段时间准备 ML Interview (with a focus on LLMs)，浏览了不少学习资源，这里分享一些： CMU 11-711 Advanced NLP Language Modeling 综述。 The Transformer Blueprint: A Holistic Guide to the Transformer Neural Network Architecture 比较好的一篇 Transformer 综述。 3Blue1Brown: Attention in transformers, step-by-step 解释 Attention 最好的视频，没有之一。 Hugging Face: Mixture of Experts Explained Hugging Face: RLHF Hugging Face: Introduction to Deep Reinforcement Learning Hugging Face: Multimodal Models HF 这几个资源很适合快速查漏补缺相关的话题。 Lilian Weng: Agents 依然是最好的 Agents 综述之一。 Understanding Reasoning LLMs 一些 post-training 的细节，侧重分析了 DeepSeek R1 和 R1 Zero。 Designing Machine Learning Systems 笔记 by @tms_ur_way 适合快速查漏补缺 ML 实践中的要点。 Stable Diffusion Explained From Scratch 关于 Diffusion 基本原理的解释。除此之外以下这几位的内容都很不错，可以针对话题有选择性地摄入。 - Andrej Karpathy 的 YouTube 视频 - Lilian Weng 的博客 - Chip Huyen 的博客这里推荐的基本都比较入门 / high level，更多是为了查漏补缺。要深度挖掘具体话题还是得去看进一步的资源和论文等。 #ml#llm