wave
Pod
Get Wave AI
Sign In
103: 用Attention串起大模型优化史,详解DeepSeek、Kimi最新注意力机制改进 - 晚点聊 LateTalk | Wave AI Podcast Notes
Back to 晚点聊 LateTalk
103: 用Attention串起大模型优化史,详解DeepSeek、Kimi最新注意力机制改进
晚点聊 LateTalk
Wed Feb 26 2025
「与 InfLLM 与 MoA 的两位作者一起聊注意力。注意“注意力”是为了可预见的长长长……文本。」
Sign in to process episode
Loading summary...
No transcript available.
Send to Email