Wave Pod
Discover
Library
Get Wave AI
Sign In
103: 用Attention串起大模型优化史,详解DeepSeek、Kimi最新注意力机制改进 - 晚点聊 LateTalk | Wave AI Podcast Notes
← 晚点聊 LateTalk
晚点聊 LateTalk
103: 用Attention串起大模型优化史,详解DeepSeek、Kimi最新注意力机制改进
February 26, 2025
·
01:28:15
「与 InfLLM 与 MoA 的两位作者一起聊注意力。注意“注意力”是为了可预见的长长长……文本。」
Send to my inbox
Business
News
Technology
Loading summary
Sign in to save
Share
Sign in to transcribe