DeepSeek联合北大、清华研究团队发布DualPath推理系统，打破大模型存储带宽瓶颈

星*** · 发表于 2026-2-26 18:01:08

北京大学、清华大学与DeepSeek-AI的联合研究团队发布了一项针对大语言模型推理架构优化的最新研究成果。该团队成功研发了名为DualPath的全新推理系统，专门解决智能体工作负载下KV-Cache存储I/O带宽受限的技术难题。

依然27B小模型

arXiv.org (https://arxiv.org/abs/2602.21548)

DualPath: Breaking the Storage Bandwidth Bottleneck in Agentic LLM Inference (https://arxiv.org/abs/2602.21548)
The performance of multi-turn, agentic LLM inference is increasingly dominated by KV-Cache storage I/O rather than computation. In prevalent disaggregated architectures, loading the massive KV-Cache from external storage creates a fundamental...

		自动登录	找回密码
密码			立即注册