| Title |
H3: Hybrid Architecture Using High Bandwidth Memory and High Bandwidth Flash for Cost-Efficient LLM Inference |
| URL | |
| DOI | |
| Other |
Journal: IEEE Computer Architecture Letters. Authors: Minho Ha; Euiseok Kim; Hoshik Kim. Publication date: 2026 |
| Requester | |
| Download |
The downloading institution and IP information have been removed from the PDF
(2025-6-4)