Lv6
1934 积分 2024-02-04 加入
BitLoRA: Quantization-Compatible Adapter Tuning for 1.58-bit LLM in Federated On-Device AI-Agent
1天前
已完结
Lembda: Optimizing LLM Inference on Embedded Platforms via CPU/FPGA Co-processing
3天前
已完结
AiDE: Attention-FFN Disaggregated Execution for Cost-Effective LLM Decoding on CXL-PNM
16天前
已完结
A Review of Optimization Techniques for Large Language Model Inference
29天前
已完结
Dynamic ECN marking threshold algorithm for TCP congestion control in data center networks
1个月前
已完结
Adaptive marking threshold method for delay-sensitive TCP in data center network
1个月前
已完结
A-ECN Minimizing Queue Length for Datacenter Networks
1个月前
已完结
Security Opportunities and Challenges for Disaggregated Architectures (Invited)
1个月前
已完结
Multi-Host Sharing of a Single-Function NVMe Device in a PCIe Cluster
1个月前
已完结
Towards Automated Generation of Chiplet-Based Systems Invited Paper
1个月前
已完结