[논문] Oaken: Fast and Efficient LLM Serving with Online-Offline Hybrid KV Cache Quantization > 자료실

본문 바로가기

자료실

자료실

[논문] Oaken: Fast and Efficient LLM Serving with Online-Offline Hybrid …

페이지 정보

작성일25-12-31 | 조회461회

관련링크

목록

본문

학술지명 : International Symposium on Computer Architecture (ISCA'25)

제목 : Oaken: Fast and Efficient LLM Serving with Online-Offline Hybrid KV Cache Quantization

주저자 : Minsu Kim

게재일 : 2025. 6.

목록