Skip to content

MLA kv cache: fix split graph backend assignment when kv cache store on CPU#13648

Closed
xiang1guo wants to merge 1 commit intoggml-org:masterfrom
xiang1guo:xiang/master/fix-mla-kv-cache-offload
Closed

MLA kv cache: fix split graph backend assignment when kv cache store on CPU#13648
xiang1guo wants to merge 1 commit intoggml-org:masterfrom
xiang1guo:xiang/master/fix-mla-kv-cache-offload

Commits