Skip to content

Refactoring of multi-head attention and support for KV caching #6757

Refactoring of multi-head attention and support for KV caching

Refactoring of multi-head attention and support for KV caching #6757

Triggered via pull request June 1, 2025 15:53
Status Success
Total duration 5m 10s
Artifacts

cpu-tests.yml

on: pull_request
Matrix: pytester
Matrix: testing-imports
testing-guardian
0s
testing-guardian
Fit to window
Zoom out
Zoom in