Since vLLM depends on xformers, is it already using the [Flash-Decoding](https://together.ai/blog/flash-decoding-for-long-context-inference) algorithm?
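
For context, here is a minimal sketch of the split-KV reduction that Flash-Decoding performs for single-token decoding, as I understand it from the blog post: the KV cache is split into chunks, attention is computed per chunk in parallel, and the partial results are combined with a log-sum-exp rescaling. This is purely illustrative; the function name, shapes, and chunking scheme are my assumptions, not vLLM's or xformers' actual API, where the whole thing runs as a fused GPU kernel.

```python
import torch

def flash_decoding_reference(q, k, v, num_splits=4):
    """Illustrative split-KV attention for one decoded token.

    q: (d,) query for the current token; k, v: (n, d) cached keys/values.
    """
    scale = q.shape[-1] ** -0.5
    outs, maxes, sums = [], [], []
    # Each chunk could be processed by an independent thread block on GPU.
    for kc, vc in zip(k.chunk(num_splits), v.chunk(num_splits)):
        s = (kc @ q) * scale            # attention scores for this chunk
        m = s.max()
        p = torch.exp(s - m)            # numerically stabilized exponentials
        outs.append(p @ vc)             # unnormalized partial output
        maxes.append(m)
        sums.append(p.sum())
    m = torch.stack(maxes)
    # Global log-sum-exp across chunks, then rescale each partial output
    # by its share of the total softmax mass.
    lse = torch.logsumexp(m + torch.log(torch.stack(sums)), dim=0)
    w = torch.exp(m - lse)              # (num_splits,) combine weights
    return (w[:, None] * torch.stack(outs)).sum(dim=0)

# Sanity check against a naive single-pass softmax attention:
d, n = 64, 1024
q, k, v = torch.randn(d), torch.randn(n, d), torch.randn(n, d)
naive = torch.softmax((k @ q) * (d ** -0.5), dim=0) @ v
assert torch.allclose(flash_decoding_reference(q, k, v), naive, atol=1e-5)
```

My question is whether the xformers attention path that vLLM calls into already dispatches to a kernel that does this split-and-combine across the sequence dimension, or whether it still parallelizes only over batch and heads.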