Skip to content

Support Cohere Command-A (Cohere2ForCausalLM arch) #2912

@aikitoria

Description

@aikitoria

It would be great to support this new model! https://cohere.com/blog/command-a

They use a fairly unique architecture, where some layers use sliding window attention while others use global attention with no position embeddings, so even though I read through the documentation on how to add a model I'm a little lost on how to do this myself.

Metadata

Metadata

Assignees

Labels

InvestigatingKV-Cache Managementkv-cache management for efficient LLM inferencetriagedIssue has been triaged by maintainers

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions