Christmas is coming soon, and I want to take some time to research something interesting: edge low-power inference. Although whisper.cpp can already run on a Raspberry Pi, its inference performance is not fast enough for real-time transcription. Fortunately, there are now development boards whose processors include NPUs, which could make real-time transcription of larger models feasible. My primary goal is to support the RK3566 and RK3588 first.
Roadmap:
- MatMul offloading
- Conv-Gelu offloading
- LayerNorm offloading
...