[Feature request] Real time whisper transcription

**Real time whisper transcription**

Right now the demo works for a recording but does it in one shot. I'd love to be able to do it as I speak. Sadly the interface seems to be accepting only a Float32Array (or arrays of) and not a way to keep feeding it float32 arrays as we receive them from the audio source.

Would be great to be able to do it in a streaming fashion.

- Demo: https://huggingface.co/spaces/Xenova/whisper-web
- Code: https://github.com/xenova/whisper-web/blob/main/src/worker.js#L152
- API Doc: https://huggingface.co/docs/transformers.js/main/en/api/pipelines#module_pipelines.AutomaticSpeechRecognitionPipeline

**Reason for request**

I want to build a tool to help recording off voice and want to get a real time transcription to overlay on-top of the existing one to help get a sense of progress.

Thanks <3

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Feature request] Real time whisper transcription #405

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[Feature request] Real time whisper transcription #405

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions