Closed
Description
Real time whisper transcription
Right now the demo works for a recording but does it in one shot. I'd love to be able to do it as I speak. Sadly the interface seems to be accepting only a Float32Array (or arrays of) and not a way to keep feeding it float32 arrays as we receive them from the audio source.
Would be great to be able to do it in a streaming fashion.
- Demo: https://huggingface.co/spaces/Xenova/whisper-web
- Code: https://github.com/xenova/whisper-web/blob/main/src/worker.js#L152
- API Doc: https://huggingface.co/docs/transformers.js/main/en/api/pipelines#module_pipelines.AutomaticSpeechRecognitionPipeline
Reason for request
I want to build a tool to help recording off voice and want to get a real time transcription to overlay on-top of the existing one to help get a sense of progress.
Thanks <3