RFC: Streaming Inference / Application #1072

mthrok · 2020-12-08T02:24:03Z

The torchaudio team is looking for a way to support streaming applications. We are trying to define the problem space and scope the challenge we tackle. To that end, we would like to hear your thoughts on and experience with streaming applications. If you have any, please let us know by leaving a comment.

Questions include, but are not limited to:

How are you feeding your input stream to your system?
- WebSocket? FFMpeg + STDIN? gRPC?
What technology stack do you use?
- Kaldi? ONNX? TorchScript? TensorRT? DeepStream?
What are the pain points in your application lifecycle?
- Development?
- Deployment?
- Maintenance?
What type of application do you run?
- Speech recognition?
- Audio enhancement?
- Noise reduction?
- Audio event detection?
What kind of device is your application running on?
- Web server?
- Desktop system?
- Mobile device?
- Embedded system?
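
To make the "FFMpeg + STDIN" option above concrete, here is a minimal sketch of consuming a raw audio stream in fixed-size chunks. It is not a torchaudio API. It assumes the upstream process writes 16-bit little-endian mono PCM, and the names `iter_pcm_chunks` and `frames_per_chunk` are hypothetical, chosen only for illustration.

```python
import io
import struct
from typing import BinaryIO, Iterator, List


def iter_pcm_chunks(stream: BinaryIO, frames_per_chunk: int = 1600) -> Iterator[List[float]]:
    """Yield chunks of float samples from a raw 16-bit little-endian mono PCM stream.

    Assumption: the stream carries headerless s16le mono audio.
    """
    bytes_per_chunk = frames_per_chunk * 2  # 2 bytes per 16-bit sample
    pending = b""
    while True:
        # Pipes may return fewer bytes than requested, so accumulate until full.
        data = stream.read(bytes_per_chunk - len(pending))
        if not data:
            break  # end of stream
        pending += data
        if len(pending) < bytes_per_chunk:
            continue
        samples = struct.unpack(f"<{frames_per_chunk}h", pending)
        # Scale integers to the range [-1.0, 1.0).
        yield [s / 32768.0 for s in samples]
        pending = b""
    # Flush a trailing partial chunk, ignoring a dangling odd byte if present.
    n = len(pending) // 2
    if n:
        samples = struct.unpack(f"<{n}h", pending[: n * 2])
        yield [s / 32768.0 for s in samples]
```

One possible way to drive it is `ffmpeg -i input.wav -f s16le -ac 1 -ar 16000 - | python script.py`, with the script passing `sys.stdin.buffer` as `stream`. At 16 kHz the default of 1600 frames corresponds to 100 ms of audio per chunk. Each chunk would then be handed to whatever model or feature extractor the application uses.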

tongjinle123 · 2020-12-31T06:18:29Z

1. gRPC
2. ONNX, TorchScript
3. Deployment
4. Speech recognition
5. ~

mthrok · 2021-01-05T16:06:01Z

@tongjinle123 Thanks for the comment. Is your production environment Python or C++?

hbredin · 2021-04-12T11:40:07Z

As promised in #1442, here are my potential needs for pyannote.audio.
I don't have much experience in this area so bear with me if my answers fall flat.

* How are you feeding your input stream to your system?

I'd love to be able to expand this streamlit demo with the actual microphone of the end user. I guess with something like streamlit-webrtc.

* What technology stack do you use?
  * Kaldi? ONNX? TorchScript? TensorRT? DeepStream?

None of these for now... Just regular PyTorch as pyannote.audio is designed for research purposes, not actual production.

* What type of application do you run?

speaker diarization
audio event detection

* What kind of device is your application running on?

Web server

mthrok changed the title ~~[RFC] Streaming Application~~ [RFC] Streaming Inference / Application Dec 8, 2020

mthrok added the RFC label Dec 9, 2020

mthrok changed the title ~~[RFC] Streaming Inference / Application~~ RFC: Streaming Inference / Application Jan 1, 2021

mthrok mentioned this issue Jan 5, 2021

Sharing My Projects 2021 H1 #1154

Closed

vincentqb mentioned this issue Jan 8, 2021

DRAFT #1163

Closed

vincentqb mentioned this issue Jan 25, 2021

Roadmap ahead for torchaudio #1196

Closed

mogwai mentioned this issue Feb 8, 2021

Add support for file handle to pyannote.audio.core.io.Audio pyannote/pyannote-audio#564

Closed

mthrok mentioned this issue Apr 8, 2021

Pass pre-computed info to torchaudio.load() for file-like objects #1442

Closed

mthrok closed this as completed Oct 19, 2023
