Transcript of session - Custom media stream handling?

mvander115 · January 5, 2022, 4:35pm

Does OpenVidu have a better way to implement transcription other than sending the datastreams from the client to two different endpoints?

Before implementing OpenVidu we used Kurento directly, to support transcription we sent audio streams to 2 endpoints, the Kurento server and our own transcription processing endpoint. With our cutover to OpenVidu Enterprise (with MediaSoup) we have replicated the same logic.
We don’t like the implementation, it means we are forcing the clients to send the data twice to the same datacenter.
What we would like to do is to have the client only send media streams to our OpenVidu server and have that either process transcription or forward the audio stream to our transcription endpoint.
Are there any hooks or interfaces we can implement to allow this to occur?
Is there a better way to handle this requirement under OpenVidu?

micael.gallego · January 7, 2022, 3:47pm

You can implement an special OpenVidu participant to work as a “transcriptor”. This participant will be subscribed to streams sent by real participants.

To do it, you can implement the OpenVidu protocol simultating a participant in a browser. The tricky part is that you will need a webrtc stack to receive the stream (but you already have one to receive the media from the browser). You can implement the transcriptor participant from scratch based on OpenVidu-protocol or you can reuse openvidu-browser implementation.

Best regards

Topic		Replies	Views
Media Streams for AI Processing How to implement? v2	2	380	June 24, 2021
Real-Time Audio Stream Access for ML Transcription with OpenVidu 2.31.0 How to implement?	1	20	April 15, 2025
How to get media streams for further processing? Issues developing apps v2	11	591	May 23, 2021
Media (video, audio) extraction or real-time transmission Issues developing apps v2	5	391	June 4, 2024
Openvidu online radio station How to implement? v2	3	334	August 24, 2020

Transcript of session - Custom media stream handling?

Related topics