Real-Time Audio Stream Access for ML Transcription with OpenVidu 2.31.0

KinDo · April 11, 2025, 7:38pm

Hello everyone,

I hope you’re all doing great!

I’m currently working with OpenVidu 2.31.0 and looking for a way to access participants’ audio streams in real time. My goal is to pass the audio directly to a machine learning model for live transcription and audio analysis.

Has anyone here implemented something similar or can point me in the right direction for capturing audio streams on the server side (or browser, if that’s the only way)?

Any insights or suggestions would be greatly appreciated. Thanks in advance!

cruizba · April 15, 2025, 9:02am

Unfortunately, this is not possible with 2.31.0.

But you can try OpenVidu v3 PRO with the v2compatibility module enabled, and use LiveKit Agents to extract the audio in real time.

Your app will still work with v2, and you can use the Agents functionality to access the audio.
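
For reference, here is a rough sketch of what the Agents side could look like in Python. It is only an illustration, not tested against an OpenVidu deployment: `ChunkBuffer` and `transcribe` are hypothetical helpers made up for this example, and the LiveKit calls (`ctx.connect`, `AutoSubscribe.AUDIO_ONLY`, the `track_subscribed` event, `rtc.AudioStream`) are used as I understand them from the `livekit-agents` Python package, so check them against the version you install. It assumes the audio stream delivers mono 16-bit PCM frames.

```python
import asyncio
from array import array


class ChunkBuffer:
    """Hypothetical helper: collects 16-bit PCM and emits fixed-length chunks."""

    def __init__(self, sample_rate: int, chunk_seconds: float = 1.0) -> None:
        self.chunk_samples = int(sample_rate * chunk_seconds)
        self._samples = array("h")  # signed 16-bit samples, native byte order

    def push(self, pcm16: bytes) -> list:
        """Add raw int16 PCM bytes; return complete chunks as floats in [-1, 1)."""
        self._samples.frombytes(pcm16)
        chunks = []
        while len(self._samples) >= self.chunk_samples:
            head = self._samples[: self.chunk_samples]
            del self._samples[: self.chunk_samples]
            chunks.append([s / 32768.0 for s in head])
        return chunks


def transcribe(chunk: list) -> None:
    """Hypothetical placeholder: hand one chunk to your own ML model here."""
    print(f"received {len(chunk)} samples")


async def entrypoint(ctx) -> None:
    """Agent entrypoint: subscribe to audio only and feed frames to the model."""
    # Imported lazily so ChunkBuffer can be used without LiveKit installed.
    from livekit import rtc
    from livekit.agents import AutoSubscribe

    tasks = set()  # keep references so running tasks are not garbage collected

    async def consume(track) -> None:
        buffer = None
        async for event in rtc.AudioStream(track):
            frame = event.frame
            if buffer is None:
                # Assumption: the sample rate stays constant for the whole track.
                buffer = ChunkBuffer(frame.sample_rate)
            for chunk in buffer.push(bytes(frame.data)):
                transcribe(chunk)

    # Register the handler before connecting so no subscription is missed.
    @ctx.room.on("track_subscribed")
    def on_track_subscribed(track, publication, participant):
        if track.kind == rtc.TrackKind.KIND_AUDIO:
            task = asyncio.create_task(consume(track))
            tasks.add(task)
            task.add_done_callback(tasks.discard)

    await ctx.connect(auto_subscribe=AutoSubscribe.AUDIO_ONLY)


def main() -> None:
    """Start the agent worker; call this from your own launcher script."""
    from livekit.agents import WorkerOptions, cli

    cli.run_app(WorkerOptions(entrypoint_fnc=entrypoint))
```

To try it, call `main()` from a script with the usual `LIVEKIT_URL`, `LIVEKIT_API_KEY` and `LIVEKIT_API_SECRET` environment variables set to your deployment's values. The one-second chunking is just an example; pick whatever window your model expects.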
