From: Juliusz Chroboczek <jch@irif.fr> To: Jeroen van Veen <jvanveen@protonmail.com> Cc: "galene@lists.galene.org" <galene@lists.galene.org> Subject: [Galene] Re: Whisper transcriptions? Date: Thu, 24 Nov 2022 20:52:27 +0100 [thread overview] Message-ID: <87k03kw2o4.wl-jch@irif.fr> (raw) In-Reply-To: <99vOLGYuEKU4tptMtFO3zCb3JaLrGyndhvDJj_ztZ9NtSgOR8rLSItyhzSSXIcncXsFoO_XZJxP-yCQdYk6xmqNNR0bR8N_y2Yid4LrAEZU=@protonmail.com> > I was wondering whether someone considered integrating Galene with > automatic speech recognition software like Whisper for transcriptions. I think it would be a good idea. > It would be interesting to have realtime transcriptions per stream, but > I don't know yet how challenging this is technically. Galene doesn't decode the audio data, so this would need to be done on the client side. You could either do it in the client itself, or write a specialised client that receives the audio, transcribes it, and sends the transcription to the other participants. (I'd start by having the transcription in the chat, we could later design a protocol extension that allows a client to publish captions.) The advantage of doing this in the client is that the transcription client can be written in whatever language is convenient, as long as it has a WebRTC library. -- Juliusz
prev parent reply other threads:[~2022-11-24 19:52 UTC|newest] Thread overview: 2+ messages / expand[flat|nested] mbox.gz Atom feed top 2022-11-19 20:35 [Galene] " Jeroen van Veen 2022-11-24 19:52 ` Juliusz Chroboczek [this message]
Reply instructions: You may reply publicly to this message via plain-text email using any one of the following methods: * Save the following mbox file, import it into your mail client, and reply-to-all from there: mbox Avoid top-posting and favor interleaved quoting: https://en.wikipedia.org/wiki/Posting_style#Interleaved_style List information: https://lists.galene.org/postorius/lists/galene.lists.galene.org/ * Reply using the --to, --cc, and --in-reply-to switches of git-send-email(1): git send-email \ --in-reply-to=87k03kw2o4.wl-jch@irif.fr \ --to=jch@irif.fr \ --cc=galene@lists.galene.org \ --cc=jvanveen@protonmail.com \ --subject='[Galene] Re: Whisper transcriptions?' \ /path/to/YOUR_REPLY https://kernel.org/pub/software/scm/git/docs/git-send-email.html * If your mail client supports setting the In-Reply-To header via mailto: links, try the mailto: link
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox