Galène videoconferencing server discussion list archives
From: Juliusz Chroboczek <jch@irif.fr>
To: Jeroen van Veen <jvanveen@protonmail.com>
Cc: "galene@lists.galene.org" <galene@lists.galene.org>
Subject: [Galene] Re: Whisper transcriptions?
Date: Thu, 24 Nov 2022 20:52:27 +0100
Message-ID: <87k03kw2o4.wl-jch@irif.fr>
In-Reply-To: <99vOLGYuEKU4tptMtFO3zCb3JaLrGyndhvDJj_ztZ9NtSgOR8rLSItyhzSSXIcncXsFoO_XZJxP-yCQdYk6xmqNNR0bR8N_y2Yid4LrAEZU=@protonmail.com>

> I was wondering whether someone considered integrating Galene with
> automatic speech recognition software like Whisper for transcriptions.

I think it would be a good idea.

> It would be interesting to have realtime transcriptions per stream, but
> I don't know yet how challenging this is technically.

Galene doesn't decode the audio data, so this would need to be done on the
client side.  You could either do it in the client itself, or write
a specialised client that receives the audio, transcribes it, and sends
the transcription to the other participants.  (I'd start by having the
transcription in the chat; we could later design a protocol extension that
allows a client to publish captions.)

The advantage of doing this in a specialised client is that the
transcription client can be written in whatever language is convenient,
as long as it has a WebRTC library.
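
As a rough illustration of the specialised-client idea, here is a minimal
Python sketch of the relay step only.  Everything in it is an assumption
made for illustration: the TranscriptionRelay class, the transcribe and
send_chat callbacks, and the 16 kHz 16-bit mono PCM format are hypothetical
and are not part of Galene or of any particular WebRTC or speech
recognition library.  A real client would wire on_audio to decoded audio
from its WebRTC library and send_chat to Galene's chat.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict


@dataclass
class TranscriptionRelay:
    """Buffers decoded audio per user and posts transcripts to the chat."""

    # Hypothetical hook: wraps whatever speech recogniser is used.
    transcribe: Callable[[bytes], str]
    # Hypothetical hook: posts one line of text to the group chat.
    send_chat: Callable[[str], None]
    # Assumed format: 16 kHz, 16-bit mono PCM, so 32000 bytes is about 1 s.
    min_bytes: int = 32000
    _buffers: Dict[str, bytearray] = field(default_factory=dict)

    def on_audio(self, username: str, pcm: bytes) -> None:
        """Accumulate audio for one user; transcribe once enough is buffered."""
        buf = self._buffers.setdefault(username, bytearray())
        buf.extend(pcm)
        if len(buf) < self.min_bytes:
            return
        text = self.transcribe(bytes(buf)).strip()
        buf.clear()
        if text:  # skip silence or empty recogniser output
            self.send_chat(f"[{username}] {text}")


if __name__ == "__main__":
    # Stub hooks so the sketch runs without any audio or network stack.
    relay = TranscriptionRelay(
        transcribe=lambda pcm: f"{len(pcm)} bytes heard",
        send_chat=print,
        min_bytes=4,
    )
    relay.on_audio("alice", b"\x00\x00")  # below threshold, nothing sent
    relay.on_audio("alice", b"\x00\x00")  # threshold reached, one line sent
```

Keeping the recogniser and the chat transport behind plain callbacks means
the buffering logic can be tested on its own, and the same structure would
carry over to a later caption-publishing extension.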

-- Juliusz

Thread overview: 2+ messages
2022-11-19 20:35 [Galene] " Jeroen van Veen
2022-11-24 19:52 ` Juliusz Chroboczek [this message]
