From mboxrd@z Thu Jan 1 00:00:00 1970
Date: Thu, 24 Nov 2022 20:52:27 +0100
Message-ID: <87k03kw2o4.wl-jch@irif.fr>
From: Juliusz Chroboczek
To: Jeroen van Veen
CC: "galene@lists.galene.org"
In-Reply-To: <99vOLGYuEKU4tptMtFO3zCb3JaLrGyndhvDJj_ztZ9NtSgOR8rLSItyhzSSXIcncXsFoO_XZJxP-yCQdYk6xmqNNR0bR8N_y2Yid4LrAEZU=@protonmail.com>
Subject: [Galene] Re: Whisper transcriptions?
List-Id: Galène videoconferencing server discussion list

> I was wondering whether someone considered integrating Galene with
> automatic speech recognition software like Whisper for transcriptions.

I think it would be a good idea.

> It would be interesting to have realtime transcriptions per stream, but
> I don't know yet how challenging this is technically.

Galene doesn't decode the audio data, so this would need to be done on the
client side.  You could either do it in the client itself, or write
a specialised client that receives the audio, transcribes it, and sends
the transcription to the other participants.  (I'd start by having the
transcription in the chat, we could later design a protocol extension that
allows a client to publish captions.)

The advantage of doing this in the client is that the transcription client
can be written in whatever language is convenient, as long as it has
a WebRTC library.

-- Juliusz
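
The specialised-client idea above could be sketched roughly as follows. This is only an illustration of the "receive audio, transcribe, post to chat" flow, not Galene code: `CaptionBuffer` and `chat_message` are hypothetical names, the audio is assumed to have already been received over WebRTC and decoded to raw PCM, the `transcribe` callable stands in for whatever speech recogniser is used (Whisper, for example), and the JSON shape of the chat message is a simplified assumption that should be checked against Galene's protocol documentation.

```python
import json
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class CaptionBuffer:
    """Accumulates decoded PCM for one stream and transcribes it in chunks.

    Hypothetical helper: one instance per received audio stream.
    """
    # Stand-in for a real recogniser; takes raw PCM bytes, returns text.
    transcribe: Callable[[bytes], str]
    # Assumed default: 5 s of 16 kHz, 16-bit mono audio per chunk.
    chunk_bytes: int = 16000 * 2 * 5
    _pending: bytearray = field(default_factory=bytearray, init=False)

    def feed(self, pcm: bytes) -> List[str]:
        """Add decoded audio; return a transcript for each completed chunk."""
        self._pending.extend(pcm)
        transcripts = []
        while len(self._pending) >= self.chunk_bytes:
            chunk = bytes(self._pending[:self.chunk_bytes])
            del self._pending[:self.chunk_bytes]
            text = self.transcribe(chunk).strip()
            if text:  # skip chunks where nothing was recognised
                transcripts.append(text)
        return transcripts


def chat_message(speaker: str, text: str) -> str:
    """Serialise a transcript as a chat message.

    The field names here are an assumption, not a verified wire format.
    """
    return json.dumps({
        "type": "chat",
        "username": "transcriber",
        "value": f"[{speaker}] {text}",
    })
```

A client built this way would keep one `CaptionBuffer` per remote stream, call `feed` whenever decoded audio arrives, and send each resulting `chat_message` over its signalling connection. Fixed-size chunking is the simplest choice; splitting on silence would likely give better sentence boundaries.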