Date: Thu, 10 Jul 2025 14:41:29 +0200
From: Michael Niedermayer
To: FFmpeg development discussions and patches
Subject: Re: [FFmpeg-devel] [PATCH] Whisper audio filter
Message-ID: <20250710124129.GQ29660@pb2>
References: <20250709072350.578693-1-vpalmisano@gmail.com>

Hi

On Wed, Jul 09, 2025 at 11:24:26PM +0800, Zhao Zhili wrote:
>
> > On Jul 9, 2025, at 15:23, Vittorio Palmisano wrote:
> >
> > It adds a new audio filter for running audio transcriptions with the whisper model.
> > Documentation and examples are included into the patch.
>
> The patch doesn’t following ffmpeg coding style.
>
> Setting aside the coding style issues, I have a few concerns.
>
> There are DNN support with three backends in FFmpeg (libavfilter/dnn_interface.h), which
> are supposed to be robust extensibility.

If someone implements some high quality speech recognition more natively
that surely would be cool. Our own model would be cool too, ... or supporting
the whisper models natively, all that would be very cool

> I guess incorporating Whisper natively into our DNN
> architecture can be difficult, making wrapper another library more feasible than direct
> integration.

yes, i think so too

thx

[...]
-- 
Michael     GnuPG fingerprint: 9FF2128B147EF6730BADF133611EC787040B0FAB

No human being will ever know the Truth, for even if they happen to say it
by chance, they would not even known they had done so. -- Xenophanes