From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from ffbox0-bg.ffmpeg.org (ffbox0-bg.ffmpeg.org [79.124.17.100]) by master.gitmailbox.com (Postfix) with ESMTPS id BC7C44A785 for ; Fri, 18 Jul 2025 11:00:05 +0000 (UTC) Received: from [127.0.1.1] (localhost [127.0.0.1]) by ffbox0-bg.ffmpeg.org (Postfix) with ESMTP id D0AED68C726; Fri, 18 Jul 2025 14:00:01 +0300 (EEST) Received: from relay15.mail.gandi.net (relay15.mail.gandi.net [217.70.178.235]) by ffbox0-bg.ffmpeg.org (Postfix) with ESMTPS id 5958068C34B for ; Fri, 18 Jul 2025 13:59:55 +0300 (EEST) Received: by mail.gandi.net (Postfix) with ESMTPSA id 6BFF14317D for ; Fri, 18 Jul 2025 10:59:54 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=niedermayer.cc; s=gm1; t=1752836394; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:mime-version:mime-version:content-type:content-type: in-reply-to:in-reply-to:references:references; bh=nvhnze9S7Jy6IgGcx+ZmEedEnkOhMzS1vZeqfb1JWr8=; b=aue97C52N6MImwJ3WukIyII4dYl9dsgTlRQrbGn7MEGv40KHnzZ/vrAh5LbRc//gCQ30zt mPGSiEE9qIXoxQeVlDDpV9Jp+EV0+MIyFg+mep4RMRGzQf9V1slPLNXNvbt7vBjSVlYQUz EFLwS+0+gayZ6lT9o4lbYCoTwDrUWVPKNldGuuoTptpjCRUd0HqTHGPILWq3+zz9M6iMO9 W+qkI035yiELEGDOXPh7H7i/3RGHgpaHkpj2TJ1ljnSBF52qeoeWMCcichTeT3jud88nTT T/fyt7kasltbYNEL1EQfTVCK3B70tQhoMZst4uMue+Ct45SVr+hyHmgyUU4cug== Date: Fri, 18 Jul 2025 12:59:53 +0200 From: Michael Niedermayer To: FFmpeg development discussions and patches Message-ID: <20250718105953.GX29660@pb2> References: MIME-Version: 1.0 In-Reply-To: X-GND-State: clean X-GND-Score: -85 X-GND-Cause: gggruggvucftvghtrhhoucdtuddrgeeffedrtdefgdeifedvkecutefuodetggdotefrodftvfcurfhrohhfihhlvgemucfitefpfffkpdcuggftfghnshhusghstghrihgsvgenuceurghilhhouhhtmecufedtudenucesvcftvggtihhpihgvnhhtshculddquddttddmnegfrhhlucfvnfffucdludehmdenucfjughrpeffhffvuffkfhggtggujgesghdtreertddtvdenucfhrhhomhepofhitghhrggvlhcupfhivgguvghrmhgrhigvrhcuoehmihgthhgrvghlsehnihgvuggvrhhmrgihvghrrdgttgeqnecuggftrfgrthhtvghrnhepieegkedtjeduffejhfetgeejtdegteetgfegtdfhjefgvefhteegkeejtddvhfevnecukfhppeeguddrieeirdeihedrudejieenucevlhhushhtvghrufhiiigvpedtnecurfgrrhgrmhepihhnvghtpeeguddrieeirdeihedrudejiedphhgvlhhopehlohgtrghlhhhoshhtpdhmrghilhhfrhhomhepmhhitghhrggvlhesnhhivgguvghrmhgrhigvrhdrtggtpdhnsggprhgtphhtthhopedupdhrtghpthhtohepfhhfmhhpvghgqdguvghvvghlsehffhhmphgvghdrohhrgh Subject: Re: [FFmpeg-devel] [PATCH v2 1/1] swscale/aarch64/output: Implement neon assembly for yuv2planeX_10_c_template() X-BeenThere: ffmpeg-devel@ffmpeg.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: FFmpeg development discussions and patches List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Reply-To: FFmpeg development discussions and patches Content-Type: multipart/mixed; boundary="===============1905254779673779407==" Errors-To: ffmpeg-devel-bounces@ffmpeg.org Sender: "ffmpeg-devel" Archived-At: List-Archive: List-Post: --===============1905254779673779407== Content-Type: multipart/signed; micalg=pgp-sha512; protocol="application/pgp-signature"; boundary="HQZOnxSrMQCzt9JM" Content-Disposition: inline --HQZOnxSrMQCzt9JM Content-Type: text/plain; charset=us-ascii Content-Disposition: inline Content-Transfer-Encoding: quoted-printable On Tue, Jul 15, 2025 at 05:06:54AM +0000, Logaprakash Ramajayam wrote: > Handled all the comments and attaching the checkasm results. >=20 > Checkasm Benchmark results: >=20 >=20 > yuv2yuvX_10_LE_16_0_512_accurate_c: 7836.9 ( 1.00x) > yuv2yuvX_10_LE_16_0_512_accurate_neon: 840.4 ( 9.33x) > yuv2yuvX_10_LE_16_0_512_approximate_c: 7930.8 ( 1.00x) > yuv2yuvX_10_LE_16_0_512_approximate_neon: 838.5 ( 9.46x) > yuv2yuvX_10_LE_16_16_512_accurate_c: 7594.3 ( 1.00x) > yuv2yuvX_10_LE_16_16_512_accurate_neon: 815.2 ( 9.32x) > yuv2yuvX_10_LE_16_16_512_approximate_c: 7687.0 ( 1.00x) > yuv2yuvX_10_LE_16_16_512_approximate_neon: 811.9 ( 9.47x) > yuv2yuvX_10_LE_16_32_512_accurate_c: 7366.4 ( 1.00x) > yuv2yuvX_10_LE_16_32_512_accurate_neon: 785.8 ( 9.37x) > yuv2yuvX_10_LE_16_32_512_approximate_c: 7426.5 ( 1.00x) > yuv2yuvX_10_LE_16_32_512_approximate_neon: 786.4 ( 9.44x) > yuv2yuvX_10_LE_16_48_512_accurate_c: 7123.1 ( 1.00x) > yuv2yuvX_10_LE_16_48_512_accurate_neon: 761.7 ( 9.35x) > yuv2yuvX_10_LE_16_48_512_approximate_c: 7182.7 ( 1.00x) > yuv2yuvX_10_LE_16_48_512_approximate_neon: 763.0 ( 9.41x) > yuv2yuvX_10_BE_16_0_512_accurate_c: 8092.6 ( 1.00x) > yuv2yuvX_10_BE_16_0_512_accurate_neon: 860.2 ( 9.41x) > yuv2yuvX_10_BE_16_0_512_approximate_c: 8183.5 ( 1.00x) > yuv2yuvX_10_BE_16_0_512_approximate_neon: 861.4 ( 9.50x) > yuv2yuvX_10_BE_16_16_512_accurate_c: 7837.4 ( 1.00x) > yuv2yuvX_10_BE_16_16_512_accurate_neon: 834.0 ( 9.40x) > yuv2yuvX_10_BE_16_16_512_approximate_c: 7927.9 ( 1.00x) > yuv2yuvX_10_BE_16_16_512_approximate_neon: 834.6 ( 9.50x) > yuv2yuvX_10_BE_16_32_512_accurate_c: 7605.1 ( 1.00x) > yuv2yuvX_10_BE_16_32_512_accurate_neon: 807.5 ( 9.42x) > yuv2yuvX_10_BE_16_32_512_approximate_c: 7691.4 ( 1.00x) > yuv2yuvX_10_BE_16_32_512_approximate_neon: 807.3 ( 9.53x) > yuv2yuvX_10_BE_16_48_512_accurate_c: 7344.3 ( 1.00x) > yuv2yuvX_10_BE_16_48_512_accurate_neon: 782.7 ( 9.38x) > yuv2yuvX_10_BE_16_48_512_approximate_c: 7440.1 ( 1.00x) > yuv2yuvX_10_BE_16_48_512_approximate_neon: 781.9 ( 9.51x) >=20 > From 3e14b4c2e763d2d0c8979e3e99578f5492b7130c Mon Sep 17 00:00:00 2001 > From: Logaprakash Ramajayam > Date: Tue, 1 Jul 2025 23:48:36 -0700 > Subject: [PATCH v2 1/1] swscale/aarch64/output: Implement neon assembly f= or yuv2planeX_10_c_template() >=20 > --- > libswscale/aarch64/output.S | 189 +++++++++++++++++++++++++++++++++++ > libswscale/aarch64/swscale.c | 38 +++++++ > tests/checkasm/sw_scale.c | 170 ++++++++++++++++++++----------- > 3 files changed, 337 insertions(+), 60 deletions(-) Applying: swscale/aarch64/output: Implement neon assembly for yuv2planeX_10= _c_template() Using index info to reconstruct a base tree... M tests/checkasm/sw_scale.c error: patch failed: libswscale/aarch64/output.S:20 error: libswscale/aarch64/output.S: patch does not apply error: patch failed: libswscale/aarch64/swscale.c:158 error: libswscale/aarch64/swscale.c: patch does not apply error: patch failed: tests/checkasm/sw_scale.c:52 error: tests/checkasm/sw_scale.c: patch does not apply error: Did you hand edit your patch? It does not apply to blobs recorded in its index. Patch failed at 0001 swscale/aarch64/output: Implement neon assembly for yu= v2planeX_10_c_template() [...] --=20 Michael GnuPG fingerprint: 9FF2128B147EF6730BADF133611EC787040B0FAB No great genius has ever existed without some touch of madness. -- Aristotle --HQZOnxSrMQCzt9JM Content-Type: application/pgp-signature; name="signature.asc" -----BEGIN PGP SIGNATURE----- iF0EABEKAB0WIQSf8hKLFH72cwut8TNhHseHBAsPqwUCaHopIwAKCRBhHseHBAsP q6EoAJ9OlKsT4gnSPGY1rZKuuUri/ViO2QCeMxWi6qoQRVQBWuZFPQQsdkEoihw= =R7eR -----END PGP SIGNATURE----- --HQZOnxSrMQCzt9JM-- --===============1905254779673779407== Content-Type: text/plain; charset="us-ascii" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit Content-Disposition: inline _______________________________________________ ffmpeg-devel mailing list ffmpeg-devel@ffmpeg.org https://ffmpeg.org/mailman/listinfo/ffmpeg-devel To unsubscribe, visit link above, or email ffmpeg-devel-request@ffmpeg.org with subject "unsubscribe". --===============1905254779673779407==--