From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from ffbox0-bg.ffmpeg.org (ffbox0-bg.ffmpeg.org [79.124.17.100]) by master.gitmailbox.com (Postfix) with ESMTPS id D51214B43C for ; Mon, 21 Jul 2025 07:27:49 +0000 (UTC) Received: from [127.0.1.1] (localhost [127.0.0.1]) by ffbox0-bg.ffmpeg.org (Postfix) with ESMTP id 3F26068BE1C; Mon, 21 Jul 2025 10:27:45 +0300 (EEST) Received: from MA0PR01CU012.outbound.protection.outlook.com (mail-southindiaazon11021092.outbound.protection.outlook.com [40.107.57.92]) by ffbox0-bg.ffmpeg.org (Postfix) with ESMTPS id 29D7568ACF5 for ; Mon, 21 Jul 2025 10:27:38 +0300 (EEST) ARC-Seal: i=1; a=rsa-sha256; s=arcselector10001; d=microsoft.com; cv=none; b=h54UwpKRe3t7SH4Xadru0yFMB6S5rCepi8hp1E7HOYPGsVH6VJIVWXPQmUY8miJFX3Ed5tWcuui9Bv3InTmBpQv/Hjjak8ju90fAwTseNyRFGj6cck8GYZIVFIurIHYSvN8GwUaezt4qyU93W92mpq7dGfLygpIUd0coLFlWg+R1tTZoAiujGZ0dOJ+N23+ar2cStD/xOvi02fy6SbnMR+7Jax90+jJao5XxzlXAR6ykQT9OaBpCuCXJC3nge4a30J7KqKasdjOqvVcdasBa2wYeBQ86CiCXMLhWEq6uiPLdqUgzeST0LfAk8XBCaqWJ8qkLgGzudjs0uqtOJouQIA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector10001; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-AntiSpam-MessageData-ChunkCount:X-MS-Exchange-AntiSpam-MessageData-0:X-MS-Exchange-AntiSpam-MessageData-1; bh=vn2pSz3BNXDHytxtilGuNL49PIXXSZI5Q4Hwt+5R9z4=; b=hxtiJJlfRoM1XlD/VDXpTwRfQb6U8NG6/D8hPweOg0O4sX2mSHrHuW7fT06ebmNVA6RJFoFlbkIYkHCqpMXZb40SGrDkZd+FM9LhHKBNWt/2Ul+SauilcZGmZQM1S9xRkMKzpGsdZNlZfnL3qLJytbwBWfgON7SKxmNwr6uVhw8FMnPXqyXfvSj4nDROnvCkApQgRZN5t/hX98C6iVNEGysQjpwC+mo+vPTQPuWt4wemissIGMOSa7+ny8FBO/R9OWC9Tjj4Oawmv14Qy2DeapA8LGz/Kdjm06SZryH/BekNVV4SsBjXLgXW/JpXNC2qHnLC2+154bUNUmTc4imORg== ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=multicorewareinc.com; dmarc=pass action=none header.from=multicorewareinc.com; dkim=pass header.d=multicorewareinc.com; arc=none DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=multicorewareinc.com; s=selector1; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=vn2pSz3BNXDHytxtilGuNL49PIXXSZI5Q4Hwt+5R9z4=; b=fQhUYYfMDSva68Fls7pSzqA5/roxX/vhjkciQn2MefnTsV658W1t6f68pC7ahaZkkwqgL7Zd2yThuvGlix0eDaKzmMIL4jM1FL1bROzE95MzN9RfmLSDPWaDKE18z5rLyyDXVepqEyZlrnfm7UxPEzfAya38kv8/LeWM3q/WXnK6GgQutjH/bNYj+pTubJ3qw5q/xraAWit5H6deh2IwRewrbwybTP9rq99s8CbggNRhXv+Z3XBaZJmfF5rMgSLbgv1HcFBzUlT5bRuXaRdxG8Hctptgpl5JpDFJr9Ig2iIkXM9hU91MhOCdKLeV/P2rlEfaKNtQUpWTuQU6kMOE3Q== Received: from PN3P287MB3339.INDP287.PROD.OUTLOOK.COM (2603:1096:c01:22d::6) by MA0P287MB0829.INDP287.PROD.OUTLOOK.COM (2603:1096:a01:e4::6) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.8943.30; Mon, 21 Jul 2025 07:27:33 +0000 Received: from PN3P287MB3339.INDP287.PROD.OUTLOOK.COM ([fe80::4a22:77a3:8f7d:445]) by PN3P287MB3339.INDP287.PROD.OUTLOOK.COM ([fe80::4a22:77a3:8f7d:445%4]) with mapi id 15.20.8943.029; Mon, 21 Jul 2025 07:27:33 +0000 From: Logaprakash Ramajayam To: FFmpeg development discussions and patches Thread-Topic: [FFmpeg-devel] [PATCH v2 1/1] swscale/aarch64/output: Implement neon assembly for yuv2planeX_10_c_template() Thread-Index: AQHb9UKPgTMtv7k500G9ixUFc3lDRrQ3u6iAgAR7MwU= Date: Mon, 21 Jul 2025 07:27:33 +0000 Message-ID: References: <20250718105953.GX29660@pb2> In-Reply-To: <20250718105953.GX29660@pb2> Accept-Language: en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: msip_labels: authentication-results: dkim=none (message not signed) header.d=none;dmarc=none action=none header.from=multicorewareinc.com; x-ms-publictraffictype: Email x-ms-traffictypediagnostic: PN3P287MB3339:EE_|MA0P287MB0829:EE_ x-ms-office365-filtering-correlation-id: 40135b2c-9864-46df-d2b4-08ddc8280ef5 x-ms-exchange-senderadcheck: 1 x-ms-exchange-antispam-relay: 0 x-microsoft-antispam: BCL:0; ARA:13230040|1800799024|376014|366016|8096899003|13003099007|7053199007|38070700018; x-microsoft-antispam-message-info: =?us-ascii?Q?yug2Bqw8jN+qh3eipRVawn2NRPRRkuUYynAlBDVa5LSv28cswNLA+ijK9Q0R?= =?us-ascii?Q?4Z5waM/X8L4X0YM/inKH357B1gY39uouNhAnpsozfsRcZCny97jMNWWuSTCr?= =?us-ascii?Q?h05EvfFzQqOB8UJACcDVgU4HXsrp/CnEU3nYKIk2l73CrGtKzfjmZyacMexN?= =?us-ascii?Q?rWxihLeQKVSCFaT/yKQY1+Etl8Fu8l55s7WACUa+ETLlc1wvCPC5DP7Vzk6c?= =?us-ascii?Q?r35a/f0Z3l7JYuPSCClE+MV7FZ23hPhr8ULB+JYcWdh9H9E9mnKf2khwUK8X?= =?us-ascii?Q?IuxUKDJ61KpZnZk/NUrSZJ8ioJtK4aHNjCsI2IlZReT3tAqfDjs0k5L7+Xzy?= =?us-ascii?Q?YwURr2V7YwM675WIY8kWuUMiCHlrGeXzqFe7or86cD+jXrRdrkYbwmgkqB3N?= =?us-ascii?Q?dZsBipzSWquMdUbxvMTyH71CBlu8x1vNnMubBGe9nj3dY+7qe8xhJOyAbB0U?= =?us-ascii?Q?DDRGkbdAnkB8JOO+qVjKmryXU/n680D2fzwdtKC4iApftD1hF/fxX+oaEno/?= =?us-ascii?Q?ODoPCDEuOPnb+2u7pb8DdhqCJalWMadFGz96WgKB2ds6ieOg0wEi49jvtbYk?= =?us-ascii?Q?cB4KI+GUbE0kW8nVrKF5u7JnR+PWTiSkHJEpf5kHT/AB2s2/Bnv/F5YjraFf?= =?us-ascii?Q?+v9UXSKw+ll+5a+bGa52JBVUv4mtH0KaYWkMhDXK2rYnlFY1DwOJR8PVCAeh?= =?us-ascii?Q?5oXvvWeYuUsEtcxcDDLZiyUCu47wihtxg++mJExK8oA1rRZ0ldLnwhK0Cg9l?= =?us-ascii?Q?GAkjezy0JTmM0rGu3xB3HBHgdQXO14lpUdypi3bN8oDR0r1PDSX+QYKpewRh?= =?us-ascii?Q?txFXyT6YLeRn6Bkn3dlXdHaqymIT0uxuO1h3nHTijFKHq8ed/X5hfKrp63Pu?= =?us-ascii?Q?CnF3MuWWRi/fPufIwjKJxbKsHrLjh7zTk76sb+J2MURa9BvOScvI4lSYpQRQ?= =?us-ascii?Q?YuI5yvCR48kzX0O0y98pEmF3d+ZmWYXitprytNT8Qa9YyURaQFI0igJ+QEx6?= =?us-ascii?Q?6I+PD58zdYSWP1SFwjZ4WV/53GU10zXLBImwph2qu1m01CG15p16pSU127il?= =?us-ascii?Q?ZbxBzj7WfUXUcdZX4vKFQKJJJa4dQl1ZRuYki1M+qFl+ueYso/gWS5Ryo1NL?= =?us-ascii?Q?YtRJd4Wod4+OphBkSr5yk5YwCe7+e9ZemipVjGxbK7EVkGkdlAiqj3AuhnZG?= =?us-ascii?Q?AYz4v/yiKydR9Em6AgiZoPWZ0Sg/G2Mqd/Xli3kUwb4DYeTu4yWHW05rj/Gp?= =?us-ascii?Q?waigq22aqrpAMLtySFd2AwsJtxZo2UEyegNgI0P+c9IWDPq3cNGXx+ICeuy0?= =?us-ascii?Q?i//8ZGB3EYkh2NBVTYNCyE6wK/vsVytl9iH3NjDZtHxr2LcnztBx6dCawFd8?= =?us-ascii?Q?pQjLfT60Q46ESQFL5PsSPz40/Cb06/Xwf8L5464K1shB26owwsJ6DJBsLUms?= =?us-ascii?Q?9YN8Ae3MHIg=3D?= x-forefront-antispam-report: CIP:255.255.255.255; CTRY:; LANG:en; SCL:1; SRV:; IPV:NLI; SFV:NSPM; H:PN3P287MB3339.INDP287.PROD.OUTLOOK.COM; PTR:; CAT:NONE; SFS:(13230040)(1800799024)(376014)(366016)(8096899003)(13003099007)(7053199007)(38070700018); DIR:OUT; SFP:1102; x-ms-exchange-antispam-messagedata-chunkcount: 1 x-ms-exchange-antispam-messagedata-0: =?us-ascii?Q?9uFfdjBicEVh2s4O4XWK3s+4nRS7ImrNcFJg8seA2OqvhJTh7sYxAt8VJ4Hn?= =?us-ascii?Q?Haw2zWPRQ0gciDIxebfj5IR2HcQ+cfcCD6RE+pWJyEq36NhbSRtXNmpKg2te?= =?us-ascii?Q?MzB3zXKXLEPStzrGeSL+Rf3+3X4IVeReE9Mct/aAfu5OC+aFaoD9fj3DqnWU?= =?us-ascii?Q?70QU2lLOWa61OscUyb8uSFTSZ3jWHBuozEN5iSQTO1Cx8KbtM8ZFHyG6FUwe?= =?us-ascii?Q?lHH1c00unbZBoCUFLugvtPjHmBbCK+fFK3jSuWRg5bBFIiqxyMgjMfW2y/TF?= =?us-ascii?Q?1b9ZjBQFiwOwABKAgIHNvPcQ61iGluU9EetLC9J2ISfYAFahyObdGvM4TZx6?= =?us-ascii?Q?z9TY1SfrcY2f7sNB/0J3KN1+s+TM8PW55URdQzqNK01tXJ5wArOExa1/DEL8?= =?us-ascii?Q?5rJ/lHeaTbgsqBDtRSQYJiCyZhsMUf16Ht5ZuBcqhMKkfRK4ML/9CGnCkdFo?= =?us-ascii?Q?4r6pZVcVPAaQ5oIDOjbNutBdgesjpBQcDXVYP0In9YS3+vkC8uuW6jeVks85?= =?us-ascii?Q?D10LL1NwCZWga1DwmmQypLPITGUEHAibo7Yia/tlUhB06vp4MPbQ070n0A9/?= =?us-ascii?Q?XJMCkNUuWviwc6o1tLs77sFam6AKapD+7VPTLtOKL7eeAk87Am7eiCfRpEmU?= =?us-ascii?Q?z3KO+f0PYScbfHcIsMYLq6xIJIgp9mG7km6pi+62Jh/JyVh3poGMtufELdrz?= =?us-ascii?Q?cK0BYmuLMKnr3aS23OFT6YvVftQ4eSI2tdq+1xpCrV5h59BoEFiEs6bvdZPE?= =?us-ascii?Q?8vg8nqTJBfXycJFCCDmiuPDd1AshBF1z524asWesFLR4mvxnoSIO0KSz2vX0?= =?us-ascii?Q?aJteUb/2TD8JFwDkk3cT4eRJ1Tu3eLwjg/RSVT7RSW/A/+VYeh+171GVnDgP?= =?us-ascii?Q?i4W2mWfGugE+UZVd5dDcDyOkxG5gTksL2Unf3vp3686b/xCFK8o8KZVbHMTN?= =?us-ascii?Q?nRUCeS7hw6Fn7LmIWlNkCe+jZjb4AawNIdfsRo/z3uRdwtldVb+4gY6ziIHV?= =?us-ascii?Q?1groVC1Kjmv/MRWdMyxE20DqU0pdkhITjM0opZB0q20RTS6+/PMwjFVkaS7u?= =?us-ascii?Q?BiXOBNmgPdjPIsaSjN7LToro1LOHGaWF5rPwY2dtffrMxDhp50lUTDCqKM3Z?= =?us-ascii?Q?piHTt7DUjss/FM+73U7p5uLUQNZ+w9LMa161Rkb47qPSBDRCKlC8SvynqDjd?= =?us-ascii?Q?VlmkSAjVCtTWzezTdvpFhovF6TqHWQ75NMioPlQLu6hoT3lD4PKG1hZPLUSB?= =?us-ascii?Q?PgJaklKAoiK0doWeGt4K3cMKKvtwUTv7y8PBrsbJEMOeI1kojPXeoDjgozji?= =?us-ascii?Q?6X+E3LNyxUXoze0yCbRgxj04d12fXVIk62gYFVLx+/BMVQxsKvOZJTGLFX6p?= =?us-ascii?Q?hiXKJ/b6vyxRHHndFlWjznD/1pckpIn9sTbRnHXuxhLrHhsNrLZuXaWcsMqH?= =?us-ascii?Q?YsBl9b8n0WViMo7OfXKjTKYa4vMRMTOX8M05nsWZGkD38vNgtwmvwk89UmRJ?= =?us-ascii?Q?YczCUf8lRox/2EeQQWVF/IKibwY2ELPa8Zkhd2K8Ln6OzKcqWUuzOV1c01TJ?= =?us-ascii?Q?5TAxZaWPp0Ua9LjKFgoOigdMjw58B1XSFwZGjvX1DH+2WqUTL6Zbxd1nkE+V?= =?us-ascii?Q?r8EluP3vhctzbG3LzsO4moc=3D?= MIME-Version: 1.0 X-OriginatorOrg: multicorewareinc.com X-MS-Exchange-CrossTenant-AuthAs: Internal X-MS-Exchange-CrossTenant-AuthSource: PN3P287MB3339.INDP287.PROD.OUTLOOK.COM X-MS-Exchange-CrossTenant-Network-Message-Id: 40135b2c-9864-46df-d2b4-08ddc8280ef5 X-MS-Exchange-CrossTenant-originalarrivaltime: 21 Jul 2025 07:27:33.1695 (UTC) X-MS-Exchange-CrossTenant-fromentityheader: Hosted X-MS-Exchange-CrossTenant-id: ffc5e88b-3fa2-4d69-a468-344b6b766e7d X-MS-Exchange-CrossTenant-mailboxtype: HOSTED X-MS-Exchange-CrossTenant-userprincipalname: UwOkxFekNobdH/3wb81Bn8I2rgsrqn6YgdnDN7ryZjd3zXPuEpvcXKqxnmK01CJBRFRZp5MDNrmzg8VhsohmbBUbmFBy3Av24TDI+eUb9EkXAlciK0DK31HytvV4ibMBr5c72mN32bw/hzyw+C+0PA== X-MS-Exchange-Transport-CrossTenantHeadersStamped: MA0P287MB0829 X-Content-Filtered-By: Mailman/MimeDel 2.1.29 Subject: Re: [FFmpeg-devel] [PATCH v2 1/1] swscale/aarch64/output: Implement neon assembly for yuv2planeX_10_c_template() X-BeenThere: ffmpeg-devel@ffmpeg.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: FFmpeg development discussions and patches List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Reply-To: FFmpeg development discussions and patches Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Errors-To: ffmpeg-devel-bounces@ffmpeg.org Sender: "ffmpeg-devel" Archived-At: List-Archive: List-Post: Tried applying the diff from patchwork (https://patchwork.ffmpeg.org/project/ffmpeg/patch/PN3P287MB3339E53168F261F6CCCF13B79A57A@PN3P287MB3339.INDP287.PROD.OUTLOOK.COM/raw/), it applies fine on master. The mbox patch (https://patchwork.ffmpeg.org/project/ffmpeg/patch/PN3P287MB3339E53168F261F6CCCF13B79A57A@PN3P287MB3339.INDP287.PROD.OUTLOOK.COM/mbox/) thou fails to apply saying "empty patch". I had to delete lines 1-199 manually and then it applied. Thanks, Logaprakash. ________________________________ From: ffmpeg-devel on behalf of Michael Niedermayer Sent: Friday, July 18, 2025 4:29 PM To: FFmpeg development discussions and patches Subject: Re: [FFmpeg-devel] [PATCH v2 1/1] swscale/aarch64/output: Implement neon assembly for yuv2planeX_10_c_template() On Tue, Jul 15, 2025 at 05:06:54AM +0000, Logaprakash Ramajayam wrote: > Handled all the comments and attaching the checkasm results. > > Checkasm Benchmark results: > > > yuv2yuvX_10_LE_16_0_512_accurate_c: 7836.9 ( 1.00x) > yuv2yuvX_10_LE_16_0_512_accurate_neon: 840.4 ( 9.33x) > yuv2yuvX_10_LE_16_0_512_approximate_c: 7930.8 ( 1.00x) > yuv2yuvX_10_LE_16_0_512_approximate_neon: 838.5 ( 9.46x) > yuv2yuvX_10_LE_16_16_512_accurate_c: 7594.3 ( 1.00x) > yuv2yuvX_10_LE_16_16_512_accurate_neon: 815.2 ( 9.32x) > yuv2yuvX_10_LE_16_16_512_approximate_c: 7687.0 ( 1.00x) > yuv2yuvX_10_LE_16_16_512_approximate_neon: 811.9 ( 9.47x) > yuv2yuvX_10_LE_16_32_512_accurate_c: 7366.4 ( 1.00x) > yuv2yuvX_10_LE_16_32_512_accurate_neon: 785.8 ( 9.37x) > yuv2yuvX_10_LE_16_32_512_approximate_c: 7426.5 ( 1.00x) > yuv2yuvX_10_LE_16_32_512_approximate_neon: 786.4 ( 9.44x) > yuv2yuvX_10_LE_16_48_512_accurate_c: 7123.1 ( 1.00x) > yuv2yuvX_10_LE_16_48_512_accurate_neon: 761.7 ( 9.35x) > yuv2yuvX_10_LE_16_48_512_approximate_c: 7182.7 ( 1.00x) > yuv2yuvX_10_LE_16_48_512_approximate_neon: 763.0 ( 9.41x) > yuv2yuvX_10_BE_16_0_512_accurate_c: 8092.6 ( 1.00x) > yuv2yuvX_10_BE_16_0_512_accurate_neon: 860.2 ( 9.41x) > yuv2yuvX_10_BE_16_0_512_approximate_c: 8183.5 ( 1.00x) > yuv2yuvX_10_BE_16_0_512_approximate_neon: 861.4 ( 9.50x) > yuv2yuvX_10_BE_16_16_512_accurate_c: 7837.4 ( 1.00x) > yuv2yuvX_10_BE_16_16_512_accurate_neon: 834.0 ( 9.40x) > yuv2yuvX_10_BE_16_16_512_approximate_c: 7927.9 ( 1.00x) > yuv2yuvX_10_BE_16_16_512_approximate_neon: 834.6 ( 9.50x) > yuv2yuvX_10_BE_16_32_512_accurate_c: 7605.1 ( 1.00x) > yuv2yuvX_10_BE_16_32_512_accurate_neon: 807.5 ( 9.42x) > yuv2yuvX_10_BE_16_32_512_approximate_c: 7691.4 ( 1.00x) > yuv2yuvX_10_BE_16_32_512_approximate_neon: 807.3 ( 9.53x) > yuv2yuvX_10_BE_16_48_512_accurate_c: 7344.3 ( 1.00x) > yuv2yuvX_10_BE_16_48_512_accurate_neon: 782.7 ( 9.38x) > yuv2yuvX_10_BE_16_48_512_approximate_c: 7440.1 ( 1.00x) > yuv2yuvX_10_BE_16_48_512_approximate_neon: 781.9 ( 9.51x) > > From 3e14b4c2e763d2d0c8979e3e99578f5492b7130c Mon Sep 17 00:00:00 2001 > From: Logaprakash Ramajayam > Date: Tue, 1 Jul 2025 23:48:36 -0700 > Subject: [PATCH v2 1/1] swscale/aarch64/output: Implement neon assembly for yuv2planeX_10_c_template() > > --- > libswscale/aarch64/output.S | 189 +++++++++++++++++++++++++++++++++++ > libswscale/aarch64/swscale.c | 38 +++++++ > tests/checkasm/sw_scale.c | 170 ++++++++++++++++++++----------- > 3 files changed, 337 insertions(+), 60 deletions(-) Applying: swscale/aarch64/output: Implement neon assembly for yuv2planeX_10_c_template() Using index info to reconstruct a base tree... M tests/checkasm/sw_scale.c error: patch failed: libswscale/aarch64/output.S:20 error: libswscale/aarch64/output.S: patch does not apply error: patch failed: libswscale/aarch64/swscale.c:158 error: libswscale/aarch64/swscale.c: patch does not apply error: patch failed: tests/checkasm/sw_scale.c:52 error: tests/checkasm/sw_scale.c: patch does not apply error: Did you hand edit your patch? It does not apply to blobs recorded in its index. Patch failed at 0001 swscale/aarch64/output: Implement neon assembly for yuv2planeX_10_c_template() [...] -- Michael GnuPG fingerprint: 9FF2128B147EF6730BADF133611EC787040B0FAB No great genius has ever existed without some touch of madness. -- Aristotle _______________________________________________ ffmpeg-devel mailing list ffmpeg-devel@ffmpeg.org https://ffmpeg.org/mailman/listinfo/ffmpeg-devel To unsubscribe, visit link above, or email ffmpeg-devel-request@ffmpeg.org with subject "unsubscribe".