From mboxrd@z Thu Jan 1 00:00:00 1970
From: Dmitriy Kovalenko
To: ffmpeg-devel@ffmpeg.org
Date: Thu, 29 May 2025 23:34:22 +0200
X-Mailer: git-send-email 2.49.0
X-Microsoft-Original-Message-ID: <20250529213746.53845-1-dmtr.kovalenko@outlook.com>
MIME-Version: 1.0
Subject: [FFmpeg-devel] [PATCH v2 0/2] swscale: neon aarch64 rgb_to_yuv optimizations
List-Id: FFmpeg development discussions and patches
Reply-To: FFmpeg development discussions and patches
Cc: Dmitriy Kovalenko
Content-Type: multipart/mixed; boundary="===============4602092455877571315=="

--===============4602092455877571315==
Content-Type: text/plain; charset=y
Content-Transfer-Encoding: 8bit

This is a follow-up based on the review feedback from Martin Storsjö.
I fixed all the indentation issues, added post-increment to the loads
in all of the macros used, and moved the code that was expanded inline
in the previous version back into macros.

Regarding the prefetch instructions: they definitely make no visible
difference on newer CPUs such as the one in a MacBook Pro, but in my
tests I see a noticeable 3-5% performance difference on more
mobile-class devices, for example an iPhone 8 with the A11 Bionic CPU
or an Amazon Fire HD, which are the devices I am especially interested
in optimizing for. According to my checkasm tests they slow down
neither the macOS nor the Linux ARM builds, so why not keep them?

Dmitriy Kovalenko (2):
  swscale: rgb_to_yuv neon optimizations
  swscale: Neon rgb_to_yuv_half process 32 pixels at a time

 libswscale/aarch64/input.S | 212 +++++++++++++++++++++++++++----------
 1 file changed, 155 insertions(+), 57 deletions(-)

-- 
2.49.0

--===============4602092455877571315==
Content-Type: text/plain; charset="us-ascii"
MIME-Version: 1.0
Content-Transfer-Encoding: 7bit
Content-Disposition: inline

_______________________________________________
ffmpeg-devel mailing list
ffmpeg-devel@ffmpeg.org
https://ffmpeg.org/mailman/listinfo/ffmpeg-devel

To unsubscribe, visit link above, or email
ffmpeg-devel-request@ffmpeg.org with subject "unsubscribe".

--===============4602092455877571315==--