From: Andreas Rheinhardt
Date: Wed, 27 Jul 2022 23:33:45 +0200
To: ffmpeg-devel@ffmpeg.org
Subject: Re: [FFmpeg-devel] [PATCH] enable auto vectorization for gcc 7 and higher
References: <05a46152f1b2458ea326edd9cfb6d817@amazon.com>

Hendrik Leppkes:
> On Wed, Jul 27, 2022 at 7:39 PM James Almer wrote:
>>
>> On 7/27/2022 2:34 PM, Swinney, Jonathan wrote:
>>> I recognize that this patch is going to be somewhat controversial. I'm submitting it mostly to see what the opinions are and evaluate options. I am working on improving performance for aarch64. On that architecture, there are fewer hand written assembly implementations of hot functions than there are for x86_64 and allowing gcc to auto-vectorize yields noticeable improvements.
>>>
>>> Gcc vectorization has improved recently and it hasn't been evaluated on the mailing list for a few years. This is the latest discussion I found in my searches: http://ffmpeg.org/pipermail/ffmpeg-devel/2016-May/193977.html
>>
>> Every time this was done, it was inevitably reverted after complains and
>> crash reports started piling up because gcc can't really handle all the
>> inline code our codebase has, among other things.
>>
>
> No need to wait for issues, I just tested, and the same issues still
> persist that have existed for years with GCC now. They don't seem to
> care to make it compatible with inline asm, which might be fair
> enough, but it means it just can't work here.
>

Have the GCC devs been informed of this issue?

> In file included from libavcodec/cabac_functions.h:49,
>                  from libavcodec/h264_cabac.c:36:
> libavcodec/h264_cabac.c: In function 'ff_h264_decode_mb_cabac':
> libavcodec/x86/cabac.h:199:5: error: 'asm' operand has impossible constraints
>
> GCC 11.3, configure --cpu=haswell, mingw32
>
> So this is a NACK. It just flat out breaks builds.
>
> - Hendrik

_______________________________________________
ffmpeg-devel mailing list
ffmpeg-devel@ffmpeg.org
https://ffmpeg.org/mailman/listinfo/ffmpeg-devel

To unsubscribe, visit link above, or email
ffmpeg-devel-request@ffmpeg.org with subject "unsubscribe".