From: Andreas Rheinhardt
Date: Wed, 27 Jul 2022 19:49:41 +0200
To: ffmpeg-devel@ffmpeg.org
Subject: Re: [FFmpeg-devel] [PATCH] enable auto vectorization for gcc 7 and higher
References: <05a46152f1b2458ea326edd9cfb6d817@amazon.com>

James Almer:
> On 7/27/2022 2:34 PM, Swinney, Jonathan wrote:
>> I recognize that this patch is going to be somewhat controversial. I'm
>> submitting it mostly to see what the opinions are and evaluate
>> options. I am working on improving performance for aarch64. On that
>> architecture, there are fewer hand-written assembly implementations of
>> hot functions than there are for x86_64 and allowing gcc to
>> auto-vectorize yields noticeable improvements.
>>
>> Gcc vectorization has improved recently and it hasn't been evaluated
>> on the mailing list for a few years. This is the latest discussion I
>> found in my searches:
>> http://ffmpeg.org/pipermail/ffmpeg-devel/2016-May/193977.html
>
> Every time this was done, it was inevitably reverted after complaints and
> crash reports started piling up because gcc can't really handle all the
> inline code our codebase has, among other things.
>
>>
>> If the community is not comfortable accepting a patch like this
>> outright, would you be willing to accept a new option to the configure
>> script, something like --enable-auto-vectorization?
>
> --extra-cflags can be used for this.
>

No, it can't: whatever is given via --extra-cflags is inserted at the
start of CFLAGS, so the automatically added -fno-tree-vectorize comes
later on the command line and overrides it.

- Andreas
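
To illustrate the ordering problem, here is a minimal sketch in Python. It
is only a model, not the configure script's actual logic: the helper
effective_setting() and the example flag lists (including the -O3
placeholder) are hypothetical. It assumes the usual GCC behaviour that,
when both -fNAME and -fno-NAME are given, the one appearing last wins.

```python
def effective_setting(cflags, name):
    """Return True/False for the last -f<name>/-fno-<name> seen, else None.

    Models the assumed "last conflicting flag wins" rule; this is a
    hypothetical helper, not part of FFmpeg or GCC.
    """
    state = None
    for flag in cflags:
        if flag == f"-f{name}":
            state = True
        elif flag == f"-fno-{name}":
            state = False
    return state


# User-supplied flags (what --extra-cflags would carry in this model).
extra_cflags = ["-ftree-vectorize"]
# Flags added automatically afterwards; -O3 is only a placeholder here.
configure_added = ["-O3", "-fno-tree-vectorize"]

# Order described in the mail: the user's flags come first in CFLAGS.
cflags = extra_cflags + configure_added
print(effective_setting(cflags, "tree-vectorize"))
```

In this model the user's -ftree-vectorize is cancelled because it is
placed first. It would only take effect if it were appended after the
automatically added flags, which is the case a dedicated configure option
would have to cover.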