From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from ffbox0-bg.mplayerhq.hu (ffbox0-bg.ffmpeg.org [79.124.17.100]) by master.gitmailbox.com (Postfix) with ESMTP id 9756E437B0 for ; Thu, 28 Jul 2022 18:51:48 +0000 (UTC) Received: from [127.0.1.1] (localhost [127.0.0.1]) by ffbox0-bg.mplayerhq.hu (Postfix) with ESMTP id AEE1968B9B5; Thu, 28 Jul 2022 21:51:46 +0300 (EEST) Received: from smtp-fw-9102.amazon.com (smtp-fw-9102.amazon.com [207.171.184.29]) by ffbox0-bg.mplayerhq.hu (Postfix) with ESMTPS id 19ECC68B863 for ; Thu, 28 Jul 2022 21:51:38 +0300 (EEST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=amazon.com; i=@amazon.com; q=dns/txt; s=amazon201209; t=1659034305; x=1690570305; h=from:to:cc:date:message-id:references:in-reply-to: content-id:content-transfer-encoding:mime-version:subject; bh=xadtoQXf/BlU9ppnWjfMQILOcDuxa7T/ilXmfXGAZ+Q=; b=Ch8QQApX0UpQQhJ9mDv/MhW+6fgJi1LtbJPhspkPgtKiZZV2aok+dkIO 3Ay2wBBYY3ZiokJYDci6SUteowQ3amHk8ijMYBMTtoGxmAhiBNZOKqZK1 Rjgr8HH3kzW/sky7jjODQI8e7IVeEOTOl/aVDUwyGz5U58sLxnQ0p0tNp Q=; X-IronPort-AV: E=Sophos;i="5.93,199,1654560000"; d="scan'208";a="243231139" Thread-Topic: [PATCH] lavc/aarch64: Add neon implementation for sse4 Received: from pdx4-co-svc-p1-lb2-vlan2.amazon.com (HELO email-inbound-relay-pdx-2c-4213ea4c.us-west-2.amazon.com) ([10.25.36.210]) by smtp-border-fw-9102.sea19.amazon.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 28 Jul 2022 18:51:36 +0000 Received: from EX13MTAUWB001.ant.amazon.com (pdx1-ws-svc-p6-lb9-vlan3.pdx.amazon.com [10.236.137.198]) by email-inbound-relay-pdx-2c-4213ea4c.us-west-2.amazon.com (Postfix) with ESMTPS id 576E782407; Thu, 28 Jul 2022 18:51:36 +0000 (UTC) Received: from EX19D001UWB002.ant.amazon.com (10.13.138.54) by EX13MTAUWB001.ant.amazon.com (10.43.161.207) with Microsoft SMTP Server (TLS) id 15.0.1497.36; Thu, 28 Jul 2022 18:51:36 +0000 Received: from EX19D007UWB001.ant.amazon.com (10.13.138.75) by EX19D001UWB002.ant.amazon.com (10.13.138.54) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.2.986.26; Thu, 28 Jul 2022 18:51:35 +0000 Received: from EX19D007UWB001.ant.amazon.com ([fe80::bcaa:e18f:a569:3851]) by EX19D007UWB001.ant.amazon.com ([fe80::bcaa:e18f:a569:3851%6]) with mapi id 15.02.1118.009; Thu, 28 Jul 2022 18:51:35 +0000 From: "Swinney, Jonathan" To: Hubert Mazur , "ffmpeg-devel@ffmpeg.org" Thread-Index: AQHYoBf/r0obBiWgxEinHneDzmBqn62T0bwAgAAAYYA= Date: Thu, 28 Jul 2022 18:51:35 +0000 Message-ID: References: <20220715080228.686736-3-hum@semihalf.com> <20220725111541.44618-1-hum@semihalf.com> In-Reply-To: Accept-Language: en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: x-originating-ip: [10.43.161.236] Content-ID: MIME-Version: 1.0 Subject: Re: [FFmpeg-devel] [PATCH] lavc/aarch64: Add neon implementation for sse4 X-BeenThere: ffmpeg-devel@ffmpeg.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: FFmpeg development discussions and patches List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Reply-To: FFmpeg development discussions and patches Cc: "martin@martin.st" , "mw@semihalf.com" , "upstream@semihalf.com" , "Pop, Sebastian" , "gjb@semihalf.com" Content-Type: text/plain; charset="utf-8" Content-Transfer-Encoding: base64 Errors-To: ffmpeg-devel-bounces@ffmpeg.org Sender: "ffmpeg-devel" Archived-At: List-Archive: List-Post: PiBUaGVyZSBpcyBvbmUgbW9yZSBwbGFjZSB0byBtb3ZlIHRoZSBzdWIsIGNtcCBhbmQgYnJhbmNo IGluc3RydWN0aW9ucyBhcGFydCBpbiBzc2UxNl9uZW9uLiBJdCBkb2Vzbid0IHNlZW0gdG8gbWFr ZSBhbnkgZGlmZmVyZW5jZSB0byBOZW92ZXJzZSBOMSBhbmQgVjEgYW5kIGl0IG1heSBoZWxwIEE1 My4NCg0KU29ycnktLSBJIG1lYW50IHNzZTRfbmVvbi4NCg0KLS0gDQoNCkpvbmF0aGFuIFN3aW5u ZXkNCg0K77u/T24gNy8yOC8yMiwgMTo1MCBQTSwgIlN3aW5uZXksIEpvbmF0aGFuIiA8anN3aW5u ZXlAYW1hem9uLmNvbT4gd3JvdGU6DQoNCiAgICBZb3VyIGxhdGVzdCBzZXQgb2YgcGF0Y2hlcyBk aWRu4oCZdCBnZXQgaW50ZXJwcmV0ZWQgY29ycmVjdGx5IGJ5IHRoZSBwYXRjaHdvcmsgdG9vbC4g SSBzdXNwZWN0IGl0IHRvb2sgdGhlbSBpbiB0aGUgd3Jvbmcgb3JkZXIuIA0KDQogICAgaHR0cHM6 Ly9wYXRjaHdvcmsuZmZtcGVnLm9yZy9wcm9qZWN0L2ZmbXBlZy9saXN0Lz9zZXJpZXM9JnN1Ym1p dHRlcj0xNDc5JnN0YXRlPSZxPSZhcmNoaXZlPSZkZWxlZ2F0ZT0NCg0KICAgIFRoZXJlIGlzIG9u ZSBtb3JlIHBsYWNlIHRvIG1vdmUgdGhlIHN1YiwgY21wIGFuZCBicmFuY2ggaW5zdHJ1Y3Rpb25z IGFwYXJ0IGluIHNzZTE2X25lb24uIEl0IGRvZXNuJ3Qgc2VlbSB0byBtYWtlIGFueSBkaWZmZXJl bmNlIHRvIE5lb3ZlcnNlIE4xIGFuZCBWMSBhbmQgaXQgbWF5IGhlbHAgQTUzLg0KDQogICAgSSBk aWRuJ3Qgc2VlIGFueXRoaW5nIGVsc2UuDQoNCiAgICBUaGFua3MhDQogICAgLS0gDQoNCiAgICBK b25hdGhhbiBTd2lubmV5DQoNCiAgICBPbiA3LzI1LzIyLCA2OjE2IEFNLCAiSHViZXJ0IE1henVy IiA8aHVtQHNlbWloYWxmLmNvbT4gd3JvdGU6DQoNCiAgICAgICAgQ0FVVElPTjogVGhpcyBlbWFp bCBvcmlnaW5hdGVkIGZyb20gb3V0c2lkZSBvZiB0aGUgb3JnYW5pemF0aW9uLiBEbyBub3QgY2xp Y2sgbGlua3Mgb3Igb3BlbiBhdHRhY2htZW50cyB1bmxlc3MgeW91IGNhbiBjb25maXJtIHRoZSBz ZW5kZXIgYW5kIGtub3cgdGhlIGNvbnRlbnQgaXMgc2FmZS4NCg0KDQoNCiAgICAgICAgUHJvdmlk ZSBuZW9uIGltcGxlbWVudGF0aW9uIGZvciBzc2U0IGZ1bmN0aW9uLg0KDQogICAgICAgIFBlcmZv cm1hbmNlIGNvbXBhcmlzb24gdGVzdHMgYXJlIHNob3duIGJlbG93Lg0KICAgICAgICAtIHNzZV8y X2M6IDc0LjANCiAgICAgICAgLSBzc2VfMl9uZW9uOiAyNC4wDQoNCiAgICAgICAgQmVuY2htYXJr cyBhbmQgdGVzdHMgYXJlIHJ1biB3aXRoIGNoZWNrYXNtIHRvb2wgb24gQVdTIEdyYXZpdG9uIDMu DQoNCiAgICAgICAgU2lnbmVkLW9mZi1ieTogSHViZXJ0IE1henVyIDxodW1Ac2VtaWhhbGYuY29t Pg0KICAgICAgICAtLS0NCiAgICAgICAgIGxpYmF2Y29kZWMvYWFyY2g2NC9tZV9jbXBfaW5pdF9h YXJjaDY0LmMgfCAgMyArKw0KICAgICAgICAgbGliYXZjb2RlYy9hYXJjaDY0L21lX2NtcF9uZW9u LlMgICAgICAgICB8IDY1ICsrKysrKysrKysrKysrKysrKysrKysrKw0KICAgICAgICAgMiBmaWxl cyBjaGFuZ2VkLCA2OCBpbnNlcnRpb25zKCspDQoNCiAgICAgICAgZGlmZiAtLWdpdCBhL2xpYmF2 Y29kZWMvYWFyY2g2NC9tZV9jbXBfaW5pdF9hYXJjaDY0LmMgYi9saWJhdmNvZGVjL2FhcmNoNjQv bWVfY21wX2luaXRfYWFyY2g2NC5jDQogICAgICAgIGluZGV4IDNmZjU3NjdiZDAuLjcyYTIwNjJl N2UgMTAwNjQ0DQogICAgICAgIC0tLSBhL2xpYmF2Y29kZWMvYWFyY2g2NC9tZV9jbXBfaW5pdF9h YXJjaDY0LmMNCiAgICAgICAgKysrIGIvbGliYXZjb2RlYy9hYXJjaDY0L21lX2NtcF9pbml0X2Fh cmNoNjQuYw0KICAgICAgICBAQCAtMzIsNiArMzIsOCBAQCBpbnQgZmZfcGl4X2FiczE2X3gyX25l b24oTXBlZ0VuY0NvbnRleHQgKnYsIHVpbnQ4X3QgKnBpeDEsIHVpbnQ4X3QgKnBpeDIsDQoNCiAg ICAgICAgIGludCBzc2UxNl9uZW9uKE1wZWdFbmNDb250ZXh0ICp2LCB1aW50OF90ICpwaXgxLCB1 aW50OF90ICpwaXgyLA0KICAgICAgICAgICAgICAgICAgICAgICAgICAgcHRyZGlmZl90IHN0cmlk ZSwgaW50IGgpOw0KICAgICAgICAraW50IHNzZTRfbmVvbihNcGVnRW5jQ29udGV4dCAqdiwgdWlu dDhfdCAqcGl4MSwgdWludDhfdCAqcGl4MiwNCiAgICAgICAgKyAgICAgICAgICAgICAgICAgIHB0 cmRpZmZfdCBzdHJpZGUsIGludCBoKTsNCg0KICAgICAgICAgYXZfY29sZCB2b2lkIGZmX21lX2Nt cF9pbml0X2FhcmNoNjQoTUVDbXBDb250ZXh0ICpjLCBBVkNvZGVjQ29udGV4dCAqYXZjdHgpDQog ICAgICAgICB7DQogICAgICAgIEBAIC00NCw1ICs0Niw2IEBAIGF2X2NvbGQgdm9pZCBmZl9tZV9j bXBfaW5pdF9hYXJjaDY0KE1FQ21wQ29udGV4dCAqYywgQVZDb2RlY0NvbnRleHQgKmF2Y3R4KQ0K DQogICAgICAgICAgICAgICAgIGMtPnNhZFswXSA9IGZmX3BpeF9hYnMxNl9uZW9uOw0KICAgICAg ICAgICAgICAgICBjLT5zc2VbMF0gPSBzc2UxNl9uZW9uOw0KICAgICAgICArICAgICAgICBjLT5z c2VbMl0gPSBzc2U0X25lb247DQogICAgICAgICAgICAgfQ0KICAgICAgICAgfQ0KICAgICAgICBk aWZmIC0tZ2l0IGEvbGliYXZjb2RlYy9hYXJjaDY0L21lX2NtcF9uZW9uLlMgYi9saWJhdmNvZGVj L2FhcmNoNjQvbWVfY21wX25lb24uUw0KICAgICAgICBpbmRleCA5OGM5MTJiNjA4Li4zMzM2ZDg4 ODQ4IDEwMDY0NA0KICAgICAgICAtLS0gYS9saWJhdmNvZGVjL2FhcmNoNjQvbWVfY21wX25lb24u Uw0KICAgICAgICArKysgYi9saWJhdmNvZGVjL2FhcmNoNjQvbWVfY21wX25lb24uUw0KICAgICAg ICBAQCAtMzUyLDMgKzM1Miw2OCBAQCBmdW5jdGlvbiBzc2UxNl9uZW9uLCBleHBvcnQ9MQ0KICAg ICAgICAgICAgICAgICByZXQNCg0KICAgICAgICAgZW5kZnVuYw0KICAgICAgICArDQogICAgICAg ICtmdW5jdGlvbiBzc2U0X25lb24sIGV4cG9ydD0xDQogICAgICAgICsgICAgICAgIC8vIHgwIC0g dW51c2VkDQogICAgICAgICsgICAgICAgIC8vIHgxIC0gcGl4MQ0KICAgICAgICArICAgICAgICAv LyB4MiAtIHBpeDINCiAgICAgICAgKyAgICAgICAgLy8geDMgLSBzdHJpZGUNCiAgICAgICAgKyAg ICAgICAgLy8gdzQgLSBoDQogICAgICAgICsNCiAgICAgICAgKyAgICAgICAgbW92aSAgICAgICAg ICAgIGQxOCwgIzANCiAgICAgICAgKyAgICAgICAgbW92aSAgICAgICAgICAgIGQxNywgIzANCiAg ICAgICAgKyAgICAgICAgY21wICAgICAgICAgICAgIHc0LCAjNA0KICAgICAgICArICAgICAgICBi LmxlICAgICAgICAgICAgMmYNCiAgICAgICAgKw0KICAgICAgICArLy8gbWFrZSA0IGl0ZXJhdGlv bnMgYXQgb25jZQ0KICAgICAgICArMToNCiAgICAgICAgKw0KICAgICAgICArICAgICAgICAvLyBy ZXMgPSBhYnMocGl4MVswXSAtIHBpeDJbMF0pDQogICAgICAgICsgICAgICAgIC8vIHJlcyAqIHJl cw0KICAgICAgICArDQogICAgICAgICsgICAgICAgIGxkMSAgICAgICAgICAgICB7djAuc31bMF0s IFt4MV0sIHgzICAgICAgICAgICAgIC8vIExvYWQgcGl4MSwgZmlyc3QgaXRlcmF0aW9uDQogICAg ICAgICsgICAgICAgIGxkMSAgICAgICAgICAgICB7djEuc31bMF0sIFt4Ml0sIHgzICAgICAgICAg ICAgIC8vIExvYWQgcGl4MiwgZmlyc3QgaXRlcmF0aW9uDQogICAgICAgICsgICAgICAgIHVhYmRs ICAgICAgICAgICB2MzAuOGgsIHYwLjhiLCB2MS44YiAgICAgICAgICAgIC8vIEFic29sdXRlIGRp ZmZlcmVuY2UsIGZpcnN0IGl0ZXJhdGlvbg0KICAgICAgICArICAgICAgICBsZDEgICAgICAgICAg ICAge3YyLnN9WzBdLCBbeDFdLCB4MyAgICAgICAgICAgICAvLyBMb2FkIHBpeDEsIHNlY29uZCBp dGVyYXRpb24NCiAgICAgICAgKyAgICAgICAgbGQxICAgICAgICAgICAgIHt2My5zfVswXSwgW3gy XSwgeDMgICAgICAgICAgICAgLy8gTG9hZCBwaXgyLCBzZWNvbmQgaXRlcmF0aW9uDQogICAgICAg ICsgICAgICAgIHVtdWxsICAgICAgICAgICB2MTYuNHMsIHYzMC40aCwgdjMwLjRoICAgICAgICAg IC8vIE11bHRpcGx5IHZlY3RvcnMsIGZpcnN0IGl0ZXJhdGlvbg0KICAgICAgICArICAgICAgICB1 YWJkbCAgICAgICAgICAgdjI5LjhoLCB2Mi44YiwgdjMuOGIgICAgICAgICAgICAvLyBBYnNvbHV0 ZSBkaWZmZXJlbmNlLCBzZWNvbmQgaXRlcmF0aW9uDQogICAgICAgICsgICAgICAgIGxkMSAgICAg ICAgICAgICB7djQuc31bMF0sIFt4MV0sIHgzICAgICAgICAgICAgIC8vIExvYWQgcGl4MSwgdGhp cmQgaXRlcmF0aW9uDQogICAgICAgICsgICAgICAgIGxkMSAgICAgICAgICAgICB7djUuc31bMF0s IFt4Ml0sIHgzICAgICAgICAgICAgIC8vIExvYWQgcGl4MiwgdGhpcmQgaXRlcmF0aW9uDQogICAg ICAgICsgICAgICAgIHVtbGFsICAgICAgICAgICB2MTYuNHMsIHYyOS40aCwgdjI5LjRoICAgICAg ICAgIC8vIE11bHRpcGx5IGFuZCBhY2N1bXVsYXRlLCBzZWNvbmQgaXRlcmF0aW9uDQogICAgICAg ICsgICAgICAgIHVhYmRsICAgICAgICAgICB2MjguOGgsIHY0LjhiLCB2NS44YiAgICAgICAgICAg IC8vIEFic29sdXRlIGRpZmZlcmVuY2UsIHRoaXJkIGl0ZXJhdGlvbg0KICAgICAgICArICAgICAg ICBsZDEgICAgICAgICAgICAge3Y2LnN9WzBdLCBbeDFdLCB4MyAgICAgICAgICAgICAvLyBMb2Fk IHBpeDEsIGZvdXJ0aCBpdGVyYXRpb24NCiAgICAgICAgKyAgICAgICAgbGQxICAgICAgICAgICAg IHt2Ny5zfVswXSwgW3gyXSwgeDMgICAgICAgICAgICAgLy8gTG9hZCBwaXgyLCBmb3VydGggaXRl cmF0aW9uDQogICAgICAgICsgICAgICAgIHVtbGFsICAgICAgICAgICB2MTYuNHMsIHYyOC40aCwg djI4LjRoICAgICAgICAgIC8vIE11bHRpcGx5IGFuZCBhY2N1bXVsYXRlLCB0aGlyZCBpdGVyYXRp b24NCiAgICAgICAgKyAgICAgICAgdWFiZGwgICAgICAgICAgIHYyNy44aCwgdjYuOGIsIHY3Ljhi ICAgICAgICAgICAgLy8gQWJzb2x1ZSBkaWZmZXJlbmNlLCBmb3VydGggaXRlcmF0aW9uDQogICAg ICAgICsgICAgICAgIHVtbGFsICAgICAgICAgICB2MTYuNHMsIHYyNy40aCwgdjI3LjRoICAgICAg ICAgIC8vIE11bHRpcGx5IGFuZCBhY2N1bXVsYXRlLCBmb3VydGggaXRlcmF0aW9uDQogICAgICAg ICsNCiAgICAgICAgKyAgICAgICAgdWFkZGx2ICAgICAgICAgIGQxNywgdjE2LjRzICAgICAgICAg ICAgICAgICAgICAgLy8gQWRkIHZlY3Rvcg0KICAgICAgICArICAgICAgICBhZGQgICAgICAgICAg ICAgZDE4LCBkMTgsIGQxNw0KICAgICAgICArDQogICAgICAgICsgICAgICAgIHN1YiAgICAgICAg ICAgICB3NCwgdzQsICM0DQogICAgICAgICsgICAgICAgIGNtcCAgICAgICAgICAgICB3NCwgIzQN CiAgICAgICAgKyAgICAgICAgYi5nZSAgICAgICAgICAgIDFiDQogICAgICAgICsNCiAgICAgICAg KyAgICAgICAgY2JueiAgICAgICAgICAgIHc0LCAyZg0KICAgICAgICArICAgICAgICBmbW92ICAg ICAgICAgICAgdzAsIHMxOA0KICAgICAgICArDQogICAgICAgICsgICAgICAgIHJldA0KICAgICAg ICArDQogICAgICAgICsvLyBpdGVyYXRlIGJ5IG9uZQ0KICAgICAgICArMjoNCiAgICAgICAgKyAg ICAgICAgbGQxICAgICAgICAgICAgIHt2MC5zfVswXSwgW3gxXSwgeDMgICAgICAgICAgICAgICAv LyBMb2FkIHBpeDENCiAgICAgICAgKyAgICAgICAgbGQxICAgICAgICAgICAgIHt2MS5zfVswXSwg W3gyXSwgeDMgICAgICAgICAgICAgICAvLyBMb2FkIHBpeDINCiAgICAgICAgKyAgICAgICAgdWFi ZGwgICAgICAgICAgIHYzMC44aCwgdjAuOGIsIHYxLjhiDQogICAgICAgICsgICAgICAgIHVtdWxs ICAgICAgICAgICB2MTYuNHMsIHYzMC40aCwgdjMwLjRoDQogICAgICAgICsNCiAgICAgICAgKyAg ICAgICAgdWFkZGx2ICAgICAgICAgIGQxNywgdjE2LjRzDQogICAgICAgICsgICAgICAgIGFkZCAg ICAgICAgICAgICBkMTgsIGQxOCwgZDE3DQogICAgICAgICsNCiAgICAgICAgKyAgICAgICAgc3Vi cyAgICAgICAgICAgIHc0LCB3NCwgIzENCiAgICAgICAgKyAgICAgICAgYi5uZSAgICAgICAgICAg IDJiDQogICAgICAgICsgICAgICAgIGZtb3YgICAgICAgICAgICB3MCwgczE4DQogICAgICAgICsN CiAgICAgICAgKyAgICAgICAgcmV0DQogICAgICAgICsNCiAgICAgICAgK2VuZGZ1bmMNCiAgICAg ICAgLS0NCiAgICAgICAgMi4zNC4xDQoNCg0KDQpfX19fX19fX19fX19fX19fX19fX19fX19fX19f X19fX19fX19fX19fX19fX19fXwpmZm1wZWctZGV2ZWwgbWFpbGluZyBsaXN0CmZmbXBlZy1kZXZl bEBmZm1wZWcub3JnCmh0dHBzOi8vZmZtcGVnLm9yZy9tYWlsbWFuL2xpc3RpbmZvL2ZmbXBlZy1k ZXZlbAoKVG8gdW5zdWJzY3JpYmUsIHZpc2l0IGxpbmsgYWJvdmUsIG9yIGVtYWlsCmZmbXBlZy1k ZXZlbC1yZXF1ZXN0QGZmbXBlZy5vcmcgd2l0aCBzdWJqZWN0ICJ1bnN1YnNjcmliZSIuCg==