From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from ffbox0-bg.mplayerhq.hu (ffbox0-bg.ffmpeg.org [79.124.17.100]) by master.gitmailbox.com (Postfix) with ESMTP id 74C2D40086 for ; Sat, 17 Sep 2022 12:46:09 +0000 (UTC) Received: from [127.0.1.1] (localhost [127.0.0.1]) by ffbox0-bg.mplayerhq.hu (Postfix) with ESMTP id 5130668BC10; Sat, 17 Sep 2022 15:45:47 +0300 (EEST) Received: from ursule.remlab.net (vps-a2bccee9.vps.ovh.net [51.75.19.47]) by ffbox0-bg.mplayerhq.hu (Postfix) with ESMTP id 0F7A668BBCA for ; Sat, 17 Sep 2022 15:45:42 +0300 (EEST) Received: from basile.remlab.net (localhost [IPv6:::1]) by ursule.remlab.net (Postfix) with ESMTP id 29518C00B2 for ; Sat, 17 Sep 2022 15:45:38 +0300 (EEST) From: remi@remlab.net To: ffmpeg-devel@ffmpeg.org Date: Sat, 17 Sep 2022 15:45:37 +0300 Message-Id: <20220917124537.66238-6-remi@remlab.net> X-Mailer: git-send-email 2.37.2 In-Reply-To: <5602047.DvuYhMxLoT@basile.remlab.net> References: <5602047.DvuYhMxLoT@basile.remlab.net> MIME-Version: 1.0 Subject: [FFmpeg-devel] [PATCH 6/6] lavc/pixblockdsp: RISC-V scalar optimisations X-BeenThere: ffmpeg-devel@ffmpeg.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: FFmpeg development discussions and patches List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Reply-To: FFmpeg development discussions and patches Content-Type: text/plain; charset="utf-8" Content-Transfer-Encoding: base64 Errors-To: ffmpeg-devel-bounces@ffmpeg.org Sender: "ffmpeg-devel" Archived-At: List-Archive: List-Post: RnJvbTogUsOpbWkgRGVuaXMtQ291cm1vbnQgPHJlbWlAcmVtbGFiLm5ldD4KCkJlbmNobWFya3M6 CmdldF9waXhlbHNfYzogMTgwLjAKZ2V0X3BpeGVsc19ydmk6IDEzNi43Ci0tLQogbGliYXZjb2Rl Yy9waXhibG9ja2RzcC5jICAgICAgICAgICAgfCAgMiArCiBsaWJhdmNvZGVjL3BpeGJsb2NrZHNw LmggICAgICAgICAgICB8ICAyICsKIGxpYmF2Y29kZWMvcmlzY3YvTWFrZWZpbGUgICAgICAgICAg IHwgIDIgKwogbGliYXZjb2RlYy9yaXNjdi9waXhibG9ja2RzcF9pbml0LmMgfCA0MyArKysrKysr KysrKysrKysrKysrKysrCiBsaWJhdmNvZGVjL3Jpc2N2L3BpeGJsb2NrZHNwX3J2aS5TICB8IDU3 ICsrKysrKysrKysrKysrKysrKysrKysrKysrKysrCiA1IGZpbGVzIGNoYW5nZWQsIDEwNiBpbnNl cnRpb25zKCspCiBjcmVhdGUgbW9kZSAxMDA2NDQgbGliYXZjb2RlYy9yaXNjdi9waXhibG9ja2Rz cF9pbml0LmMKIGNyZWF0ZSBtb2RlIDEwMDY0NCBsaWJhdmNvZGVjL3Jpc2N2L3BpeGJsb2NrZHNw X3J2aS5TCgpkaWZmIC0tZ2l0IGEvbGliYXZjb2RlYy9waXhibG9ja2RzcC5jIGIvbGliYXZjb2Rl Yy9waXhibG9ja2RzcC5jCmluZGV4IDE3YzQ4N2RhMWUuLjQyOTQwNzVjZWUgMTAwNjQ0Ci0tLSBh L2xpYmF2Y29kZWMvcGl4YmxvY2tkc3AuYworKysgYi9saWJhdmNvZGVjL3BpeGJsb2NrZHNwLmMK QEAgLTEwOSw2ICsxMDksOCBAQCBhdl9jb2xkIHZvaWQgZmZfcGl4YmxvY2tkc3BfaW5pdChQaXhi bG9ja0RTUENvbnRleHQgKmMsIEFWQ29kZWNDb250ZXh0ICphdmN0eCkKICAgICBmZl9waXhibG9j a2RzcF9pbml0X2FybShjLCBhdmN0eCwgaGlnaF9iaXRfZGVwdGgpOwogI2VsaWYgQVJDSF9QUEMK ICAgICBmZl9waXhibG9ja2RzcF9pbml0X3BwYyhjLCBhdmN0eCwgaGlnaF9iaXRfZGVwdGgpOwor I2VsaWYgQVJDSF9SSVNDVgorICAgIGZmX3BpeGJsb2NrZHNwX2luaXRfcmlzY3YoYywgYXZjdHgs IGhpZ2hfYml0X2RlcHRoKTsKICNlbGlmIEFSQ0hfWDg2CiAgICAgZmZfcGl4YmxvY2tkc3BfaW5p dF94ODYoYywgYXZjdHgsIGhpZ2hfYml0X2RlcHRoKTsKICNlbGlmIEFSQ0hfTUlQUwpkaWZmIC0t Z2l0IGEvbGliYXZjb2RlYy9waXhibG9ja2RzcC5oIGIvbGliYXZjb2RlYy9waXhibG9ja2RzcC5o CmluZGV4IDA3YzJlYzRmNDAuLjliMDAyYWEzZDYgMTAwNjQ0Ci0tLSBhL2xpYmF2Y29kZWMvcGl4 YmxvY2tkc3AuaAorKysgYi9saWJhdmNvZGVjL3BpeGJsb2NrZHNwLmgKQEAgLTUyLDYgKzUyLDgg QEAgdm9pZCBmZl9waXhibG9ja2RzcF9pbml0X2FybShQaXhibG9ja0RTUENvbnRleHQgKmMsIEFW Q29kZWNDb250ZXh0ICphdmN0eCwKICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgdW5zaWdu ZWQgaGlnaF9iaXRfZGVwdGgpOwogdm9pZCBmZl9waXhibG9ja2RzcF9pbml0X3BwYyhQaXhibG9j a0RTUENvbnRleHQgKmMsIEFWQ29kZWNDb250ZXh0ICphdmN0eCwKICAgICAgICAgICAgICAgICAg ICAgICAgICAgICAgdW5zaWduZWQgaGlnaF9iaXRfZGVwdGgpOwordm9pZCBmZl9waXhibG9ja2Rz cF9pbml0X3Jpc2N2KFBpeGJsb2NrRFNQQ29udGV4dCAqYywgQVZDb2RlY0NvbnRleHQgKmF2Y3R4 LAorICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgIHVuc2lnbmVkIGhpZ2hfYml0X2RlcHRo KTsKIHZvaWQgZmZfcGl4YmxvY2tkc3BfaW5pdF94ODYoUGl4YmxvY2tEU1BDb250ZXh0ICpjLCBB VkNvZGVjQ29udGV4dCAqYXZjdHgsCiAgICAgICAgICAgICAgICAgICAgICAgICAgICAgIHVuc2ln bmVkIGhpZ2hfYml0X2RlcHRoKTsKIHZvaWQgZmZfcGl4YmxvY2tkc3BfaW5pdF9taXBzKFBpeGJs b2NrRFNQQ29udGV4dCAqYywgQVZDb2RlY0NvbnRleHQgKmF2Y3R4LApkaWZmIC0tZ2l0IGEvbGli YXZjb2RlYy9yaXNjdi9NYWtlZmlsZSBiL2xpYmF2Y29kZWMvcmlzY3YvTWFrZWZpbGUKaW5kZXgg NDE0YTllOWJkOC4uZGEwN2YxZmU5NiAxMDA2NDQKLS0tIGEvbGliYXZjb2RlYy9yaXNjdi9NYWtl ZmlsZQorKysgYi9saWJhdmNvZGVjL3Jpc2N2L01ha2VmaWxlCkBAIC0xLDIgKzEsNCBAQAogT0JK Uy0kKENPTkZJR19BVURJT0RTUCkgKz0gcmlzY3YvYXVkaW9kc3BfaW5pdC5vIFwKICAgICAgICAg ICAgICAgICAgICAgICAgICAgIHJpc2N2L2F1ZGlvZHNwX3J2Zi5vCitPQkpTLSQoQ09ORklHX1BJ WEJMT0NLRFNQKSArPSByaXNjdi9waXhibG9ja2RzcF9pbml0Lm8gXAorICAgICAgICAgICAgICAg ICAgICAgICAgICAgICAgcmlzY3YvcGl4YmxvY2tkc3BfcnZpLm8KZGlmZiAtLWdpdCBhL2xpYmF2 Y29kZWMvcmlzY3YvcGl4YmxvY2tkc3BfaW5pdC5jIGIvbGliYXZjb2RlYy9yaXNjdi9waXhibG9j a2RzcF9pbml0LmMKbmV3IGZpbGUgbW9kZSAxMDA2NDQKaW5kZXggMDAwMDAwMDAwMC4uZjQ4OWVj NTI4YgotLS0gL2Rldi9udWxsCisrKyBiL2xpYmF2Y29kZWMvcmlzY3YvcGl4YmxvY2tkc3BfaW5p dC5jCkBAIC0wLDAgKzEsNDMgQEAKKy8qCisgKiBUaGlzIGZpbGUgaXMgcGFydCBvZiBGRm1wZWcu CisgKgorICogRkZtcGVnIGlzIGZyZWUgc29mdHdhcmU7IHlvdSBjYW4gcmVkaXN0cmlidXRlIGl0 IGFuZC9vcgorICogbW9kaWZ5IGl0IHVuZGVyIHRoZSB0ZXJtcyBvZiB0aGUgR05VIExlc3NlciBH ZW5lcmFsIFB1YmxpYworICogTGljZW5zZSBhcyBwdWJsaXNoZWQgYnkgdGhlIEZyZWUgU29mdHdh cmUgRm91bmRhdGlvbjsgZWl0aGVyCisgKiB2ZXJzaW9uIDIuMSBvZiB0aGUgTGljZW5zZSwgb3Ig KGF0IHlvdXIgb3B0aW9uKSBhbnkgbGF0ZXIgdmVyc2lvbi4KKyAqCisgKiBGRm1wZWcgaXMgZGlz dHJpYnV0ZWQgaW4gdGhlIGhvcGUgdGhhdCBpdCB3aWxsIGJlIHVzZWZ1bCwKKyAqIGJ1dCBXSVRI T1VUIEFOWSBXQVJSQU5UWTsgd2l0aG91dCBldmVuIHRoZSBpbXBsaWVkIHdhcnJhbnR5IG9mCisg KiBNRVJDSEFOVEFCSUxJVFkgb3IgRklUTkVTUyBGT1IgQSBQQVJUSUNVTEFSIFBVUlBPU0UuICBT ZWUgdGhlIEdOVQorICogTGVzc2VyIEdlbmVyYWwgUHVibGljIExpY2Vuc2UgZm9yIG1vcmUgZGV0 YWlscy4KKyAqCisgKiBZb3Ugc2hvdWxkIGhhdmUgcmVjZWl2ZWQgYSBjb3B5IG9mIHRoZSBHTlUg TGVzc2VyIEdlbmVyYWwgUHVibGljCisgKiBMaWNlbnNlIGFsb25nIHdpdGggRkZtcGVnOyBpZiBu b3QsIHdyaXRlIHRvIHRoZSBGcmVlIFNvZnR3YXJlCisgKiBGb3VuZGF0aW9uLCBJbmMuLCA1MSBG cmFua2xpbiBTdHJlZXQsIEZpZnRoIEZsb29yLCBCb3N0b24sIE1BIDAyMTEwLTEzMDEgVVNBCisg Ki8KKworI2luY2x1ZGUgPHN0ZGludC5oPgorCisjaW5jbHVkZSAibGliYXZ1dGlsL2F0dHJpYnV0 ZXMuaCIKKyNpbmNsdWRlICJsaWJhdnV0aWwvY3B1LmgiCisjaW5jbHVkZSAibGliYXZjb2RlYy9h dmNvZGVjLmgiCisjaW5jbHVkZSAibGliYXZjb2RlYy9waXhibG9ja2RzcC5oIgorCit2b2lkIGZm X2dldF9waXhlbHNfOF9ydmkoaW50MTZfdCAqYmxvY2ssIGNvbnN0IHVpbnQ4X3QgKnBpeGVscywK KyAgICAgICAgICAgICAgICAgICAgICAgICBwdHJkaWZmX3Qgc3RyaWRlKTsKK3ZvaWQgZmZfZ2V0 X3BpeGVsc18xNl9ydmkoaW50MTZfdCAqYmxvY2ssIGNvbnN0IHVpbnQ4X3QgKnBpeGVscywKKyAg ICAgICAgICAgICAgICAgICAgICAgICAgcHRyZGlmZl90IHN0cmlkZSk7CisKK2F2X2NvbGQgdm9p ZCBmZl9waXhibG9ja2RzcF9pbml0X3Jpc2N2KFBpeGJsb2NrRFNQQ29udGV4dCAqYywKKyAgICAg ICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgIEFWQ29kZWNDb250ZXh0ICphdmN0eCwK KyAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgIHVuc2lnbmVkIGhpZ2hfYml0 X2RlcHRoKQoreworICAgIGludCBjcHVfZmxhZ3MgPSBhdl9nZXRfY3B1X2ZsYWdzKCk7CisKKyAg ICBpZiAoY3B1X2ZsYWdzICYgQVZfQ1BVX0ZMQUdfUlZJKSB7CisgICAgICAgIGlmIChoaWdoX2Jp dF9kZXB0aCkKKyAgICAgICAgICAgIGMtPmdldF9waXhlbHMgPSBmZl9nZXRfcGl4ZWxzXzE2X3J2 aTsKKyAgICAgICAgZWxzZQorICAgICAgICAgICAgYy0+Z2V0X3BpeGVscyA9IGZmX2dldF9waXhl bHNfOF9ydmk7CisgICAgfQorfQpkaWZmIC0tZ2l0IGEvbGliYXZjb2RlYy9yaXNjdi9waXhibG9j a2RzcF9ydmkuUyBiL2xpYmF2Y29kZWMvcmlzY3YvcGl4YmxvY2tkc3BfcnZpLlMKbmV3IGZpbGUg bW9kZSAxMDA2NDQKaW5kZXggMDAwMDAwMDAwMC4uZGJmNTFiMGFkOQotLS0gL2Rldi9udWxsCisr KyBiL2xpYmF2Y29kZWMvcmlzY3YvcGl4YmxvY2tkc3BfcnZpLlMKQEAgLTAsMCArMSw1NyBAQAor LyoKKyAqIFRoaXMgZmlsZSBpcyBwYXJ0IG9mIEZGbXBlZy4KKyAqCisgKiBGRm1wZWcgaXMgZnJl ZSBzb2Z0d2FyZTsgeW91IGNhbiByZWRpc3RyaWJ1dGUgaXQgYW5kL29yCisgKiBtb2RpZnkgaXQg dW5kZXIgdGhlIHRlcm1zIG9mIHRoZSBHTlUgTGVzc2VyIEdlbmVyYWwgUHVibGljCisgKiBMaWNl bnNlIGFzIHB1Ymxpc2hlZCBieSB0aGUgRnJlZSBTb2Z0d2FyZSBGb3VuZGF0aW9uOyBlaXRoZXIK KyAqIHZlcnNpb24gMi4xIG9mIHRoZSBMaWNlbnNlLCBvciAoYXQgeW91ciBvcHRpb24pIGFueSBs YXRlciB2ZXJzaW9uLgorICoKKyAqIEZGbXBlZyBpcyBkaXN0cmlidXRlZCBpbiB0aGUgaG9wZSB0 aGF0IGl0IHdpbGwgYmUgdXNlZnVsLAorICogYnV0IFdJVEhPVVQgQU5ZIFdBUlJBTlRZOyB3aXRo b3V0IGV2ZW4gdGhlIGltcGxpZWQgd2FycmFudHkgb2YKKyAqIE1FUkNIQU5UQUJJTElUWSBvciBG SVRORVNTIEZPUiBBIFBBUlRJQ1VMQVIgUFVSUE9TRS4gIFNlZSB0aGUgR05VCisgKiBMZXNzZXIg R2VuZXJhbCBQdWJsaWMgTGljZW5zZSBmb3IgbW9yZSBkZXRhaWxzLgorICoKKyAqIFlvdSBzaG91 bGQgaGF2ZSByZWNlaXZlZCBhIGNvcHkgb2YgdGhlIEdOVSBMZXNzZXIgR2VuZXJhbCBQdWJsaWMK KyAqIExpY2Vuc2UgYWxvbmcgd2l0aCBGRm1wZWc7IGlmIG5vdCwgd3JpdGUgdG8gdGhlIEZyZWUg U29mdHdhcmUKKyAqIEZvdW5kYXRpb24sIEluYy4sIDUxIEZyYW5rbGluIFN0cmVldCwgRmlmdGgg Rmxvb3IsIEJvc3RvbiwgTUEgMDIxMTAtMTMwMSBVU0EKKyAqLworCisjaW5jbHVkZSAiY29uZmln LmgiCisjaW5jbHVkZSAiLi4vbGliYXZ1dGlsL3Jpc2N2L2FzbS5TIgorCitmdW5jIGZmX2dldF9w aXhlbHNfOF9ydmkKKy5pcnAgcm93LCAwLCAxLCAyLCAzLCA0LCA1LCA2LCA3CisgICAgICAgIGxk ICAgICAgdDAsICAgIChhMSkKKyAgICAgICAgYWRkICAgICBhMSwgYTEsIGEyCisgICAgICAgIHNk ICAgICAgemVybywgKChccm93ICogMTYpICsgMCkoYTApCisgICAgICAgIGFkZGkgICAgdDYsIHQ2 LCAtMQorICAgICAgICBzZCAgICAgIHplcm8sICgoXHJvdyAqIDE2KSArIDgpKGEwKQorICAgICAg ICBzcmxpICAgIHQxLCB0MCwgOAorICAgICAgICBzYiAgICAgIHQwLCAoKFxyb3cgKiAxNikgKyAw KShhMCkKKyAgICAgICAgc3JsaSAgICB0MiwgdDAsIDE2CisgICAgICAgIHNiICAgICAgdDEsICgo XHJvdyAqIDE2KSArIDIpKGEwKQorICAgICAgICBzcmxpICAgIHQzLCB0MCwgMjQKKyAgICAgICAg c2IgICAgICB0MiwgKChccm93ICogMTYpICsgNCkoYTApCisgICAgICAgIHNybGkgICAgdDQsIHQw LCAzMgorICAgICAgICBzYiAgICAgIHQzLCAoKFxyb3cgKiAxNikgKyA2KShhMCkKKyAgICAgICAg c3JsaSAgICB0MSwgdDAsIDQwCisgICAgICAgIHNiICAgICAgdDQsICgoXHJvdyAqIDE2KSArIDgp KGEwKQorICAgICAgICBzcmxpICAgIHQyLCB0MCwgNDgKKyAgICAgICAgc2IgICAgICB0MSwgKChc cm93ICogMTYpICsgMTApKGEwKQorICAgICAgICBzcmxpICAgIHQzLCB0MCwgNTYKKyAgICAgICAg c2IgICAgICB0MiwgKChccm93ICogMTYpICsgMTIpKGEwKQorICAgICAgICBzYiAgICAgIHQzLCAo KFxyb3cgKiAxNikgKyAxNCkoYTApCisuZW5kcgorICAgICAgICByZXQKK2VuZGZ1bmMKKworZnVu YyBmZl9nZXRfcGl4ZWxzXzE2X3J2aQorLmlycCByb3csIDAsIDEsIDIsIDMsIDQsIDUsIDYsIDcK KyAgICAgICAgbGQgICAgICB0MCwgMChhMSkKKyAgICAgICAgbGQgICAgICB0MSwgOChhMSkKKyAg ICAgICAgYWRkICAgICBhMSwgYTEsIGEyCisgICAgICAgIHNkICAgICAgdDAsICgoXHJvdyAqIDE2 KSArIDApKGEwKQorICAgICAgICBzZCAgICAgIHQxLCAoKFxyb3cgKiAxNikgKyA4KShhMCkKKy5l bmRyCisgICAgICAgIHJldAorZW5kZnVuYwotLSAKMi4zNy4yCgpfX19fX19fX19fX19fX19fX19f X19fX19fX19fX19fX19fX19fX19fX19fX19fXwpmZm1wZWctZGV2ZWwgbWFpbGluZyBsaXN0CmZm bXBlZy1kZXZlbEBmZm1wZWcub3JnCmh0dHBzOi8vZmZtcGVnLm9yZy9tYWlsbWFuL2xpc3RpbmZv L2ZmbXBlZy1kZXZlbAoKVG8gdW5zdWJzY3JpYmUsIHZpc2l0IGxpbmsgYWJvdmUsIG9yIGVtYWls CmZmbXBlZy1kZXZlbC1yZXF1ZXN0QGZmbXBlZy5vcmcgd2l0aCBzdWJqZWN0ICJ1bnN1YnNjcmli ZSIuCg==