From mboxrd@z Thu Jan 1 00:00:00 1970
From: remi@remlab.net
To: ffmpeg-devel@ffmpeg.org
Date: Sun, 4 Sep 2022 16:54:56 +0300
Message-Id: <20220904135503.116704-3-remi@remlab.net>
X-Mailer: git-send-email 2.37.2
In-Reply-To: <3372981.QJadu78ljV@basile.remlab.net>
References: <3372981.QJadu78ljV@basile.remlab.net>
MIME-Version: 1.0
Subject: [FFmpeg-devel] [PATCH 03/10] riscv: float vector-scalar multiplication with RVV
X-BeenThere: ffmpeg-devel@ffmpeg.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: FFmpeg development discussions and patches
Reply-To: FFmpeg development discussions and patches
Content-Type: text/plain; charset="utf-8"
Content-Transfer-Encoding: 8bit
Errors-To: ffmpeg-devel-bounces@ffmpeg.org
Sender: "ffmpeg-devel"

From: Rémi Denis-Courmont <remi@remlab.net>

This is based on existing code from the VLC git tree with two minor
changes to account for the different function prototypes.
---
 libavutil/float_dsp.c            |  2 ++
 libavutil/float_dsp.h            |  1 +
 libavutil/riscv/Makefile         |  4 ++-
 libavutil/riscv/float_dsp_init.c | 41 +++++++++++++++++++++
 libavutil/riscv/float_dsp_rvv.S  | 62 ++++++++++++++++++++++++++++++++
 5 files changed, 109 insertions(+), 1 deletion(-)
 create mode 100644 libavutil/riscv/float_dsp_init.c
 create mode 100644 libavutil/riscv/float_dsp_rvv.S

diff --git a/libavutil/float_dsp.c b/libavutil/float_dsp.c
index 8676c8b0f8..742dd679d2 100644
--- a/libavutil/float_dsp.c
+++ b/libavutil/float_dsp.c
@@ -156,6 +156,8 @@ av_cold AVFloatDSPContext *avpriv_float_dsp_alloc(int bit_exact)
     ff_float_dsp_init_arm(fdsp);
 #elif ARCH_PPC
     ff_float_dsp_init_ppc(fdsp, bit_exact);
+#elif ARCH_RISCV
+    ff_float_dsp_init_riscv(fdsp);
 #elif ARCH_X86
     ff_float_dsp_init_x86(fdsp);
 #elif ARCH_MIPS
diff --git a/libavutil/float_dsp.h b/libavutil/float_dsp.h
index 9c664592bd..7cad9fc622 100644
--- a/libavutil/float_dsp.h
+++ b/libavutil/float_dsp.h
@@ -205,6 +205,7 @@ float avpriv_scalarproduct_float_c(const float *v1, const float *v2, int len);
 void ff_float_dsp_init_aarch64(AVFloatDSPContext *fdsp);
 void ff_float_dsp_init_arm(AVFloatDSPContext *fdsp);
 void ff_float_dsp_init_ppc(AVFloatDSPContext *fdsp, int strict);
+void ff_float_dsp_init_riscv(AVFloatDSPContext *fdsp);
 void ff_float_dsp_init_x86(AVFloatDSPContext *fdsp);
 void ff_float_dsp_init_mips(AVFloatDSPContext *fdsp);
 
diff --git a/libavutil/riscv/Makefile b/libavutil/riscv/Makefile
index 1f818043dc..6bf8243e8d 100644
--- a/libavutil/riscv/Makefile
+++ b/libavutil/riscv/Makefile
@@ -1 +1,3 @@
-OBJS += riscv/cpu.o
+OBJS += riscv/cpu.o \
+        riscv/float_dsp_init.o \
+        riscv/float_dsp_rvv.o
diff --git a/libavutil/riscv/float_dsp_init.c b/libavutil/riscv/float_dsp_init.c
new file mode 100644
index 0000000000..279412c036
--- /dev/null
+++ b/libavutil/riscv/float_dsp_init.c
@@ -0,0 +1,41 @@
+/*
+ * This file is part of FFmpeg.
+ *
+ * FFmpeg is free software; you can redistribute it and/or
+ * modify it under the terms of the GNU Lesser General Public
+ * License as published by the Free Software Foundation; either
+ * version 2.1 of the License, or (at your option) any later version.
+ *
+ * FFmpeg is distributed in the hope that it will be useful,
+ * but WITHOUT ANY WARRANTY; without even the implied warranty of
+ * MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the GNU
+ * Lesser General Public License for more details.
+ *
+ * You should have received a copy of the GNU Lesser General Public
+ * License along with FFmpeg; if not, write to the Free Software
+ * Foundation, Inc., 51 Franklin Street, Fifth Floor, Boston, MA 02110-1301 USA
+ */
+
+#include <stdint.h>
+
+#include "libavutil/attributes.h"
+#include "libavutil/cpu.h"
+#include "libavutil/float_dsp.h"
+
+void ff_vector_fmul_scalar_rvv(float *dst, const float *src, float mul,
+                                int len);
+
+void ff_vector_dmul_scalar_rvv(double *dst, const double *src, double mul,
+                                int len);
+
+av_cold void ff_float_dsp_init_riscv(AVFloatDSPContext *fdsp)
+{
+    int flags = av_get_cpu_flags();
+
+    if (flags & AV_CPU_FLAG_ZVE32F) {
+        fdsp->vector_fmul_scalar = ff_vector_fmul_scalar_rvv;
+
+        if (flags & AV_CPU_FLAG_ZVE64D)
+            fdsp->vector_dmul_scalar = ff_vector_dmul_scalar_rvv;
+    }
+}
diff --git a/libavutil/riscv/float_dsp_rvv.S b/libavutil/riscv/float_dsp_rvv.S
new file mode 100644
index 0000000000..98d06c6d07
--- /dev/null
+++ b/libavutil/riscv/float_dsp_rvv.S
@@ -0,0 +1,62 @@
+/*
+ * This file is part of FFmpeg.
+ *
+ * FFmpeg is free software; you can redistribute it and/or
+ * modify it under the terms of the GNU Lesser General Public
+ * License as published by the Free Software Foundation; either
+ * version 2.1 of the License, or (at your option) any later version.
+ *
+ * FFmpeg is distributed in the hope that it will be useful,
+ * but WITHOUT ANY WARRANTY; without even the implied warranty of
+ * MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the GNU
+ * Lesser General Public License for more details.
+ *
+ * You should have received a copy of the GNU Lesser General Public
+ * License along with FFmpeg; if not, write to the Free Software
+ * Foundation, Inc., 51 Franklin Street, Fifth Floor, Boston, MA 02110-1301 USA
+ */
+
+#include "config.h"
+#include "asm.S"
+
+        .option  arch, +v
+
+// (a0) = (a1) * fa0 [0..a2-1]
+func ff_vector_fmul_scalar_rvv
+#if defined (__riscv_float_abi_soft)
+        fmv.w.x  fa0, a2
+        mv       a2, a3
+#endif
+
+1:      vsetvli  t0, a2, e32, m8, ta, ma
+        slli     t1, t0, 2
+        vle32.v  v16, (a1)
+        add      a1, a1, t1
+        vfmul.vf v16, v16, fa0
+        sub      a2, a2, t0
+        vse32.v  v16, (a0)
+        add      a0, a0, t1
+        bnez     a2, 1b
+
+        ret
+endfunc
+
+// (a0) = (a1) * fa0 [0..a2-1]
+func ff_vector_dmul_scalar_rvv
+#if defined (__riscv_float_abi_soft) || defined (__riscv_float_abi_single)
+        fmv.d.x  fa0, a2
+        mv       a2, a3
+#endif
+
+1:      vsetvli  t0, a2, e64, m8, ta, ma
+        slli     t1, t0, 3
+        vle64.v  v16, (a1)
+        add      a1, a1, t1
+        vfmul.vf v16, v16, fa0
+        sub      a2, a2, t0
+        vse64.v  v16, (a0)
+        add      a0, a0, t1
+        bnez     a2, 1b
+
+        ret
+endfunc
-- 
2.37.2
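
For readers without RVV hardware, the strip-mined loop in
ff_vector_fmul_scalar_rvv can be modelled in plain C. This is only a rough
sketch and not part of the patch: model_vector_fmul_scalar and MODEL_VLMAX
are hypothetical names, and the fixed chunk size stands in for whatever
vector length vsetvli would actually grant on a given core (the real
instruction may split the tail differently).

```c
#include <assert.h>

/* Hypothetical stand-in for the hardware limit: the largest number of
 * 32-bit elements a single vsetvli iteration is assumed to grant.
 * Not an FFmpeg or RISC-V identifier. */
#define MODEL_VLMAX 8

/* Scalar model of the loop: dst[i] = src[i] * mul for i in [0, len). */
static void model_vector_fmul_scalar(float *dst, const float *src, float mul,
                                     int len)
{
    assert(len >= 0);

    while (len > 0) {
        /* vsetvli t0, a2, e32, ...: process at most MODEL_VLMAX elements */
        int vl = len < MODEL_VLMAX ? len : MODEL_VLMAX;

        /* vle32.v, vfmul.vf, vse32.v: load, multiply by scalar, store */
        for (int i = 0; i < vl; i++)
            dst[i] = src[i] * mul;

        src += vl;  /* add a1, a1, t1 (t1 = vl * sizeof(float)) */
        dst += vl;  /* add a0, a0, t1 */
        len -= vl;  /* sub a2, a2, t0 */
    }
}
```

The double-precision routine follows the same pattern with 64-bit elements,
so the byte stride per iteration becomes vl * 8 rather than vl * 4. One
difference from the model: the assembly tests the remaining count at the
bottom of the loop, so it always runs at least one (possibly empty)
iteration.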