From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from ffbox0-bg.mplayerhq.hu (ffbox0-bg.ffmpeg.org [79.124.17.100]) by master.gitmailbox.com (Postfix) with ESMTP id 32FF744388 for ; Tue, 6 Sep 2022 18:45:25 +0000 (UTC) Received: from [127.0.1.1] (localhost [127.0.0.1]) by ffbox0-bg.mplayerhq.hu (Postfix) with ESMTP id 9561F68BB5D; Tue, 6 Sep 2022 21:44:17 +0300 (EEST) Received: from ursule.remlab.net (vps-a2bccee9.vps.ovh.net [51.75.19.47]) by ffbox0-bg.mplayerhq.hu (Postfix) with ESMTP id 2250A68BB0F for ; Tue, 6 Sep 2022 21:44:03 +0300 (EEST) Received: from basile.remlab.net (localhost [IPv6:::1]) by ursule.remlab.net (Postfix) with ESMTP id CB851C00B0 for ; Tue, 6 Sep 2022 21:44:02 +0300 (EEST) From: remi@remlab.net To: ffmpeg-devel@ffmpeg.org Date: Tue, 6 Sep 2022 21:43:54 +0300 Message-Id: <20220906184402.119826-4-remi@remlab.net> X-Mailer: git-send-email 2.37.2 In-Reply-To: <5753736.MhkbZ0Pkbq@basile.remlab.net> References: <5753736.MhkbZ0Pkbq@basile.remlab.net> MIME-Version: 1.0 Subject: [FFmpeg-devel] [PATCH 04/12] lavu/riscv: float vector-scalar multiplication with RVV X-BeenThere: ffmpeg-devel@ffmpeg.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: FFmpeg development discussions and patches List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Reply-To: FFmpeg development discussions and patches Content-Type: text/plain; charset="utf-8" Content-Transfer-Encoding: base64 Errors-To: ffmpeg-devel-bounces@ffmpeg.org Sender: "ffmpeg-devel" Archived-At: List-Archive: List-Post: RnJvbTogUsOpbWkgRGVuaXMtQ291cm1vbnQgPHJlbWlAcmVtbGFiLm5ldD4KClRoaXMgaXMgYmFz ZWQgb24gZXhpc3RpbmcgY29kZSBmcm9tIHRoZSBWTEMgZ2l0IHRyZWUgd2l0aCB0d28gbWlub3IK Y2hhbmdlcyB0byBhY2NvdW50IGZvciB0aGUgZGlmZmVyZW50IGZ1bmN0aW9uIHByb3RvdHlwZXMu Ci0tLQogbGliYXZ1dGlsL2Zsb2F0X2RzcC5jICAgICAgICAgICAgfCAgMiArKwogbGliYXZ1dGls L2Zsb2F0X2RzcC5oICAgICAgICAgICAgfCAgMSArCiBsaWJhdnV0aWwvcmlzY3YvTWFrZWZpbGUg ICAgICAgICB8ICA0ICsrLQogbGliYXZ1dGlsL3Jpc2N2L2Zsb2F0X2RzcF9pbml0LmMgfCA0MSAr KysrKysrKysrKysrKysrKysrKysrKwogbGliYXZ1dGlsL3Jpc2N2L2Zsb2F0X2RzcF9ydnYuUyAg fCA1NiArKysrKysrKysrKysrKysrKysrKysrKysrKysrKysrKwogNSBmaWxlcyBjaGFuZ2VkLCAx MDMgaW5zZXJ0aW9ucygrKSwgMSBkZWxldGlvbigtKQogY3JlYXRlIG1vZGUgMTAwNjQ0IGxpYmF2 dXRpbC9yaXNjdi9mbG9hdF9kc3BfaW5pdC5jCiBjcmVhdGUgbW9kZSAxMDA2NDQgbGliYXZ1dGls L3Jpc2N2L2Zsb2F0X2RzcF9ydnYuUwoKZGlmZiAtLWdpdCBhL2xpYmF2dXRpbC9mbG9hdF9kc3Au YyBiL2xpYmF2dXRpbC9mbG9hdF9kc3AuYwppbmRleCA4Njc2YzhiMGY4Li43NDJkZDY3OWQyIDEw MDY0NAotLS0gYS9saWJhdnV0aWwvZmxvYXRfZHNwLmMKKysrIGIvbGliYXZ1dGlsL2Zsb2F0X2Rz cC5jCkBAIC0xNTYsNiArMTU2LDggQEAgYXZfY29sZCBBVkZsb2F0RFNQQ29udGV4dCAqYXZwcml2 X2Zsb2F0X2RzcF9hbGxvYyhpbnQgYml0X2V4YWN0KQogICAgIGZmX2Zsb2F0X2RzcF9pbml0X2Fy bShmZHNwKTsKICNlbGlmIEFSQ0hfUFBDCiAgICAgZmZfZmxvYXRfZHNwX2luaXRfcHBjKGZkc3As IGJpdF9leGFjdCk7CisjZWxpZiBBUkNIX1JJU0NWCisgICAgZmZfZmxvYXRfZHNwX2luaXRfcmlz Y3YoZmRzcCk7CiAjZWxpZiBBUkNIX1g4NgogICAgIGZmX2Zsb2F0X2RzcF9pbml0X3g4NihmZHNw KTsKICNlbGlmIEFSQ0hfTUlQUwpkaWZmIC0tZ2l0IGEvbGliYXZ1dGlsL2Zsb2F0X2RzcC5oIGIv bGliYXZ1dGlsL2Zsb2F0X2RzcC5oCmluZGV4IDljNjY0NTkyYmQuLjdjYWQ5ZmM2MjIgMTAwNjQ0 Ci0tLSBhL2xpYmF2dXRpbC9mbG9hdF9kc3AuaAorKysgYi9saWJhdnV0aWwvZmxvYXRfZHNwLmgK QEAgLTIwNSw2ICsyMDUsNyBAQCBmbG9hdCBhdnByaXZfc2NhbGFycHJvZHVjdF9mbG9hdF9jKGNv bnN0IGZsb2F0ICp2MSwgY29uc3QgZmxvYXQgKnYyLCBpbnQgbGVuKTsKIHZvaWQgZmZfZmxvYXRf ZHNwX2luaXRfYWFyY2g2NChBVkZsb2F0RFNQQ29udGV4dCAqZmRzcCk7CiB2b2lkIGZmX2Zsb2F0 X2RzcF9pbml0X2FybShBVkZsb2F0RFNQQ29udGV4dCAqZmRzcCk7CiB2b2lkIGZmX2Zsb2F0X2Rz cF9pbml0X3BwYyhBVkZsb2F0RFNQQ29udGV4dCAqZmRzcCwgaW50IHN0cmljdCk7Cit2b2lkIGZm X2Zsb2F0X2RzcF9pbml0X3Jpc2N2KEFWRmxvYXREU1BDb250ZXh0ICpmZHNwKTsKIHZvaWQgZmZf ZmxvYXRfZHNwX2luaXRfeDg2KEFWRmxvYXREU1BDb250ZXh0ICpmZHNwKTsKIHZvaWQgZmZfZmxv YXRfZHNwX2luaXRfbWlwcyhBVkZsb2F0RFNQQ29udGV4dCAqZmRzcCk7CiAKZGlmZiAtLWdpdCBh L2xpYmF2dXRpbC9yaXNjdi9NYWtlZmlsZSBiL2xpYmF2dXRpbC9yaXNjdi9NYWtlZmlsZQppbmRl eCAxZjgxODA0M2RjLi42YmY4MjQzZThkIDEwMDY0NAotLS0gYS9saWJhdnV0aWwvcmlzY3YvTWFr ZWZpbGUKKysrIGIvbGliYXZ1dGlsL3Jpc2N2L01ha2VmaWxlCkBAIC0xICsxLDMgQEAKLU9CSlMg Kz0gcmlzY3YvY3B1Lm8KK09CSlMgKz0gcmlzY3YvY3B1Lm8gXAorICAgICAgICByaXNjdi9mbG9h dF9kc3BfaW5pdC5vIFwKKyAgICAgICAgcmlzY3YvZmxvYXRfZHNwX3J2di5vCmRpZmYgLS1naXQg YS9saWJhdnV0aWwvcmlzY3YvZmxvYXRfZHNwX2luaXQuYyBiL2xpYmF2dXRpbC9yaXNjdi9mbG9h dF9kc3BfaW5pdC5jCm5ldyBmaWxlIG1vZGUgMTAwNjQ0CmluZGV4IDAwMDAwMDAwMDAuLjI3OTQx MmMwMzYKLS0tIC9kZXYvbnVsbAorKysgYi9saWJhdnV0aWwvcmlzY3YvZmxvYXRfZHNwX2luaXQu YwpAQCAtMCwwICsxLDQxIEBACisvKgorICogVGhpcyBmaWxlIGlzIHBhcnQgb2YgRkZtcGVnLgor ICoKKyAqIEZGbXBlZyBpcyBmcmVlIHNvZnR3YXJlOyB5b3UgY2FuIHJlZGlzdHJpYnV0ZSBpdCBh bmQvb3IKKyAqIG1vZGlmeSBpdCB1bmRlciB0aGUgdGVybXMgb2YgdGhlIEdOVSBMZXNzZXIgR2Vu ZXJhbCBQdWJsaWMKKyAqIExpY2Vuc2UgYXMgcHVibGlzaGVkIGJ5IHRoZSBGcmVlIFNvZnR3YXJl IEZvdW5kYXRpb247IGVpdGhlcgorICogdmVyc2lvbiAyLjEgb2YgdGhlIExpY2Vuc2UsIG9yIChh dCB5b3VyIG9wdGlvbikgYW55IGxhdGVyIHZlcnNpb24uCisgKgorICogRkZtcGVnIGlzIGRpc3Ry aWJ1dGVkIGluIHRoZSBob3BlIHRoYXQgaXQgd2lsbCBiZSB1c2VmdWwsCisgKiBidXQgV0lUSE9V VCBBTlkgV0FSUkFOVFk7IHdpdGhvdXQgZXZlbiB0aGUgaW1wbGllZCB3YXJyYW50eSBvZgorICog TUVSQ0hBTlRBQklMSVRZIG9yIEZJVE5FU1MgRk9SIEEgUEFSVElDVUxBUiBQVVJQT1NFLiAgU2Vl IHRoZSBHTlUKKyAqIExlc3NlciBHZW5lcmFsIFB1YmxpYyBMaWNlbnNlIGZvciBtb3JlIGRldGFp bHMuCisgKgorICogWW91IHNob3VsZCBoYXZlIHJlY2VpdmVkIGEgY29weSBvZiB0aGUgR05VIExl c3NlciBHZW5lcmFsIFB1YmxpYworICogTGljZW5zZSBhbG9uZyB3aXRoIEZGbXBlZzsgaWYgbm90 LCB3cml0ZSB0byB0aGUgRnJlZSBTb2Z0d2FyZQorICogRm91bmRhdGlvbiwgSW5jLiwgNTEgRnJh bmtsaW4gU3RyZWV0LCBGaWZ0aCBGbG9vciwgQm9zdG9uLCBNQSAwMjExMC0xMzAxIFVTQQorICov CisKKyNpbmNsdWRlIDxzdGRpbnQuaD4KKworI2luY2x1ZGUgImxpYmF2dXRpbC9hdHRyaWJ1dGVz LmgiCisjaW5jbHVkZSAibGliYXZ1dGlsL2NwdS5oIgorI2luY2x1ZGUgImxpYmF2dXRpbC9mbG9h dF9kc3AuaCIKKwordm9pZCBmZl92ZWN0b3JfZm11bF9zY2FsYXJfcnZ2KGZsb2F0ICpkc3QsIGNv bnN0IGZsb2F0ICpzcmMsIGZsb2F0IG11bCwKKyAgICAgICAgICAgICAgICAgICAgICAgICAgICAg ICAgaW50IGxlbik7CisKK3ZvaWQgZmZfdmVjdG9yX2RtdWxfc2NhbGFyX3J2dihkb3VibGUgKmRz dCwgY29uc3QgZG91YmxlICpzcmMsIGRvdWJsZSBtdWwsCisgICAgICAgICAgICAgICAgICAgICAg ICAgICAgICAgIGludCBsZW4pOworCithdl9jb2xkIHZvaWQgZmZfZmxvYXRfZHNwX2luaXRfcmlz Y3YoQVZGbG9hdERTUENvbnRleHQgKmZkc3ApCit7CisgICAgaW50IGZsYWdzID0gYXZfZ2V0X2Nw dV9mbGFncygpOworCisgICAgaWYgKGZsYWdzICYgQVZfQ1BVX0ZMQUdfWlZFMzJGKSB7CisgICAg ICAgIGZkc3AtPnZlY3Rvcl9mbXVsX3NjYWxhciA9IGZmX3ZlY3Rvcl9mbXVsX3NjYWxhcl9ydnY7 CisKKyAgICAgICAgaWYgKGZsYWdzICYgQVZfQ1BVX0ZMQUdfWlZFNjREKQorICAgICAgICAgICAg ZmRzcC0+dmVjdG9yX2RtdWxfc2NhbGFyID0gZmZfdmVjdG9yX2RtdWxfc2NhbGFyX3J2djsKKyAg ICB9Cit9CmRpZmYgLS1naXQgYS9saWJhdnV0aWwvcmlzY3YvZmxvYXRfZHNwX3J2di5TIGIvbGli YXZ1dGlsL3Jpc2N2L2Zsb2F0X2RzcF9ydnYuUwpuZXcgZmlsZSBtb2RlIDEwMDY0NAppbmRleCAw MDAwMDAwMDAwLi4zNjVlMDAxOTBjCi0tLSAvZGV2L251bGwKKysrIGIvbGliYXZ1dGlsL3Jpc2N2 L2Zsb2F0X2RzcF9ydnYuUwpAQCAtMCwwICsxLDU2IEBACisvKgorICogVGhpcyBmaWxlIGlzIHBh cnQgb2YgRkZtcGVnLgorICoKKyAqIEZGbXBlZyBpcyBmcmVlIHNvZnR3YXJlOyB5b3UgY2FuIHJl ZGlzdHJpYnV0ZSBpdCBhbmQvb3IKKyAqIG1vZGlmeSBpdCB1bmRlciB0aGUgdGVybXMgb2YgdGhl IEdOVSBMZXNzZXIgR2VuZXJhbCBQdWJsaWMKKyAqIExpY2Vuc2UgYXMgcHVibGlzaGVkIGJ5IHRo ZSBGcmVlIFNvZnR3YXJlIEZvdW5kYXRpb247IGVpdGhlcgorICogdmVyc2lvbiAyLjEgb2YgdGhl IExpY2Vuc2UsIG9yIChhdCB5b3VyIG9wdGlvbikgYW55IGxhdGVyIHZlcnNpb24uCisgKgorICog RkZtcGVnIGlzIGRpc3RyaWJ1dGVkIGluIHRoZSBob3BlIHRoYXQgaXQgd2lsbCBiZSB1c2VmdWws CisgKiBidXQgV0lUSE9VVCBBTlkgV0FSUkFOVFk7IHdpdGhvdXQgZXZlbiB0aGUgaW1wbGllZCB3 YXJyYW50eSBvZgorICogTUVSQ0hBTlRBQklMSVRZIG9yIEZJVE5FU1MgRk9SIEEgUEFSVElDVUxB UiBQVVJQT1NFLiAgU2VlIHRoZSBHTlUKKyAqIExlc3NlciBHZW5lcmFsIFB1YmxpYyBMaWNlbnNl IGZvciBtb3JlIGRldGFpbHMuCisgKgorICogWW91IHNob3VsZCBoYXZlIHJlY2VpdmVkIGEgY29w eSBvZiB0aGUgR05VIExlc3NlciBHZW5lcmFsIFB1YmxpYworICogTGljZW5zZSBhbG9uZyB3aXRo IEZGbXBlZzsgaWYgbm90LCB3cml0ZSB0byB0aGUgRnJlZSBTb2Z0d2FyZQorICogRm91bmRhdGlv biwgSW5jLiwgNTEgRnJhbmtsaW4gU3RyZWV0LCBGaWZ0aCBGbG9vciwgQm9zdG9uLCBNQSAwMjEx MC0xMzAxIFVTQQorICovCisKKyNpbmNsdWRlICJjb25maWcuaCIKKyNpbmNsdWRlICJhc20uUyIK KworLy8gKGEwKSA9IChhMSkgKiBmYTAgWzAuLmEyLTFdCitmdW5jIGZmX3ZlY3Rvcl9mbXVsX3Nj YWxhcl9ydnYsIHp2ZTMyZgorTk9IV0YgICBmbXYudy54ICBmYTAsIGEyCitOT0hXRiAgIG12ICAg ICAgIGEyLCBhMworCisxOiAgICAgIHZzZXR2bGkgIHQwLCBhMiwgZTMyLCBtOCwgdGEsIG1hCisg ICAgICAgIHNsbGkgICAgIHQxLCB0MCwgMgorICAgICAgICB2bGUzMi52ICB2MTYsIChhMSkKKyAg ICAgICAgYWRkICAgICAgYTEsIGExLCB0MQorICAgICAgICB2Zm11bC52ZiB2MTYsIHYxNiwgZmEw CisgICAgICAgIHN1YiAgICAgIGEyLCBhMiwgdDAKKyAgICAgICAgdnNlMzIudiAgdjE2LCAoYTAp CisgICAgICAgIGFkZCAgICAgIGEwLCBhMCwgdDEKKyAgICAgICAgYm5leiAgICAgYTIsIDFiCisK KyAgICAgICAgcmV0CitlbmRmdW5jCisKKy8vIChhMCkgPSAoYTEpICogZmEwIFswLi5hMi0xXQor ZnVuYyBmZl92ZWN0b3JfZG11bF9zY2FsYXJfcnZ2LCB6dmU2NGQKK05PSFdEICAgZm12LmQueCAg ZmEwLCBhMgorTk9IV0QgICBtdiAgICAgICBhMiwgYTMKKworMTogICAgICB2c2V0dmxpICB0MCwg YTIsIGU2NCwgbTgsIHRhLCBtYQorICAgICAgICBzbGxpICAgICB0MSwgdDAsIDMKKyAgICAgICAg dmxlNjQudiAgdjE2LCAoYTEpCisgICAgICAgIGFkZCAgICAgIGExLCBhMSwgdDEKKyAgICAgICAg dmZtdWwudmYgdjE2LCB2MTYsIGZhMAorICAgICAgICBzdWIgICAgICBhMiwgYTIsIHQwCisgICAg ICAgIHZzZTY0LnYgIHYxNiwgKGEwKQorICAgICAgICBhZGQgICAgICBhMCwgYTAsIHQxCisgICAg ICAgIGJuZXogICAgIGEyLCAxYgorCisgICAgICAgIHJldAorZW5kZnVuYwotLSAKMi4zNy4yCgpf X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fXwpmZm1wZWctZGV2 ZWwgbWFpbGluZyBsaXN0CmZmbXBlZy1kZXZlbEBmZm1wZWcub3JnCmh0dHBzOi8vZmZtcGVnLm9y Zy9tYWlsbWFuL2xpc3RpbmZvL2ZmbXBlZy1kZXZlbAoKVG8gdW5zdWJzY3JpYmUsIHZpc2l0IGxp bmsgYWJvdmUsIG9yIGVtYWlsCmZmbXBlZy1kZXZlbC1yZXF1ZXN0QGZmbXBlZy5vcmcgd2l0aCBz dWJqZWN0ICJ1bnN1YnNjcmliZSIuCg==