From: "Rémi Denis-Courmont" <remi@remlab.net> To: ffmpeg-devel@ffmpeg.org Subject: [FFmpeg-devel] [PATCHv2 0/10] RISC-V V floating point DSP Date: Sun, 04 Sep 2022 16:54:34 +0300 Message-ID: <3372981.QJadu78ljV@basile.remlab.net> (raw) The following changes since commit b6e8fc1c201d58672639134a737137e1ba7b55fe: avcodec/speexdec: improve support for speex in non-ogg (2022-09-04 11:31:57 +0200) are waiting thorough bashing at your express convenience up to: riscv: float vector dot product with RVV (2022-09-04 16:45:38 +0300) Changes since v1: - Removed stray define. - Fixed mismatch between byte and element size in mul-scalar. - Added fmul, fac, dmul, dmac, fmul-add, fmul-reverse, fmul-window. - Added float butterfly and dot product. All operations are unrolled to the maximum group size (8), with the exception of overlap/add. The later seems to require a minimum of 6 vectors (maybe 5 by extremely careful ordering), so the group size is only 4. The pointer arithmetic could be slightly optimised with SH2ADD and SH3ADD instructions from the Zvba extension. This would require more conditional code, or requiring support for Zvba for probably neglible performance gains though. ---------------------------------------------------------------- Rémi Denis-Courmont (10): riscv: add CPU flags for the RISC-V Vector extension riscv: initial common header for assembler macros riscv: float vector-scalar multiplication with RVV riscv: float vector-vector multiplication with RVV riscv: float vector multiply-accumulate with RVV riscv: float vector multiplication-addition with RVV riscv: float vector sum-and-difference with RVV riscv: float reversed vector multiplication with RVV riscv: float vector windowed overlap/add with RVV riscv: float vector dot product with RVV libavutil/cpu.c | 14 +++ libavutil/cpu.h | 6 + libavutil/cpu_internal.h | 1 + libavutil/float_dsp.c | 2 + libavutil/float_dsp.h | 1 + libavutil/riscv/Makefile | 3 + libavutil/riscv/asm.S | 33 +++++ libavutil/riscv/cpu.c | 57 +++++++++ libavutil/riscv/float_dsp_init.c | 67 ++++++++++ libavutil/riscv/float_dsp_rvv.S | 255 +++++++++++++++++++++++++++++++++++++++ 10 files changed, 439 insertions(+) create mode 100644 libavutil/riscv/Makefile create mode 100644 libavutil/riscv/asm.S create mode 100644 libavutil/riscv/cpu.c create mode 100644 libavutil/riscv/float_dsp_init.c create mode 100644 libavutil/riscv/float_dsp_rvv.S -- レミ・デニ-クールモン http://www.remlab.net/ _______________________________________________ ffmpeg-devel mailing list ffmpeg-devel@ffmpeg.org https://ffmpeg.org/mailman/listinfo/ffmpeg-devel To unsubscribe, visit link above, or email ffmpeg-devel-request@ffmpeg.org with subject "unsubscribe".
next reply other threads:[~2022-09-04 13:54 UTC|newest] Thread overview: 13+ messages / expand[flat|nested] mbox.gz Atom feed top 2022-09-04 13:54 Rémi Denis-Courmont [this message] 2022-09-04 13:54 ` [FFmpeg-devel] [PATCH 01/10] riscv: add CPU flags for the RISC-V Vector extension remi 2022-09-04 13:54 ` [FFmpeg-devel] [PATCH 02/10] riscv: initial common header for assembler macros remi 2022-09-04 13:54 ` [FFmpeg-devel] [PATCH 03/10] riscv: float vector-scalar multiplication with RVV remi 2022-09-04 13:54 ` [FFmpeg-devel] [PATCH 04/10] riscv: float vector-vector " remi 2022-09-04 13:54 ` [FFmpeg-devel] [PATCH 05/10] riscv: float vector multiply-accumulate " remi 2022-09-04 13:54 ` [FFmpeg-devel] [PATCH 06/10] riscv: float vector multiplication-addition " remi 2022-09-04 13:55 ` [FFmpeg-devel] [PATCH 07/10] riscv: float vector sum-and-difference " remi 2022-09-04 13:55 ` [FFmpeg-devel] [PATCH 08/10] riscv: float reversed vector multiplication " remi 2022-09-04 13:55 ` [FFmpeg-devel] [PATCH 09/10] riscv: float vector windowed overlap/add " remi 2022-09-04 13:55 ` [FFmpeg-devel] [PATCH 10/10] riscv: float vector dot product " remi 2022-09-04 17:48 ` [FFmpeg-devel] [PATCHv2 0/10] RISC-V V floating point DSP Lynne 2022-09-04 19:01 ` Rémi Denis-Courmont
Reply instructions: You may reply publicly to this message via plain-text email using any one of the following methods: * Save the following mbox file, import it into your mail client, and reply-to-all from there: mbox Avoid top-posting and favor interleaved quoting: https://en.wikipedia.org/wiki/Posting_style#Interleaved_style * Reply using the --to, --cc, and --in-reply-to switches of git-send-email(1): git send-email \ --in-reply-to=3372981.QJadu78ljV@basile.remlab.net \ --to=remi@remlab.net \ --cc=ffmpeg-devel@ffmpeg.org \ /path/to/YOUR_REPLY https://kernel.org/pub/software/scm/git/docs/git-send-email.html * If your mail client supports setting the In-Reply-To header via mailto: links, try the mailto: link
Git Inbox Mirror of the ffmpeg-devel mailing list - see https://ffmpeg.org/mailman/listinfo/ffmpeg-devel This inbox may be cloned and mirrored by anyone: git clone --mirror https://master.gitmailbox.com/ffmpegdev/0 ffmpegdev/git/0.git # If you have public-inbox 1.1+ installed, you may # initialize and index your mirror using the following commands: public-inbox-init -V2 ffmpegdev ffmpegdev/ https://master.gitmailbox.com/ffmpegdev \ ffmpegdev@gitmailbox.com public-inbox-index ffmpegdev Example config snippet for mirrors. AGPL code for this site: git clone https://public-inbox.org/public-inbox.git