From: Michael Niedermayer <michael@niedermayer.cc>
To: FFmpeg development discussions and patches <ffmpeg-devel@ffmpeg.org>
Subject: Re: [FFmpeg-devel] [PATCH v2 1/7] avcodec/la: add LSX optimization for h264 idct.
Date: Fri, 19 May 2023 16:43:33 +0200
Message-ID: <20230519144333.GB1391451@pb2>
In-Reply-To: <20230517070351.4849-2-chenhao@loongson.cn>
On Wed, May 17, 2023 at 03:03:45PM +0800, Hao Chen wrote:
> From: Shiyou Yin <yinshiyou-hf@loongson.cn>
>
> loongson_asm.S is LoongArch asm optimization helper.
> Add functions:
> ff_h264_idct_add_8_lsx
> ff_h264_idct8_add_8_lsx
> ff_h264_idct_dc_add_8_lsx
> ff_h264_idct8_dc_add_8_lsx
> ff_h264_idct_add16_8_lsx
> ff_h264_idct8_add4_8_lsx
> ff_h264_idct_add8_8_lsx
> ff_h264_idct_add8_422_8_lsx
> ff_h264_idct_add16_intra_8_lsx
> ff_h264_luma_dc_dequant_idct_8_lsx
> Replaced function(LSX is sufficient for these functions):
> ff_h264_idct_add_lasx
> ff_h264_idct4x4_addblk_dc_lasx
> ff_h264_idct_add16_lasx
> ff_h264_idct8_add4_lasx
> ff_h264_idct_add8_lasx
> ff_h264_idct_add8_422_lasx
> ff_h264_idct_add16_intra_lasx
> ff_h264_deq_idct_luma_dc_lasx
> Renamed functions:
> ff_h264_idct8_addblk_lasx ==> ff_h264_idct8_add_8_lasx
> ff_h264_idct8_dc_addblk_lasx ==> ff_h264_idct8_dc_add_8_lasx
>
> ./configure --disable-lasx
> ffmpeg -i 1_h264_1080p_30fps_3Mbps.mp4 -f rawvideo -y /dev/null -an
> before: 155fps
> after: 161fps
> ---
> libavcodec/loongarch/Makefile | 3 +-
> libavcodec/loongarch/h264_deblock_lasx.c | 2 +-
> libavcodec/loongarch/h264dsp_init_loongarch.c | 39 +-
> libavcodec/loongarch/h264dsp_lasx.c | 2 +-
> .../{h264dsp_lasx.h => h264dsp_loongarch.h} | 60 +-
> libavcodec/loongarch/h264idct.S | 659 ++++++++++++
> libavcodec/loongarch/h264idct_lasx.c | 498 ---------
> libavcodec/loongarch/h264idct_loongarch.c | 185 ++++
> libavcodec/loongarch/loongson_asm.S | 946 ++++++++++++++++++
> 9 files changed, 1851 insertions(+), 543 deletions(-)
> rename libavcodec/loongarch/{h264dsp_lasx.h => h264dsp_loongarch.h} (68%)
> create mode 100644 libavcodec/loongarch/h264idct.S
> delete mode 100644 libavcodec/loongarch/h264idct_lasx.c
> create mode 100644 libavcodec/loongarch/h264idct_loongarch.c
> create mode 100644 libavcodec/loongarch/loongson_asm.S
This patch adds whitespace errors. They could prevent pushing, as the
git hooks check for various whitespace issues:
.git/rebase-apply/patch:1432: tab in indent.
} else if (nnz) {
.git/rebase-apply/patch:881: new blank line at EOF.
+
.git/rebase-apply/patch:1576: new blank line at EOF.
+
.git/rebase-apply/patch:2528: new blank line at EOF.
+
warning: 4 lines add whitespace errors.
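
Running "git diff --check" before sending should flag most of these. For
illustration only, here is a rough sketch of the kind of check involved. It is
not what the git hooks actually run; find_whitespace_errors is a hypothetical
helper name, and it looks at one file's contents rather than at a patch.

```python
def find_whitespace_errors(text):
    """Return (line_number, message) pairs for one file's contents.

    Hypothetical helper: it mimics three of git's whitespace checks
    and is not the code git or the FFmpeg hooks use.
    """
    errors = []
    lines = text.split("\n")
    # A file ending in "\n" yields a final empty element; drop it.
    if lines and lines[-1] == "":
        lines.pop()
    for number, line in enumerate(lines, start=1):
        # Leading run of spaces and tabs on this line.
        indent = line[: len(line) - len(line.lstrip(" \t"))]
        if "\t" in indent:
            errors.append((number, "tab in indent"))
        if line != line.rstrip(" \t"):
            errors.append((number, "trailing whitespace"))
    # A blank last line means the file ends with an extra empty line.
    if lines and lines[-1].strip() == "":
        errors.append((len(lines), "new blank line at EOF"))
    return errors


if __name__ == "__main__":
    sample = "int f(void)\n{\n\t} else if (nnz) {\n}\n\n"
    for number, message in find_whitespace_errors(sample):
        print(f"line {number}: {message}")
```

The sample string reproduces the two kinds of error reported above: a tab
used for indentation and an empty line at the end of the file.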
[...]
--
Michael GnuPG fingerprint: 9FF2128B147EF6730BADF133611EC787040B0FAB
The bravest are surely those who have the clearest vision
of what is before them, glory and danger alike, and yet
notwithstanding go out to meet it. -- Thucydides