From: "Martin Storsjö" <martin@martin.st>
To: Geoff Hill <geoff@geoffhill.org>
Cc: ffmpeg-devel@ffmpeg.org
Subject: Re: [FFmpeg-devel] [PATCH v4 0/5] avcodec/ac3: Add aarch64 NEON DSP
Date: Mon, 8 Apr 2024 13:47:56 +0300 (EEST)
Message-ID: <26347d5-9e6e-9932-6f30-5d7447de60b4@martin.st> (raw)
In-Reply-To: <51f7be0a-4267-47bf-ab0b-bd6585806da7@geoffhill.org>
On Sat, 6 Apr 2024, Geoff Hill wrote:
> Thanks Martin for your review and testing.
>
> Here's v4 with the following changes:
>
> * Use fmal in sum_square_butterfly_float loop. Faster.
>
> * Removed redundant loop bound zero checks in extract_exponents,
> sum_square_bufferfly_int32 and sum_square_bufferfly_float.
>
> * Fixed randomize_int24() to also use negative values.
>
> * Carry copyright from arm implementation over to aarch64. I
> did use this version as reference.
>
> * Fix indentation to match existing aarch64 assembly style.
>
> Tested once again on aarch64 and x86.
Thanks, this set looked good, so I pushed it.
I amended the commits a bit, moving the added copyright line from
checkasm/ac3dsp.c from patch 1 to 2, where that file actually gets
extended.
Actually, after pushing, I realized another thing that can be done better
in ff_ac3_sum_square_butterfly_float_neon - I'll send a patch for that.
> On AWS Graviton2 (t4g.medium), GCC 12.3:
>
> $ tests/checkasm/checkasm --bench --test=ac3dsp
> ...
> NEON:
> - ac3dsp.ac3_exponent_min [OK]
> - ac3dsp.ac3_extract_exponents [OK]
> - ac3dsp.float_to_fixed24 [OK]
> - ac3dsp.ac3_sum_square_butterfly_int32 [OK]
> - ac3dsp.ac3_sum_square_butterfly_float [OK]
> checkasm: all 20 tests passed
> float_to_fixed24_c: 2460.5
> float_to_fixed24_neon: 561.5
FWIW, it's usually neater to include such numbers in the commit message,
so it gets brought along into the final git history (to show the benefit
we got from the optimization at the time), quoting only those functions
that are added/modified in each patch. But I didn't amend in that in the
commit messages this time, but you can keep it in mind for the future.
Anyway, thanks for the patches!
// Martin
_______________________________________________
ffmpeg-devel mailing list
ffmpeg-devel@ffmpeg.org
https://ffmpeg.org/mailman/listinfo/ffmpeg-devel
To unsubscribe, visit link above, or email
ffmpeg-devel-request@ffmpeg.org with subject "unsubscribe".
prev parent reply other threads:[~2024-04-08 10:48 UTC|newest]
Thread overview: 7+ messages / expand[flat|nested] mbox.gz Atom feed top
2024-04-06 14:23 Geoff Hill
2024-04-06 14:25 ` [FFmpeg-devel] [PATCH v4 1/5] avcodec/ac3: Implement float_to_fixed24 for aarch64 NEON Geoff Hill
2024-04-06 14:25 ` [FFmpeg-devel] [PATCH v4 2/5] avcodec/ac3: Implement ac3_exponent_min " Geoff Hill
2024-04-06 14:26 ` [FFmpeg-devel] [PATCH v4 3/5] avcodec/ac3: Implement ac3_extract_exponents " Geoff Hill
2024-04-06 14:26 ` [FFmpeg-devel] [PATCH v4 4/5] avcodec/ac3: Implement sum_square_butterfly_int32 " Geoff Hill
2024-04-06 14:26 ` [FFmpeg-devel] [PATCH v4 5/5] avcodec/ac3: Implement sum_square_butterfly_float " Geoff Hill
2024-04-08 10:47 ` Martin Storsjö [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=26347d5-9e6e-9932-6f30-5d7447de60b4@martin.st \
--to=martin@martin.st \
--cc=ffmpeg-devel@ffmpeg.org \
--cc=geoff@geoffhill.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Git Inbox Mirror of the ffmpeg-devel mailing list - see https://ffmpeg.org/mailman/listinfo/ffmpeg-devel
This inbox may be cloned and mirrored by anyone:
git clone --mirror https://master.gitmailbox.com/ffmpegdev/0 ffmpegdev/git/0.git
# If you have public-inbox 1.1+ installed, you may
# initialize and index your mirror using the following commands:
public-inbox-init -V2 ffmpegdev ffmpegdev/ https://master.gitmailbox.com/ffmpegdev \
ffmpegdev@gitmailbox.com
public-inbox-index ffmpegdev
Example config snippet for mirrors.
AGPL code for this site: git clone https://public-inbox.org/public-inbox.git