From: "Martin Storsjö" <martin@martin.st> To: Geoff Hill <geoff@geoffhill.org> Cc: ffmpeg-devel@ffmpeg.org Subject: Re: [FFmpeg-devel] [PATCH v4 0/5] avcodec/ac3: Add aarch64 NEON DSP Date: Mon, 8 Apr 2024 13:47:56 +0300 (EEST) Message-ID: <26347d5-9e6e-9932-6f30-5d7447de60b4@martin.st> (raw) In-Reply-To: <51f7be0a-4267-47bf-ab0b-bd6585806da7@geoffhill.org> On Sat, 6 Apr 2024, Geoff Hill wrote: > Thanks Martin for your review and testing. > > Here's v4 with the following changes: > > * Use fmal in sum_square_butterfly_float loop. Faster. > > * Removed redundant loop bound zero checks in extract_exponents, > sum_square_bufferfly_int32 and sum_square_bufferfly_float. > > * Fixed randomize_int24() to also use negative values. > > * Carry copyright from arm implementation over to aarch64. I > did use this version as reference. > > * Fix indentation to match existing aarch64 assembly style. > > Tested once again on aarch64 and x86. Thanks, this set looked good, so I pushed it. I amended the commits a bit, moving the added copyright line from checkasm/ac3dsp.c from patch 1 to 2, where that file actually gets extended. Actually, after pushing, I realized another thing that can be done better in ff_ac3_sum_square_butterfly_float_neon - I'll send a patch for that. > On AWS Graviton2 (t4g.medium), GCC 12.3: > > $ tests/checkasm/checkasm --bench --test=ac3dsp > ... > NEON: > - ac3dsp.ac3_exponent_min [OK] > - ac3dsp.ac3_extract_exponents [OK] > - ac3dsp.float_to_fixed24 [OK] > - ac3dsp.ac3_sum_square_butterfly_int32 [OK] > - ac3dsp.ac3_sum_square_butterfly_float [OK] > checkasm: all 20 tests passed > float_to_fixed24_c: 2460.5 > float_to_fixed24_neon: 561.5 FWIW, it's usually neater to include such numbers in the commit message, so it gets brought along into the final git history (to show the benefit we got from the optimization at the time), quoting only those functions that are added/modified in each patch. But I didn't amend in that in the commit messages this time, but you can keep it in mind for the future. Anyway, thanks for the patches! // Martin _______________________________________________ ffmpeg-devel mailing list ffmpeg-devel@ffmpeg.org https://ffmpeg.org/mailman/listinfo/ffmpeg-devel To unsubscribe, visit link above, or email ffmpeg-devel-request@ffmpeg.org with subject "unsubscribe".
prev parent reply other threads:[~2024-04-08 10:48 UTC|newest] Thread overview: 7+ messages / expand[flat|nested] mbox.gz Atom feed top 2024-04-06 14:23 Geoff Hill 2024-04-06 14:25 ` [FFmpeg-devel] [PATCH v4 1/5] avcodec/ac3: Implement float_to_fixed24 for aarch64 NEON Geoff Hill 2024-04-06 14:25 ` [FFmpeg-devel] [PATCH v4 2/5] avcodec/ac3: Implement ac3_exponent_min " Geoff Hill 2024-04-06 14:26 ` [FFmpeg-devel] [PATCH v4 3/5] avcodec/ac3: Implement ac3_extract_exponents " Geoff Hill 2024-04-06 14:26 ` [FFmpeg-devel] [PATCH v4 4/5] avcodec/ac3: Implement sum_square_butterfly_int32 " Geoff Hill 2024-04-06 14:26 ` [FFmpeg-devel] [PATCH v4 5/5] avcodec/ac3: Implement sum_square_butterfly_float " Geoff Hill 2024-04-08 10:47 ` Martin Storsjö [this message]
Reply instructions: You may reply publicly to this message via plain-text email using any one of the following methods: * Save the following mbox file, import it into your mail client, and reply-to-all from there: mbox Avoid top-posting and favor interleaved quoting: https://en.wikipedia.org/wiki/Posting_style#Interleaved_style * Reply using the --to, --cc, and --in-reply-to switches of git-send-email(1): git send-email \ --in-reply-to=26347d5-9e6e-9932-6f30-5d7447de60b4@martin.st \ --to=martin@martin.st \ --cc=ffmpeg-devel@ffmpeg.org \ --cc=geoff@geoffhill.org \ /path/to/YOUR_REPLY https://kernel.org/pub/software/scm/git/docs/git-send-email.html * If your mail client supports setting the In-Reply-To header via mailto: links, try the mailto: link
Git Inbox Mirror of the ffmpeg-devel mailing list - see https://ffmpeg.org/mailman/listinfo/ffmpeg-devel This inbox may be cloned and mirrored by anyone: git clone --mirror https://master.gitmailbox.com/ffmpegdev/0 ffmpegdev/git/0.git # If you have public-inbox 1.1+ installed, you may # initialize and index your mirror using the following commands: public-inbox-init -V2 ffmpegdev ffmpegdev/ https://master.gitmailbox.com/ffmpegdev \ ffmpegdev@gitmailbox.com public-inbox-index ffmpegdev Example config snippet for mirrors. AGPL code for this site: git clone https://public-inbox.org/public-inbox.git