Git Inbox Mirror of the ffmpeg-devel mailing list - see https://ffmpeg.org/mailman/listinfo/ffmpeg-devel
 help / color / mirror / Atom feed
From: "Martin Storsjö" <martin@martin.st>
To: FFmpeg development discussions and patches <ffmpeg-devel@ffmpeg.org>
Subject: Re: [FFmpeg-devel] [PATCH 01/10] checkasm: Add vc1dsp in-loop deblocking filter tests
Date: Tue, 29 Mar 2022 14:47:37 +0300 (EEST)
Message-ID: <fa8af6e6-342f-83da-5a66-94aa555ea9e@martin.st> (raw)
In-Reply-To: <f54b1cda-14ca-dbec-1f18-0d1b46a2e1f3@riscosopen.org>

On Mon, 28 Mar 2022, Ben Avison wrote:

> On 25/03/2022 22:53, Martin Storsjö wrote:
>> On Fri, 25 Mar 2022, Ben Avison wrote:
>> 
>>> +#define 
>>> CHECK_LOOP_FILTER(func)                                             \
>>> +    do 
>>> {                                                                    \
>>> +        if (check_func(h.func, "vc1dsp." #func)) 
>>> {                          \
>>> +            declare_func_emms(AV_CPU_FLAG_MMX, void, uint8_t *, int, 
>>> int);  \
>>> +            for (int count = 1000; count > 0; --count) 
>>> {                    \
>>> +                int pq = rnd() % 31 + 
>>> 1;                                    \
>>> +                RANDOMIZE_BUFFER8_MID_WEIGHTED(filter_buf, 24 * 
>>> 24);        \
>>> +                call_ref(filter_buf0 + 4 * 24 + 4, 24, 
>>> pq);                 \
>>> +                call_new(filter_buf1 + 4 * 24 + 4, 24, 
>>> pq);                 \
>>> +                if (memcmp(filter_buf0, filter_buf1, 24 * 
>>> 24))              \
>>> + 
>>> fail();                                                 \
>>> + 
>>> }                                                               \
>>> + 
>>> }                                                                   \
>>> +        for (int j = 0; j < 24; 
>>> ++j)                                        \
>>> +            for (int i = 0; i < 24; 
>>> ++i)                                    \
>>> +                filter_buf1[24*j + i] = 0x60 + 0x40 * (i >= 4 && j >= 
>>> 4);   \
>>> +        if (check_func(h.func, "vc1dsp." #func "_bestcase")) 
>>> {              \
>>> +            declare_func_emms(AV_CPU_FLAG_MMX, void, uint8_t *, int, 
>>> int);  \
>>> +            bench_new(filter_buf1 + 4 * 24 + 4, 24, 
>>> 1);                     \
>>> +            (void) 
>>> checked_call;                                            \
>>> + 
>>> }                                                                   \
>>> +        if (check_func(h.func, "vc1dsp." #func "_worstcase")) 
>>> {             \
>>> +            declare_func_emms(AV_CPU_FLAG_MMX, void, uint8_t *, int, 
>>> int);  \
>>> +            bench_new(filter_buf1 + 4 * 24 + 4, 24, 
>>> 31);                    \
>>> +            (void) 
>>> checked_call;                                            \
>>> + 
>>> }                                                                   \
>> 
>> (not a full review, just something that cropped up in initial build 
>> testing)
>> 
>> Why do you have the "(void) checked_call;" here? The checked_call isn't 
>> something that is universally defined; its availability depends on the 
>> OS/arch combinations, on other combinations, call_new/call_ref just call 
>> the function straight away without a wrapper.
>
> OK, I missed that subtlety. My aim was to avoid the "unused variable" 
> compiler warnings generated as a result of there being twice as many 
> benchmark tests as correctness tests.

Oh, I see. I just ran into it when trying to compile on macOS, then edited 
it out and saw that it built fine there, but didn't try building for other 
platforms with the same modification.

> I believe we need separate calls of check_func() to initialise the cycle 
> counts for each benchmark, and copying the sequence of macros from 
> checkasm/blockdsp.c,

FWIW I think blockdsp.c might have been a bad example in that regard, as 
it expands the whole testcase with macros. (I chose it mainly as it was 
one of the shortest testcases.)

I think e.g. vp8dsp would have been a better example - with the toplevel 
checkasm_check_*() function just calling individual functions for the 
tests for various function groups. As check_func() can take a format 
string, you don't usually need the macro expansion for filling that in.

> I was placing the declare_func_emms() invocations inside the if block 
> that used check_func(). That meant that checked_call was initialised, 
> but since the correctness test (call_ref / call_new) was in a different 
> block scope, this checked_call declaration was never used.
>
> Upon further investigation, I think it's valid to move the 
> declare_func_emms() invocation up to the next largest block scope. That 
> means it would only appear once rather than 3 times, and it wouldn't 
> need the cast-to-void any more. Please do correct me if I'm wrong.

Yes, that seems correct to do. And looking at other examples, e.g. vp8dsp, 
that also uses such a structure, with declare_func_*() outside of 
check_func() - in a function like check_loopfilter_simple().

// Martin
_______________________________________________
ffmpeg-devel mailing list
ffmpeg-devel@ffmpeg.org
https://ffmpeg.org/mailman/listinfo/ffmpeg-devel

To unsubscribe, visit link above, or email
ffmpeg-devel-request@ffmpeg.org with subject "unsubscribe".

  reply	other threads:[~2022-03-29 11:47 UTC|newest]

Thread overview: 55+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2022-03-17 18:58 [FFmpeg-devel] [PATCH 0/6] avcodec/vc1: Arm optimisations Ben Avison
2022-03-17 18:58 ` [FFmpeg-devel] [PATCH 1/6] avcodec/vc1: Arm 64-bit NEON deblocking filter fast paths Ben Avison
2022-03-17 18:58 ` [FFmpeg-devel] [PATCH 2/6] avcodec/vc1: Arm 32-bit " Ben Avison
2022-03-17 18:58 ` [FFmpeg-devel] [PATCH 3/6] avcodec/vc1: Arm 64-bit NEON inverse transform " Ben Avison
2022-03-17 18:58 ` [FFmpeg-devel] [PATCH 4/6] avcodec/idctdsp: Arm 64-bit NEON block add and clamp " Ben Avison
2022-03-17 18:58 ` [FFmpeg-devel] [PATCH 5/6] avcodec/blockdsp: Arm 64-bit NEON block clear " Ben Avison
2022-03-17 18:58 ` [FFmpeg-devel] [PATCH 6/6] avcodec/vc1: Introduce fast path for unescaping bitstream buffer Ben Avison
2022-03-18 19:10   ` Andreas Rheinhardt
2022-03-21 15:51     ` Ben Avison
2022-03-21 20:44       ` Martin Storsjö
2022-03-19 23:06 ` [FFmpeg-devel] [PATCH 0/6] avcodec/vc1: Arm optimisations Martin Storsjö
2022-03-19 23:07   ` Martin Storsjö
2022-03-21 17:37   ` Ben Avison
2022-03-21 22:29     ` Martin Storsjö
2022-03-25 18:52 ` [FFmpeg-devel] [PATCH v2 00/10] " Ben Avison
2022-03-25 18:52   ` [FFmpeg-devel] [PATCH 01/10] checkasm: Add vc1dsp in-loop deblocking filter tests Ben Avison
2022-03-25 22:53     ` Martin Storsjö
2022-03-28 18:28       ` Ben Avison
2022-03-29 11:47         ` Martin Storsjö [this message]
2022-03-29 12:24     ` Martin Storsjö
2022-03-29 12:43     ` Martin Storsjö
2022-03-25 18:52   ` [FFmpeg-devel] [PATCH 02/10] checkasm: Add vc1dsp inverse transform tests Ben Avison
2022-03-29 12:41     ` Martin Storsjö
2022-03-25 18:52   ` [FFmpeg-devel] [PATCH 03/10] checkasm: Add idctdsp add/put-pixels-clamped tests Ben Avison
2022-03-29 13:13     ` Martin Storsjö
2022-03-29 19:56       ` Martin Storsjö
2022-03-29 20:22       ` Ben Avison
2022-03-29 20:30         ` Martin Storsjö
2022-03-25 18:52   ` [FFmpeg-devel] [PATCH 04/10] avcodec/vc1: Introduce fast path for unescaping bitstream buffer Ben Avison
2022-03-29 20:37     ` Martin Storsjö
2022-03-31 13:58       ` Ben Avison
2022-03-31 14:07         ` Martin Storsjö
2022-03-25 18:52   ` [FFmpeg-devel] [PATCH 05/10] avcodec/vc1: Arm 64-bit NEON deblocking filter fast paths Ben Avison
2022-03-30 12:35     ` Martin Storsjö
2022-03-31 15:15       ` Ben Avison
2022-03-31 21:21         ` Martin Storsjö
2022-03-25 18:52   ` [FFmpeg-devel] [PATCH 06/10] avcodec/vc1: Arm 32-bit " Ben Avison
2022-03-25 19:27     ` Lynne
2022-03-25 19:49       ` Martin Storsjö
2022-03-25 19:55         ` Lynne
2022-03-30 12:37     ` Martin Storsjö
2022-03-30 13:03     ` Martin Storsjö
2022-03-25 18:52   ` [FFmpeg-devel] [PATCH 07/10] avcodec/vc1: Arm 64-bit NEON inverse transform " Ben Avison
2022-03-30 13:49     ` Martin Storsjö
2022-03-30 14:01       ` Martin Storsjö
2022-03-31 15:37       ` Ben Avison
2022-03-31 21:32         ` Martin Storsjö
2022-03-25 18:52   ` [FFmpeg-devel] [PATCH 08/10] avcodec/idctdsp: Arm 64-bit NEON block add and clamp " Ben Avison
2022-03-30 14:14     ` Martin Storsjö
2022-03-31 16:47       ` Ben Avison
2022-03-31 21:42         ` Martin Storsjö
2022-03-25 18:52   ` [FFmpeg-devel] [PATCH 09/10] avcodec/vc1: Arm 64-bit NEON unescape fast path Ben Avison
2022-03-30 14:35     ` Martin Storsjö
2022-03-25 18:52   ` [FFmpeg-devel] [PATCH 10/10] avcodec/vc1: Arm 32-bit " Ben Avison
2022-03-30 14:35     ` Martin Storsjö

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=fa8af6e6-342f-83da-5a66-94aa555ea9e@martin.st \
    --to=martin@martin.st \
    --cc=ffmpeg-devel@ffmpeg.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link

Git Inbox Mirror of the ffmpeg-devel mailing list - see https://ffmpeg.org/mailman/listinfo/ffmpeg-devel

This inbox may be cloned and mirrored by anyone:

	git clone --mirror https://master.gitmailbox.com/ffmpegdev/0 ffmpegdev/git/0.git

	# If you have public-inbox 1.1+ installed, you may
	# initialize and index your mirror using the following commands:
	public-inbox-init -V2 ffmpegdev ffmpegdev/ https://master.gitmailbox.com/ffmpegdev \
		ffmpegdev@gitmailbox.com
	public-inbox-index ffmpegdev

Example config snippet for mirrors.


AGPL code for this site: git clone https://public-inbox.org/public-inbox.git