From mboxrd@z Thu Jan  1 00:00:00 1970
Return-Path: <ffmpeg-devel-bounces@ffmpeg.org>
Received: from ffbox0-bg.mplayerhq.hu (ffbox0-bg.ffmpeg.org [79.124.17.100])
	by master.gitmailbox.com (Postfix) with ESMTPS id 9435A4B555
	for <ffmpegdev@gitmailbox.com>; Wed, 26 Mar 2025 10:30:55 +0000 (UTC)
Received: from [127.0.1.1] (localhost [127.0.0.1])
	by ffbox0-bg.mplayerhq.hu (Postfix) with ESMTP id 91E8B687B80;
	Wed, 26 Mar 2025 12:30:27 +0200 (EET)
Received: from mail-lj1-f170.google.com (mail-lj1-f170.google.com
 [209.85.208.170])
 by ffbox0-bg.mplayerhq.hu (Postfix) with ESMTPS id 3A8EA687B90
 for <ffmpeg-devel@ffmpeg.org>; Wed, 26 Mar 2025 12:30:19 +0200 (EET)
Received: by mail-lj1-f170.google.com with SMTP id
 38308e7fff4ca-3061513d353so71069551fa.2
 for <ffmpeg-devel@ffmpeg.org>; Wed, 26 Mar 2025 03:30:19 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed;
 d=martin-st.20230601.gappssmtp.com; s=20230601; t=1742985018; x=1743589818;
 darn=ffmpeg.org; 
 h=content-transfer-encoding:mime-version:references:in-reply-to
 :message-id:date:subject:to:from:from:to:cc:subject:date:message-id
 :reply-to; bh=OhP5/siB94uXmbuV+/ME+g/LHbCX7+QxBL4mmMmzXQU=;
 b=VYJlORoi6E8oohEaaiNUynVJlqWEcG/R/6sD5Tmcl1jxmMmG9Lf1GGFr3EVPHk2vw2
 JBdPU6AppqmXmKEpT/3E99USVAvgI+ihkGIIhPaddwKtkQ92zZUmQUQdHKYxMLYMDrNM
 nH2dEWLqXX5Wf2vnmXWZr4DSW1Mb8A/5qGA69hxVDujGBAAAbMgrGd6H0bcXiLBCQVc8
 SiBV8+MJfMgUrCqUEEEsySLkkcCTbhbCSoN43/xhdWLTmwTcAIKTIALEhRXl+sJcWJIR
 5Ho9Nd3SLTcCddhXIQ2CbOa9QPCXOLpHc4/l0KcBn8hmnd9HYFmE8X37lBDW1CI7wH9/
 D2rw==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed;
 d=1e100.net; s=20230601; t=1742985018; x=1743589818;
 h=content-transfer-encoding:mime-version:references:in-reply-to
 :message-id:date:subject:to:from:x-gm-message-state:from:to:cc
 :subject:date:message-id:reply-to;
 bh=OhP5/siB94uXmbuV+/ME+g/LHbCX7+QxBL4mmMmzXQU=;
 b=vjk8NTQHcIoM6j/C1Oq0J8C6PKILrdI8GUoKzBqH1ZjVjL1g2KPs2lPvbxkRmKP0Yp
 xb0yIsJqQcRCTwsGhKBtDxdmsQ6fAB5eEIxwm48HUOjB2V4K56r/I1j/LXECWxrQszeu
 s1as3GJ6Pa/vExNjqukLEO9TkIRKpXg+h76K6jALqllPJqZgJSjjxiyXgiuXauEUFBZE
 NUtjFs+/Np8QDoJMoQplIDejlnhGEQQX086R2/0VVkKSylDWQ0tOOGZ2f3Mb7X8mQkkL
 d+43qm5K1gSZISaoHAORnqamu9F8Bpyb2c8XWQo10tA53aPOTvLTz+hkc+Tx32ZGYfqL
 Zh2g==
X-Gm-Message-State: AOJu0YznzOueSBpLJiLgHAKS648ald461nIXWIpPbgKFUv0KbogLfFHg
 x+y88H1VKeT4mrIDw4TdvH0te5YCohQtZ3Y2fnsn0tLGA93sNdo17bAezv5tXX4ovDa11aRPy1C
 H+g==
X-Gm-Gg: ASbGnctiW3EUOUz91JXnFxs4WoiqvxkwTfWc7Gfn6gp98RgSaN8diXCWs5LuzSi4iph
 URFvepZ8+Ws8gYJJzs/KTTjuc6SUSQM9y5v8VXVQoFpReeBGa69YD7JYwh1SWvXVgAoQnO2006P
 YbHNFW16ejJdB6wcX3RkWU6bim1q3K2tKGYy+1Zj/w0T/geGP3AOySNLWdb303IgBgKyXTE3FG7
 OkVbsTJ72HVhBripLTXHKZTnK/e2KtC1OSxjEx2YJwO20FQRczyRrILJeEThmeIPMzcNQwlifoD
 HgwSo0Xc3bGl/BX4M/kOIl6OHGLSPIGeL7OtKzEq3/SxNLZXFYpXwTRUYQmb3/Op8oW1Lqbv2LX
 qWCsYFKd1TZA=
X-Google-Smtp-Source: AGHT+IFKV9gLR7hzY+ntw263QE6rLAFy8slLMUsziPA52tJZkFlbrZzSVHkaUateAIVyOZuVVZtIqQ==
X-Received: by 2002:a05:651c:506:b0:30b:b204:6b98 with SMTP id
 38308e7fff4ca-30d7e21a455mr78100431fa.12.1742985018177; 
 Wed, 26 Mar 2025 03:30:18 -0700 (PDT)
Received: from localhost (dsl-tkubng21-50de30-198.dhcp.inet.fi.
 [80.222.48.198]) by smtp.gmail.com with ESMTPSA id
 38308e7fff4ca-30d7d9107eesm21404571fa.92.2025.03.26.03.30.17
 for <ffmpeg-devel@ffmpeg.org>
 (version=TLS1 cipher=AES128-SHA bits=128/128);
 Wed, 26 Mar 2025 03:30:17 -0700 (PDT)
From: =?UTF-8?q?Martin=20Storsj=C3=B6?= <martin@martin.st>
To: ffmpeg-devel@ffmpeg.org
Date: Wed, 26 Mar 2025 12:30:14 +0200
Message-Id: <20250326103015.44296-3-martin@martin.st>
X-Mailer: git-send-email 2.39.5 (Apple Git-154)
In-Reply-To: <20250326103015.44296-1-martin@martin.st>
References: <20250326103015.44296-1-martin@martin.st>
MIME-Version: 1.0
Subject: [FFmpeg-devel] [PATCH 3/4] checkasm: hevc_pel: Use helpers for
 checking for writes out of bounds
X-BeenThere: ffmpeg-devel@ffmpeg.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: FFmpeg development discussions and patches <ffmpeg-devel.ffmpeg.org>
List-Unsubscribe: <https://ffmpeg.org/mailman/options/ffmpeg-devel>,
 <mailto:ffmpeg-devel-request@ffmpeg.org?subject=unsubscribe>
List-Archive: <https://ffmpeg.org/pipermail/ffmpeg-devel>
List-Post: <mailto:ffmpeg-devel@ffmpeg.org>
List-Help: <mailto:ffmpeg-devel-request@ffmpeg.org?subject=help>
List-Subscribe: <https://ffmpeg.org/mailman/listinfo/ffmpeg-devel>,
 <mailto:ffmpeg-devel-request@ffmpeg.org?subject=subscribe>
Reply-To: FFmpeg development discussions and patches <ffmpeg-devel@ffmpeg.org>
Content-Type: text/plain; charset="us-ascii"
Content-Transfer-Encoding: 7bit
Errors-To: ffmpeg-devel-bounces@ffmpeg.org
Sender: "ffmpeg-devel" <ffmpeg-devel-bounces@ffmpeg.org>
Archived-At: <https://master.gitmailbox.com/ffmpegdev/20250326103015.44296-3-martin@martin.st/>
List-Archive: <https://master.gitmailbox.com/ffmpegdev/>
List-Post: <mailto:ffmpegdev@gitmailbox.com>

This allows catching whether the functions write outside of
the designated rectangle, and if run with "checkasm -v", it also
prints out on which side of the rectangle the overwrite was.
---
 tests/checkasm/hevc_pel.c | 157 ++++++++++++++++++++++----------------
 1 file changed, 90 insertions(+), 67 deletions(-)

diff --git a/tests/checkasm/hevc_pel.c b/tests/checkasm/hevc_pel.c
index b9417832e5..9a83613915 100644
--- a/tests/checkasm/hevc_pel.c
+++ b/tests/checkasm/hevc_pel.c
@@ -44,9 +44,14 @@ static const int offsets[] = {0, 255, -1 };
             uint32_t r = rnd() & mask;               \
             AV_WN32A(buf0 + k, r);                   \
             AV_WN32A(buf1 + k, r);                   \
-            if (k >= BUF_SIZE)                       \
-                continue;                            \
-            r = rnd();                               \
+        }                                            \
+    } while (0)
+
+#define randomize_buffers_dst()                      \
+    do {                                             \
+        int k;                                       \
+        for (k = 0; k < BUF_SIZE; k += 4) {          \
+            uint32_t r = rnd();                      \
             AV_WN32A(dst0 + k, r);                   \
             AV_WN32A(dst1 + k, r);                   \
         }                                            \
@@ -100,6 +105,7 @@ static void checkasm_check_hevc_qpel(void)
                                    "put_hevc_%s%d_%d", type, sizes[size], bit_depth)) {
                         int16_t *dstw0 = (int16_t *) dst0, *dstw1 = (int16_t *) dst1;
                         randomize_buffers();
+                        randomize_buffers_dst();
                         call_ref(dstw0, src0, sizes[size] * SIZEOF_PIXEL, sizes[size], i, j, sizes[size]);
                         call_new(dstw1, src1, sizes[size] * SIZEOF_PIXEL, sizes[size], i, j, sizes[size]);
                         checkasm_check(int16_t, dstw0, MAX_PB_SIZE * sizeof(int16_t),
@@ -118,8 +124,8 @@ static void checkasm_check_hevc_qpel_uni(void)
 {
     LOCAL_ALIGNED_32(uint8_t, buf0, [BUF_SIZE + SRC_EXTRA]);
     LOCAL_ALIGNED_32(uint8_t, buf1, [BUF_SIZE + SRC_EXTRA]);
-    LOCAL_ALIGNED_32(uint8_t, dst0, [BUF_SIZE]);
-    LOCAL_ALIGNED_32(uint8_t, dst1, [BUF_SIZE]);
+    PIXEL_RECT(dst0, 64, 64);
+    PIXEL_RECT(dst1, 64, 64);
 
     HEVCDSPContext h;
     int size, bit_depth, i, j;
@@ -143,16 +149,18 @@ static void checkasm_check_hevc_qpel_uni(void)
                     if (check_func(h.put_hevc_qpel_uni[size][j][i],
                                    "put_hevc_%s%d_%d", type, sizes[size], bit_depth)) {
                         randomize_buffers();
-                        call_ref(dst0, sizes[size] * SIZEOF_PIXEL,
+                        CLEAR_PIXEL_RECT(dst0);
+                        CLEAR_PIXEL_RECT(dst1);
+                        call_ref(dst0, dst0_stride,
                                  src0, sizes[size] * SIZEOF_PIXEL,
                                  sizes[size], i, j, sizes[size]);
-                        call_new(dst1, sizes[size] * SIZEOF_PIXEL,
+                        call_new(dst1, dst1_stride,
                                  src1, sizes[size] * SIZEOF_PIXEL,
                                  sizes[size], i, j, sizes[size]);
-                        checkasm_check_pixel(dst0, sizes[size] * SIZEOF_PIXEL,
-                                             dst1, sizes[size] * SIZEOF_PIXEL,
-                                             size[sizes], size[sizes], "dst");
-                        bench_new(dst1, sizes[size] * SIZEOF_PIXEL,
+                        checkasm_check_pixel_padded(dst0, dst0_stride,
+                                                    dst1, dst1_stride,
+                                                    size[sizes], size[sizes], "dst");
+                        bench_new(dst1, dst1_stride,
                                   src1, sizes[size] * SIZEOF_PIXEL,
                                   sizes[size], i, j, sizes[size]);
                     }
@@ -167,8 +175,8 @@ static void checkasm_check_hevc_qpel_uni_w(void)
 {
     LOCAL_ALIGNED_32(uint8_t, buf0, [BUF_SIZE + SRC_EXTRA]);
     LOCAL_ALIGNED_32(uint8_t, buf1, [BUF_SIZE + SRC_EXTRA]);
-    LOCAL_ALIGNED_32(uint8_t, dst0, [BUF_SIZE]);
-    LOCAL_ALIGNED_32(uint8_t, dst1, [BUF_SIZE]);
+    PIXEL_RECT(dst0, 64, 64);
+    PIXEL_RECT(dst1, 64, 64);
 
     HEVCDSPContext h;
     int size, bit_depth, i, j;
@@ -196,16 +204,18 @@ static void checkasm_check_hevc_qpel_uni_w(void)
                             for (wx = weights; *wx >= 0; wx++) {
                                 for (ox = offsets; *ox >= 0; ox++) {
                                     randomize_buffers();
-                                    call_ref(dst0, sizes[size] * SIZEOF_PIXEL,
+                                    CLEAR_PIXEL_RECT(dst0);
+                                    CLEAR_PIXEL_RECT(dst1);
+                                    call_ref(dst0, dst0_stride,
                                              src0, sizes[size] * SIZEOF_PIXEL,
                                              sizes[size], *denom, *wx, *ox, i, j, sizes[size]);
-                                    call_new(dst1, sizes[size] * SIZEOF_PIXEL,
+                                    call_new(dst1, dst1_stride,
                                              src1, sizes[size] * SIZEOF_PIXEL,
                                              sizes[size], *denom, *wx, *ox, i, j, sizes[size]);
-                                    checkasm_check_pixel(dst0, sizes[size] * SIZEOF_PIXEL,
-                                                         dst1, sizes[size] * SIZEOF_PIXEL,
-                                                         size[sizes], size[sizes], "dst");
-                                    bench_new(dst1, sizes[size] * SIZEOF_PIXEL,
+                                    checkasm_check_pixel_padded(dst0, dst0_stride,
+                                                                dst1, dst1_stride,
+                                                                size[sizes], size[sizes], "dst");
+                                    bench_new(dst1, dst1_stride,
                                               src1, sizes[size] * SIZEOF_PIXEL,
                                               sizes[size], *denom, *wx, *ox, i, j, sizes[size]);
                                 }
@@ -223,8 +233,8 @@ static void checkasm_check_hevc_qpel_bi(void)
 {
     LOCAL_ALIGNED_32(uint8_t, buf0, [BUF_SIZE + SRC_EXTRA]);
     LOCAL_ALIGNED_32(uint8_t, buf1, [BUF_SIZE + SRC_EXTRA]);
-    LOCAL_ALIGNED_32(uint8_t, dst0, [BUF_SIZE]);
-    LOCAL_ALIGNED_32(uint8_t, dst1, [BUF_SIZE]);
+    PIXEL_RECT(dst0, 64, 64);
+    PIXEL_RECT(dst1, 64, 64);
     LOCAL_ALIGNED_32(int16_t, ref0, [BUF_SIZE]);
     LOCAL_ALIGNED_32(int16_t, ref1, [BUF_SIZE]);
 
@@ -251,16 +261,18 @@ static void checkasm_check_hevc_qpel_bi(void)
                     if (check_func(h.put_hevc_qpel_bi[size][j][i],
                                    "put_hevc_%s%d_%d", type, sizes[size], bit_depth)) {
                         randomize_buffers_ref();
-                        call_ref(dst0, sizes[size] * SIZEOF_PIXEL,
+                        CLEAR_PIXEL_RECT(dst0);
+                        CLEAR_PIXEL_RECT(dst1);
+                        call_ref(dst0, dst0_stride,
                                  src0, sizes[size] * SIZEOF_PIXEL,
                                  ref0, sizes[size], i, j, sizes[size]);
-                        call_new(dst1, sizes[size] * SIZEOF_PIXEL,
+                        call_new(dst1, dst1_stride,
                                  src1, sizes[size] * SIZEOF_PIXEL,
                                  ref1, sizes[size], i, j, sizes[size]);
-                        checkasm_check_pixel(dst0, sizes[size] * SIZEOF_PIXEL,
-                                             dst1, sizes[size] * SIZEOF_PIXEL,
-                                             size[sizes], size[sizes], "dst");
-                        bench_new(dst1, sizes[size] * SIZEOF_PIXEL,
+                        checkasm_check_pixel_padded(dst0, dst0_stride,
+                                                    dst1, dst1_stride,
+                                                    size[sizes], size[sizes], "dst");
+                        bench_new(dst1, dst1_stride,
                                   src1, sizes[size] * SIZEOF_PIXEL,
                                   ref1, sizes[size], i, j, sizes[size]);
                     }
@@ -275,8 +287,8 @@ static void checkasm_check_hevc_qpel_bi_w(void)
 {
     LOCAL_ALIGNED_32(uint8_t, buf0, [BUF_SIZE + SRC_EXTRA]);
     LOCAL_ALIGNED_32(uint8_t, buf1, [BUF_SIZE + SRC_EXTRA]);
-    LOCAL_ALIGNED_32(uint8_t, dst0, [BUF_SIZE]);
-    LOCAL_ALIGNED_32(uint8_t, dst1, [BUF_SIZE]);
+    PIXEL_RECT(dst0, 64, 64);
+    PIXEL_RECT(dst1, 64, 64);
     LOCAL_ALIGNED_32(int16_t, ref0, [BUF_SIZE]);
     LOCAL_ALIGNED_32(int16_t, ref1, [BUF_SIZE]);
 
@@ -308,16 +320,18 @@ static void checkasm_check_hevc_qpel_bi_w(void)
                             for (wx = weights; *wx >= 0; wx++) {
                                 for (ox = offsets; *ox >= 0; ox++) {
                                     randomize_buffers_ref();
-                                    call_ref(dst0, sizes[size] * SIZEOF_PIXEL,
+                                    CLEAR_PIXEL_RECT(dst0);
+                                    CLEAR_PIXEL_RECT(dst1);
+                                    call_ref(dst0, dst0_stride,
                                              src0, sizes[size] * SIZEOF_PIXEL,
                                              ref0, sizes[size], *denom, *wx, *wx, *ox, *ox, i, j, sizes[size]);
-                                    call_new(dst1, sizes[size] * SIZEOF_PIXEL,
+                                    call_new(dst1, dst1_stride,
                                              src1, sizes[size] * SIZEOF_PIXEL,
                                              ref1, sizes[size], *denom, *wx, *wx, *ox, *ox, i, j, sizes[size]);
-                                    checkasm_check_pixel(dst0, sizes[size] * SIZEOF_PIXEL,
-                                                         dst1, sizes[size] * SIZEOF_PIXEL,
-                                                         size[sizes], size[sizes], "dst");
-                                    bench_new(dst1, sizes[size] * SIZEOF_PIXEL,
+                                    checkasm_check_pixel_padded(dst0, dst0_stride,
+                                                                dst1, dst1_stride,
+                                                                size[sizes], size[sizes], "dst");
+                                    bench_new(dst1, dst1_stride,
                                               src1, sizes[size] * SIZEOF_PIXEL,
                                               ref1, sizes[size], *denom, *wx, *wx, *ox, *ox, i, j, sizes[size]);
                                 }
@@ -364,6 +378,7 @@ static void checkasm_check_hevc_epel(void)
                                    "put_hevc_%s%d_%d", type, sizes[size], bit_depth)) {
                         int16_t *dstw0 = (int16_t *) dst0, *dstw1 = (int16_t *) dst1;
                         randomize_buffers();
+                        randomize_buffers_dst();
                         call_ref(dstw0, src0, sizes[size] * SIZEOF_PIXEL, sizes[size], i, j, sizes[size]);
                         call_new(dstw1, src1, sizes[size] * SIZEOF_PIXEL, sizes[size], i, j, sizes[size]);
                         checkasm_check(int16_t, dstw0, MAX_PB_SIZE * sizeof(int16_t),
@@ -382,8 +397,8 @@ static void checkasm_check_hevc_epel_uni(void)
 {
     LOCAL_ALIGNED_32(uint8_t, buf0, [BUF_SIZE]);
     LOCAL_ALIGNED_32(uint8_t, buf1, [BUF_SIZE]);
-    LOCAL_ALIGNED_32(uint8_t, dst0, [BUF_SIZE]);
-    LOCAL_ALIGNED_32(uint8_t, dst1, [BUF_SIZE]);
+    PIXEL_RECT(dst0, 64, 64);
+    PIXEL_RECT(dst1, 64, 64);
 
     HEVCDSPContext h;
     int size, bit_depth, i, j;
@@ -407,16 +422,18 @@ static void checkasm_check_hevc_epel_uni(void)
                     if (check_func(h.put_hevc_epel_uni[size][j][i],
                                    "put_hevc_%s%d_%d", type, sizes[size], bit_depth)) {
                         randomize_buffers();
-                        call_ref(dst0, sizes[size] * SIZEOF_PIXEL,
+                        CLEAR_PIXEL_RECT(dst0);
+                        CLEAR_PIXEL_RECT(dst1);
+                        call_ref(dst0, dst0_stride,
                                  src0, sizes[size] * SIZEOF_PIXEL,
                                  sizes[size], i, j, sizes[size]);
-                        call_new(dst1, sizes[size] * SIZEOF_PIXEL,
+                        call_new(dst1, dst1_stride,
                                  src1, sizes[size] * SIZEOF_PIXEL,
                                  sizes[size], i, j, sizes[size]);
-                        checkasm_check_pixel(dst0, sizes[size] * SIZEOF_PIXEL,
-                                             dst1, sizes[size] * SIZEOF_PIXEL,
-                                             size[sizes], size[sizes], "dst");
-                        bench_new(dst1, sizes[size] * SIZEOF_PIXEL,
+                        checkasm_check_pixel_padded(dst0, dst0_stride,
+                                                    dst1, dst1_stride,
+                                                    size[sizes], size[sizes], "dst");
+                        bench_new(dst1, dst1_stride,
                                   src1, sizes[size] * SIZEOF_PIXEL,
                                   sizes[size], i, j, sizes[size]);
                     }
@@ -431,8 +448,8 @@ static void checkasm_check_hevc_epel_uni_w(void)
 {
     LOCAL_ALIGNED_32(uint8_t, buf0, [BUF_SIZE]);
     LOCAL_ALIGNED_32(uint8_t, buf1, [BUF_SIZE]);
-    LOCAL_ALIGNED_32(uint8_t, dst0, [BUF_SIZE]);
-    LOCAL_ALIGNED_32(uint8_t, dst1, [BUF_SIZE]);
+    PIXEL_RECT(dst0, 64, 64);
+    PIXEL_RECT(dst1, 64, 64);
 
     HEVCDSPContext h;
     int size, bit_depth, i, j;
@@ -460,16 +477,18 @@ static void checkasm_check_hevc_epel_uni_w(void)
                             for (wx = weights; *wx >= 0; wx++) {
                                 for (ox = offsets; *ox >= 0; ox++) {
                                     randomize_buffers();
-                                    call_ref(dst0, sizes[size] * SIZEOF_PIXEL,
+                                    CLEAR_PIXEL_RECT(dst0);
+                                    CLEAR_PIXEL_RECT(dst1);
+                                    call_ref(dst0, dst0_stride,
                                              src0, sizes[size] * SIZEOF_PIXEL,
                                              sizes[size], *denom, *wx, *ox, i, j, sizes[size]);
-                                    call_new(dst1, sizes[size] * SIZEOF_PIXEL,
+                                    call_new(dst1, dst1_stride,
                                              src1, sizes[size] * SIZEOF_PIXEL,
                                              sizes[size], *denom, *wx, *ox, i, j, sizes[size]);
-                                    checkasm_check_pixel(dst0, sizes[size] * SIZEOF_PIXEL,
-                                                         dst1, sizes[size] * SIZEOF_PIXEL,
-                                                         size[sizes], size[sizes], "dst");
-                                    bench_new(dst1, sizes[size] * SIZEOF_PIXEL,
+                                    checkasm_check_pixel_padded(dst0, dst0_stride,
+                                                                dst1, dst1_stride,
+                                                                size[sizes], size[sizes], "dst");
+                                    bench_new(dst1, dst1_stride,
                                               src1, sizes[size] * SIZEOF_PIXEL,
                                               sizes[size], *denom, *wx, *ox, i, j, sizes[size]);
                                 }
@@ -487,8 +506,8 @@ static void checkasm_check_hevc_epel_bi(void)
 {
     LOCAL_ALIGNED_32(uint8_t, buf0, [BUF_SIZE]);
     LOCAL_ALIGNED_32(uint8_t, buf1, [BUF_SIZE]);
-    LOCAL_ALIGNED_32(uint8_t, dst0, [BUF_SIZE]);
-    LOCAL_ALIGNED_32(uint8_t, dst1, [BUF_SIZE]);
+    PIXEL_RECT(dst0, 64, 64);
+    PIXEL_RECT(dst1, 64, 64);
     LOCAL_ALIGNED_32(int16_t, ref0, [BUF_SIZE]);
     LOCAL_ALIGNED_32(int16_t, ref1, [BUF_SIZE]);
 
@@ -515,16 +534,18 @@ static void checkasm_check_hevc_epel_bi(void)
                     if (check_func(h.put_hevc_epel_bi[size][j][i],
                                    "put_hevc_%s%d_%d", type, sizes[size], bit_depth)) {
                         randomize_buffers_ref();
-                        call_ref(dst0, sizes[size] * SIZEOF_PIXEL,
+                        CLEAR_PIXEL_RECT(dst0);
+                        CLEAR_PIXEL_RECT(dst1);
+                        call_ref(dst0, dst0_stride,
                                  src0, sizes[size] * SIZEOF_PIXEL,
                                  ref0, sizes[size], i, j, sizes[size]);
-                        call_new(dst1, sizes[size] * SIZEOF_PIXEL,
+                        call_new(dst1, dst1_stride,
                                  src1, sizes[size] * SIZEOF_PIXEL,
                                  ref1, sizes[size], i, j, sizes[size]);
-                        checkasm_check_pixel(dst0, sizes[size] * SIZEOF_PIXEL,
-                                             dst1, sizes[size] * SIZEOF_PIXEL,
-                                             size[sizes], size[sizes], "dst");
-                        bench_new(dst1, sizes[size] * SIZEOF_PIXEL,
+                        checkasm_check_pixel_padded(dst0, dst0_stride,
+                                                    dst1, dst1_stride,
+                                                    size[sizes], size[sizes], "dst");
+                        bench_new(dst1, dst1_stride,
                                   src1, sizes[size] * SIZEOF_PIXEL,
                                   ref1, sizes[size], i, j, sizes[size]);
                     }
@@ -539,8 +560,8 @@ static void checkasm_check_hevc_epel_bi_w(void)
 {
     LOCAL_ALIGNED_32(uint8_t, buf0, [BUF_SIZE]);
     LOCAL_ALIGNED_32(uint8_t, buf1, [BUF_SIZE]);
-    LOCAL_ALIGNED_32(uint8_t, dst0, [BUF_SIZE]);
-    LOCAL_ALIGNED_32(uint8_t, dst1, [BUF_SIZE]);
+    PIXEL_RECT(dst0, 64, 64);
+    PIXEL_RECT(dst1, 64, 64);
     LOCAL_ALIGNED_32(int16_t, ref0, [BUF_SIZE]);
     LOCAL_ALIGNED_32(int16_t, ref1, [BUF_SIZE]);
 
@@ -572,16 +593,18 @@ static void checkasm_check_hevc_epel_bi_w(void)
                             for (wx = weights; *wx >= 0; wx++) {
                                 for (ox = offsets; *ox >= 0; ox++) {
                                     randomize_buffers_ref();
-                                    call_ref(dst0, sizes[size] * SIZEOF_PIXEL,
+                                    CLEAR_PIXEL_RECT(dst0);
+                                    CLEAR_PIXEL_RECT(dst1);
+                                    call_ref(dst0, dst0_stride,
                                              src0, sizes[size] * SIZEOF_PIXEL,
                                              ref0, sizes[size], *denom, *wx, *wx, *ox, *ox, i, j, sizes[size]);
-                                    call_new(dst1, sizes[size] * SIZEOF_PIXEL,
+                                    call_new(dst1, dst1_stride,
                                              src1, sizes[size] * SIZEOF_PIXEL,
                                              ref1, sizes[size], *denom, *wx, *wx, *ox, *ox, i, j, sizes[size]);
-                                    checkasm_check_pixel(dst0, sizes[size] * SIZEOF_PIXEL,
-                                                         dst1, sizes[size] * SIZEOF_PIXEL,
-                                                         size[sizes], size[sizes], "dst");
-                                    bench_new(dst1, sizes[size] * SIZEOF_PIXEL,
+                                    checkasm_check_pixel_padded(dst0, dst0_stride,
+                                                                dst1, dst1_stride,
+                                                                size[sizes], size[sizes], "dst");
+                                    bench_new(dst1, dst1_stride,
                                               src1, sizes[size] * SIZEOF_PIXEL,
                                               ref1, sizes[size], *denom, *wx, *wx, *ox, *ox, i, j, sizes[size]);
                                 }
-- 
2.39.5 (Apple Git-154)

_______________________________________________
ffmpeg-devel mailing list
ffmpeg-devel@ffmpeg.org
https://ffmpeg.org/mailman/listinfo/ffmpeg-devel

To unsubscribe, visit link above, or email
ffmpeg-devel-request@ffmpeg.org with subject "unsubscribe".