From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from ffbox0-bg.mplayerhq.hu (ffbox0-bg.ffmpeg.org [79.124.17.100]) by master.gitmailbox.com (Postfix) with ESMTP id 3B48947811 for ; Fri, 22 Dec 2023 23:08:45 +0000 (UTC) Received: from [127.0.1.1] (localhost [127.0.0.1]) by ffbox0-bg.mplayerhq.hu (Postfix) with ESMTP id 89C8468D142; Sat, 23 Dec 2023 01:08:42 +0200 (EET) Received: from relay9-d.mail.gandi.net (relay9-d.mail.gandi.net [217.70.183.199]) by ffbox0-bg.mplayerhq.hu (Postfix) with ESMTPS id 399A168D098 for ; Sat, 23 Dec 2023 01:08:36 +0200 (EET) Received: by mail.gandi.net (Postfix) with ESMTPSA id 88357FF80E for ; Fri, 22 Dec 2023 23:08:35 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=niedermayer.cc; s=gm1; t=1703286515; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:mime-version:mime-version:content-type:content-type: in-reply-to:in-reply-to:references:references; bh=gOBwVcGxVAwgADs0gB7wiyJvYWPsaINC1mX7CYmQejU=; b=pU+Bzm/9ZS90UEeRMXdvFKSy1B2bGEnMqpD7gZP4w3Xx10V9SDWQ7nWUP9PO3zjtBTh87p K/Cvf6xWgcRS+egdAkaVYOQbDMQsg8S/yb6KUSMkC9pqPc8EW5QOCEVR8NDjD2d4xU+lDm ar2A0zEJ83R9w5s6uKmJbbl6dpAC3nah6Z95CWTcFcate1ewRuOjj+cUd3ZcuD5qeiSVDy F8Sa/rfFzEHJ8rxm1an3yLBBfJWmTzaVmOm3GjDK3mk8hR+ALs4jn5sR9Rds29HIkcW2JI 8WWjvK8M6ZUN0fzgWs+ZNwvCYsvTuHO3Dmdt1M35Kguv9HOD3sj87/X7gCm+BA== Date: Sat, 23 Dec 2023 00:08:34 +0100 From: Michael Niedermayer To: FFmpeg development discussions and patches Message-ID: <20231222230834.GH6420@pb2> References: <20231222011549.16057-1-jamrial@gmail.com> <20231222011549.16057-2-jamrial@gmail.com> MIME-Version: 1.0 In-Reply-To: <20231222011549.16057-2-jamrial@gmail.com> X-GND-Sasl: michael@niedermayer.cc Subject: Re: [FFmpeg-devel] [PATCH 2/2] x86/takdsp: add avx2 versions of all functions X-BeenThere: ffmpeg-devel@ffmpeg.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: FFmpeg development discussions and patches List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Reply-To: FFmpeg development discussions and patches Content-Type: multipart/mixed; boundary="===============4819886938155013954==" Errors-To: ffmpeg-devel-bounces@ffmpeg.org Sender: "ffmpeg-devel" Archived-At: List-Archive: List-Post: --===============4819886938155013954== Content-Type: multipart/signed; micalg=pgp-sha256; protocol="application/pgp-signature"; boundary="x7gJcYyRf5ZnuMVj" Content-Disposition: inline --x7gJcYyRf5ZnuMVj Content-Type: text/plain; charset=us-ascii Content-Disposition: inline Content-Transfer-Encoding: quoted-printable On Thu, Dec 21, 2023 at 10:15:49PM -0300, James Almer wrote: > On an Intel Core i7 12700k: >=20 > decorrelate_ls_c: 814.3 > decorrelate_ls_sse2: 165.8 > decorrelate_ls_avx2: 101.3 > decorrelate_sf_c: 1602.6 > decorrelate_sf_sse4: 640.1 > decorrelate_sf_avx2: 324.6 > decorrelate_sm_c: 1564.8 > decorrelate_sm_sse2: 379.3 > decorrelate_sm_avx2: 203.3 > decorrelate_sr_c: 785.3 > decorrelate_sr_sse2: 176.3 > decorrelate_sr_avx2: 99.8 >=20 > Signed-off-by: James Almer on AMD Ryzen 9 3950X 16-Core Processor Illegal instruction (core dumped) threads=3D1 tests/Makefile:308: recipe for target 'fate-lossless-tak' failed make: *** [fate-lossless-tak] Error 132 (gdb) disassemble $rip-32, $rip+32 Dump of assembler code from 0x55555651a580 to 0x55555651a5c0: 0x000055555651a580: or $0x17,%al 0x000055555651a582: movdqa %xmm1,(%rdi,%rdx,1) 0x000055555651a587: add $0x10,%rdx 0x000055555651a58b: jl 0x55555651a562 0x000055555651a58d: retq 0x000055555651a58e: nop 0x000055555651a58f: nop 0x000055555651a590: shl $0x2,%edx 0x000055555651a593: add %rdx,%rdi 0x000055555651a596: add %rdx,%rsi 0x000055555651a599: neg %rdx 0x000055555651a59c: vmovd %ecx,%xmm2 =3D> 0x000055555651a5a0: vpbroadcastd %r8d,%ymm3 0x000055555651a5a6: vbroadcasti128 0x4bc751(%rip),%ymm4 # 0x55555= 69d6d00 0x000055555651a5af: vmovdqa (%rsi,%rdx,1),%ymm1 0x000055555651a5b4: vpsrad %xmm2,%ymm1,%ymm1 0x000055555651a5b8: vpmulld %ymm3,%ymm1,%ymm1 0x000055555651a5bd: vpaddd %ymm4,%ymm1,%ymm1 End of assembler dump. [...] --=20 Michael GnuPG fingerprint: 9FF2128B147EF6730BADF133611EC787040B0FAB Why not whip the teacher when the pupil misbehaves? -- Diogenes of Sinope --x7gJcYyRf5ZnuMVj Content-Type: application/pgp-signature; name="signature.asc" -----BEGIN PGP SIGNATURE----- iF0EABEIAB0WIQSf8hKLFH72cwut8TNhHseHBAsPqwUCZYYW7wAKCRBhHseHBAsP qyXeAJ4o1XBQHGspL2XLC6GsRih08A6BxACeNZiPG9k+8KKt1+axykdCrlCJy1w= =kQqz -----END PGP SIGNATURE----- --x7gJcYyRf5ZnuMVj-- --===============4819886938155013954== Content-Type: text/plain; charset="us-ascii" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit Content-Disposition: inline _______________________________________________ ffmpeg-devel mailing list ffmpeg-devel@ffmpeg.org https://ffmpeg.org/mailman/listinfo/ffmpeg-devel To unsubscribe, visit link above, or email ffmpeg-devel-request@ffmpeg.org with subject "unsubscribe". --===============4819886938155013954==--