Date: Fri, 30 May 2025 04:23:12 +0200
From: Michael Niedermayer
To: FFmpeg development discussions and patches
Subject: Re: [FFmpeg-devel] [PATCH v3 14/17] swscale/x86: add SIMD backend
Message-ID: <20250530022312.GH29660@pb2>
In-Reply-To: <20250527081242.22892-15-ffmpeg@haasn.xyz>
References: <20250527081242.22892-1-ffmpeg@haasn.xyz> <20250527081242.22892-15-ffmpeg@haasn.xyz>

On Tue, May 27, 2025 at 09:55:33AM +0200, Niklas Haas wrote:
> From: Niklas Haas
>
> This covers most 8-bit and 16-bit ops, and some 32-bit ops. It also covers all
> floating point operations. While this is not yet 100% coverage, it's good
> enough for the vast majority of formats out there.
>
> Of special note is the packed shuffle fast path, which uses pshufb at vector
> sizes up to AVX512.
> ---
>  libswscale/ops.c              |    4 +
>  libswscale/x86/Makefile       |    3 +
>  libswscale/x86/ops.c          |  722 +++++++++++++++++++++++
>  libswscale/x86/ops_common.asm |  305 ++++++++++
>  libswscale/x86/ops_float.asm  |  389 ++++++++++++
>  libswscale/x86/ops_int.asm    | 1049 +++++++++++++++++++++++++++++++++
>  6 files changed, 2472 insertions(+)
>  create mode 100644 libswscale/x86/ops.c
>  create mode 100644 libswscale/x86/ops_common.asm
>  create mode 100644 libswscale/x86/ops_float.asm
>  create mode 100644 libswscale/x86/ops_int.asm

seems to break on x86-32 linux

...
src/libswscale/x86/ops_float.asm:389: error: symbol `m9' undefined
src/libswscale/x86/ops_float.asm:378: ... from macro `linear_fns' defined here
src/libswscale/x86/ops_float.asm:339: ... from macro `linear_mask' defined here
src/libswscale/x86/ops_float.asm:330: ... from macro `linear_inner' defined here
src/libswscale/x86/ops_common.asm:296: ... from macro `IF' defined here
src//libavutil/x86/x86inc.asm:1639: ... from macro `movdqa' defined here
src//libavutil/x86/x86inc.asm:1501: ... from macro `RUN_AVX_INSTR' defined here
src//libavutil/x86/x86inc.asm:1996: ... from macro `vmovdqa' defined here
src/libswscale/x86/ops_float.asm:389: error: symbol `m10' undefined
src/libswscale/x86/ops_float.asm:378: ... from macro `linear_fns' defined here
src/libswscale/x86/ops_float.asm:339: ... from macro `linear_mask' defined here
src/libswscale/x86/ops_float.asm:331: ... from macro `linear_inner' defined here
src/libswscale/x86/ops_common.asm:296: ... from macro `IF' defined here
src//libavutil/x86/x86inc.asm:1639: ... from macro `movdqa' defined here
src//libavutil/x86/x86inc.asm:1501: ... from macro `RUN_AVX_INSTR' defined here
src//libavutil/x86/x86inc.asm:1996: ... from macro `vmovdqa' defined here
src/libswscale/x86/ops_float.asm:389: error: symbol `m11' undefined
src/libswscale/x86/ops_float.asm:378: ... from macro `linear_fns' defined here
src/libswscale/x86/ops_float.asm:339: ... from macro `linear_mask' defined here
src/libswscale/x86/ops_float.asm:332: ... from macro `linear_inner' defined here
src/libswscale/x86/ops_common.asm:296: ... from macro `IF' defined here
src//libavutil/x86/x86inc.asm:1639: ... from macro `movdqa' defined here
src//libavutil/x86/x86inc.asm:1501: ... from macro `RUN_AVX_INSTR' defined here
src//libavutil/x86/x86inc.asm:1996: ... from macro `vmovdqa' defined here
make: *** [src/ffbuild/common.mak:103: libswscale/x86/ops_float.o] Error 1

[...]

-- 
Michael     GnuPG fingerprint: 9FF2128B147EF6730BADF133611EC787040B0FAB

Old school: Use the lowest level language in which you can solve the problem
            conveniently.
New school: Use the highest level language in which the latest supercomputer
            can solve the problem without the user falling asleep waiting.