From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from ffbox0-bg.mplayerhq.hu (ffbox0-bg.ffmpeg.org [79.124.17.100]) by master.gitmailbox.com (Postfix) with ESMTPS id 59A244105D for ; Mon, 17 Feb 2025 08:51:20 +0000 (UTC) Received: from [127.0.1.1] (localhost [127.0.0.1]) by ffbox0-bg.mplayerhq.hu (Postfix) with ESMTP id 44B5268BF61; Mon, 17 Feb 2025 10:51:17 +0200 (EET) Received: from mail-lf1-f48.google.com (mail-lf1-f48.google.com [209.85.167.48]) by ffbox0-bg.mplayerhq.hu (Postfix) with ESMTPS id 5F6DD68B3CB for ; Mon, 17 Feb 2025 10:51:10 +0200 (EET) Received: by mail-lf1-f48.google.com with SMTP id 2adb3069b0e04-545316f80beso1469939e87.1 for ; Mon, 17 Feb 2025 00:51:10 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=martin-st.20230601.gappssmtp.com; s=20230601; t=1739782269; x=1740387069; darn=ffmpeg.org; h=mime-version:references:message-id:in-reply-to:subject:cc:to:from :date:from:to:cc:subject:date:message-id:reply-to; bh=eTMbMev2z4ja6h3ORnuBW0g2wnNsqgZuL1kmbjpKZhg=; b=2dvHd+GIjoO819fbYm6qUKHrkcAQn6iP/tCCfpfuugSOdhn879kG5fxJv4g4sgSuAM Plj9tr20AXSRENW0E1F1G+n3b0654y8X4lXNh8rKNnPusD7+u7z944Wl9isb5h+8B7N4 GsEuTHgQ4t2vdh3zj5xcmkYqPON2YAWbs5GlCFhWwHKFYAOGkgN3SWC+bZwQZ74qdMfI GquOu1nzZ5+6tPKsR9ykEbzhcsRDgBCJHcZsvnOY+8pPWvMHd5KIG5O8V2v3wTWAFC48 xMYA2UevQdO72vq0WA6XdgRAOMOYaLwk5zVw5UDUybvvgrO+5UimFqXbmqSobIJNQVt2 7V+Q== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1739782269; x=1740387069; h=mime-version:references:message-id:in-reply-to:subject:cc:to:from :date:x-gm-message-state:from:to:cc:subject:date:message-id:reply-to; bh=eTMbMev2z4ja6h3ORnuBW0g2wnNsqgZuL1kmbjpKZhg=; b=nqeVWv72ck+nGvWGbY7JZikucAOBL3aueQIDkNcLdxugWBL2W11O+169XGuzMJT2+W 6qn3fHq1wFgGfofEWP7KRYM9h0u3FkmkuILomNC9hXUGTKNRddCw8XvlRgjjxYKSEAke gb2ARMIjyk2FfkFe/vwVzWCBOTVFBeq0khLEdEHeHfxFlrr73+1EkMChtwFKipRHG4Nv EzHX7UDGc9PQGD/FYyWyWJBORlQ0PYqeA9ychiw1NC8FV3wah9JEOe5HYGihWRsdyv5q 1KzdVhRPpt/8Lxxqn0MbhqGzpUlNGQ/ocNcG/LJ5vUUUacxvq5KL7Z3IU2dVDVwMD9mF 7NHg== X-Gm-Message-State: AOJu0YxmgN3t+L5VGSzfT9Xe/CP/zk8hgpB4t+7UjkUEdmDLAQypSy88 t/dQMPHYgjMI3wW8Ni+MZxPkP97SCTt4q/QlUIQd7Z/+mfmziiSd3soZpS0ePuDAnEJ7oJ6d3lv n/A== X-Gm-Gg: ASbGncsp0nGMvWydWz0pQKIF3guRgeeaNSJIOrOw2jLR3a/NwR6fvr94R4i8ipUS28W pvrf1Z4xymf2X5+Glw0KKJU88w96oE7tDjSkSUWMfIknBQUdh1BtWkc5JhvGdWG/nJaK4GDE7kT c2ulodUHloI+5gs1Uhe9Val4lEC+W2neuqq4IfK3n4Z/sCN0ZsC70AiJe4WidoOSN6FbxlawuC4 WNYPmKl5YJ5FSooI3/jIGK9Pp8x8Dzw6re9z4VjZhTHEQYCiQII4ho2QzOaA5jYoc3Y2rdZeA1X PgT0ET47xHo9qlbDobZGZSLBgLUJEr6+5WJyO5roHJws65DtQ7C2DEvUjGckIycIEywuojSnSNd ZM7O3oUEuU8s= X-Google-Smtp-Source: AGHT+IHkyLGlz1hG+K2scoQEFhGelcKQKJEPaQHVsLN9uPHe9pIhBEhEpYOEP/8KURzkA2cUXmsOUA== X-Received: by 2002:a05:6512:3d15:b0:545:ae6:d73f with SMTP id 2adb3069b0e04-5452fe71c09mr2802241e87.46.1739782269089; Mon, 17 Feb 2025 00:51:09 -0800 (PST) Received: from tunnel335574-pt.tunnel.tserv24.sto1.ipv6.he.net (tunnel335574-pt.tunnel.tserv24.sto1.ipv6.he.net. [2001:470:27:11::2]) by smtp.gmail.com with ESMTPSA id 2adb3069b0e04-5461e202368sm495918e87.61.2025.02.17.00.51.08 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Mon, 17 Feb 2025 00:51:08 -0800 (PST) Date: Mon, 17 Feb 2025 10:51:06 +0200 (EET) From: =?ISO-8859-15?Q?Martin_Storsj=F6?= To: Krzysztof Pyrkosz via ffmpeg-devel In-Reply-To: <20250211214310.116203-2-ffmpeg@szaka.eu> Message-ID: <58a22ef0-b630-2357-13fa-e683e06da6c1@martin.st> References: <20250211214310.116203-2-ffmpeg@szaka.eu> MIME-Version: 1.0 Subject: Re: [FFmpeg-devel] [PATCH] swscale/aarch64/rgb24toyv12: skip early right shift by 2 X-BeenThere: ffmpeg-devel@ffmpeg.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: FFmpeg development discussions and patches List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Reply-To: FFmpeg development discussions and patches Cc: Krzysztof Pyrkosz Content-Transfer-Encoding: 7bit Content-Type: text/plain; charset="us-ascii"; Format="flowed" Errors-To: ffmpeg-devel-bounces@ffmpeg.org Sender: "ffmpeg-devel" Archived-At: List-Archive: List-Post: On Tue, 11 Feb 2025, Krzysztof Pyrkosz via ffmpeg-devel wrote: > It's a minor improvement that shaves off 5-8% from the execution time. > Instead of shifting by 2 right away and by 7 soon after, shift by 9 one > time. > > Times before and after: > > A78: > rgb24toyv12_16_200_neon: 5366.8 ( 3.62x) > rgb24toyv12_128_60_neon: 13574.0 ( 3.34x) > rgb24toyv12_512_16_neon: 14463.8 ( 3.33x) > rgb24toyv12_1920_4_neon: 13508.2 ( 3.34x) > rgb24toyv12_1920_4_negstride_neon: 13525.0 ( 3.34x) > > rgb24toyv12_16_200_neon: 5293.8 ( 3.66x) > rgb24toyv12_128_60_neon: 12955.0 ( 3.50x) > rgb24toyv12_512_16_neon: 13784.0 ( 3.50x) > rgb24toyv12_1920_4_neon: 12900.8 ( 3.49x) > rgb24toyv12_1920_4_negstride_neon: 12902.8 ( 3.49x) > > > A72: > rgb24toyv12_16_200_neon: 9695.8 ( 2.50x) > rgb24toyv12_128_60_neon: 20286.6 ( 2.70x) > rgb24toyv12_512_16_neon: 22276.6 ( 2.57x) > rgb24toyv12_1920_4_neon: 19154.1 ( 2.77x) > rgb24toyv12_1920_4_negstride_neon: 19055.1 ( 2.78x) > > rgb24toyv12_16_200_neon: 9214.8 ( 2.65x) > rgb24toyv12_128_60_neon: 20731.5 ( 2.65x) > rgb24toyv12_512_16_neon: 21145.0 ( 2.70x) > rgb24toyv12_1920_4_neon: 17586.5 ( 2.99x) > rgb24toyv12_1920_4_negstride_neon: 17571.0 ( 2.98x) > > > A53: > rgb24toyv12_16_200_neon: 12880.4 ( 3.76x) > rgb24toyv12_128_60_neon: 27776.3 ( 3.94x) > rgb24toyv12_512_16_neon: 29411.3 ( 3.94x) > rgb24toyv12_1920_4_neon: 27253.1 ( 3.98x) > rgb24toyv12_1920_4_negstride_neon: 27474.3 ( 3.95x) > > rgb24toyv12_16_200_neon: 12196.3 ( 3.95x) > rgb24toyv12_128_60_neon: 26943.1 ( 4.07x) > rgb24toyv12_512_16_neon: 28642.3 ( 4.07x) > rgb24toyv12_1920_4_neon: 26676.6 ( 4.08x) > rgb24toyv12_1920_4_negstride_neon: 26713.8 ( 4.07x) > > --- > libswscale/aarch64/rgb2rgb_neon.S | 24 +++++++++++------------- > 1 file changed, 11 insertions(+), 13 deletions(-) Thanks, applied. // Martin _______________________________________________ ffmpeg-devel mailing list ffmpeg-devel@ffmpeg.org https://ffmpeg.org/mailman/listinfo/ffmpeg-devel To unsubscribe, visit link above, or email ffmpeg-devel-request@ffmpeg.org with subject "unsubscribe".