From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from ffbox0-bg.mplayerhq.hu (ffbox0-bg.ffmpeg.org [79.124.17.100]) by master.gitmailbox.com (Postfix) with ESMTPS id 281644CC49 for ; Sat, 25 Jan 2025 15:51:12 +0000 (UTC) Received: from [127.0.1.1] (localhost [127.0.0.1]) by ffbox0-bg.mplayerhq.hu (Postfix) with ESMTP id 6636D68B818; Sat, 25 Jan 2025 17:51:09 +0200 (EET) Received: from mail-ej1-f44.google.com (mail-ej1-f44.google.com [209.85.218.44]) by ffbox0-bg.mplayerhq.hu (Postfix) with ESMTPS id 24F2168AE4B for ; Sat, 25 Jan 2025 17:51:03 +0200 (EET) Received: by mail-ej1-f44.google.com with SMTP id a640c23a62f3a-ab643063598so442586366b.2 for ; Sat, 25 Jan 2025 07:51:03 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20230601; t=1737820262; x=1738425062; darn=ffmpeg.org; h=to:subject:message-id:date:from:in-reply-to:references:mime-version :from:to:cc:subject:date:message-id:reply-to; bh=nIDmQnbnWH44dPjIBjfUE2qEwyuJxDOg14XXO7nHEhc=; b=OT+oscjy1p57HbbSNtFZkX5TP5s7qqZCRabAu0UQM0x+AZrHGq7SnUu32GaNtRqW4c lo/rcXMIoFPfMSZ3fyOEyHgppmBahTEOtbOHuuZsexEKOlFDtaeqyr/uxwb5kFbqd29g 4E8tgJBKfj9GKKafKbsYuyK2SXJUCGyokeEFiOXZ/MfK2XMndKMRKnvPWK2DYYOOS9ZU SsEkI13BsvpvfdEodKpcZE7c4EwpjEDC5ugv/pprxtQG3fNxQ0UmCDjlgJxQ3b0b/0GZ AKebT3GfnFSBRLwUc9ZOgohZb/XBcKOYHXosoe1CNwYgpr1l14YNKXTPf+8pmLp8LSe5 xD7Q== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1737820262; x=1738425062; h=to:subject:message-id:date:from:in-reply-to:references:mime-version :x-gm-message-state:from:to:cc:subject:date:message-id:reply-to; bh=nIDmQnbnWH44dPjIBjfUE2qEwyuJxDOg14XXO7nHEhc=; b=GX/Mv1lPYYN0Wu50++BaaBvjExc691O7ldPp6vjuZDc0YOMLV1ZIZkWOc5W1pBZQak 3G8tjiggvmMJ03x5I5kbgbrdpgpysbDPLqQj6lRruW+F+xnz5TACLiN7y20XEnY1gp7d YrHSM0nRyXmgYjqvIKJtUSaz9T20n6sjjpuMoAsGxaAjB6KbN6EzVLOLnHeQX6Z/4uJm nEwyNiBaxileFZnmuB/dpE3yHnVQ3r/DqQf+lvBmY4bvzszV7j0Am0MOdOJ5iR9lXhRT MYQDuWdLM+TVVYes2N2bGGoDvqsqvIC6n+9+1+9WQtmHMe+GJLcrN2+9AHPLDMeD5k5v CTWg== X-Gm-Message-State: AOJu0YwtbmJ4AzS24/TrHoSvVnswm+nch92sNU7oLzaxFJsLyYLVeAyJ H2v+HlKw31/hKaaJlgDCFk6ao3av1ZO5eQnY/AH9d3uQwWLYIGctfbD3XGg4WaLnECy1hHOcHX6 k0cysq4EqXfMdRVqBaXiPrZ7wf8dqTcoA X-Gm-Gg: ASbGnctZuUYAZFthR0Fen+yaJaedi6EmBc908Y7rAg+47fhTVaQV7l7gKNV98f6Q47D Z1qIZ4wdCqzErXqZKJ2p4Ja3ZsT9dUSd+ytk2W21BnCFkrz2oPJd0og6tPydrJA== X-Google-Smtp-Source: AGHT+IEWOFGNpmCrmyhHlYYCb+xp+7FadhQSDSyqtSN9qby6qtbDQBnk4twFKF22Dnx7oy/ZRKfJYVHsYBUQVNI8JfY= X-Received: by 2002:a05:6402:2105:b0:5db:f52c:8074 with SMTP id 4fb4d7f45d1cf-5dbf52c8de5mr35427584a12.28.1737820261499; Sat, 25 Jan 2025 07:51:01 -0800 (PST) MIME-Version: 1.0 References: <20250125142546.1244665-1-16567adigashreesh@gmail.com> <563cde8a-8fce-45fc-b126-7498e7d3862f@gmail.com> In-Reply-To: From: Shreesh Adiga <16567adigashreesh@gmail.com> Date: Sat, 25 Jan 2025 21:20:50 +0530 X-Gm-Features: AWEUYZnikewR9ZnbmjMKfqNd9fWQf8jKEjj_K4Izo30JG5kjuqMJmcBUlKeq-XM Message-ID: To: FFmpeg development discussions and patches X-Content-Filtered-By: Mailman/MimeDel 2.1.29 Subject: Re: [FFmpeg-devel] [PATCH] swscale/x86/rgb2rgb: add AVX512ICL versions of shuffle_bytes X-BeenThere: ffmpeg-devel@ffmpeg.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: FFmpeg development discussions and patches List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Reply-To: FFmpeg development discussions and patches Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Errors-To: ffmpeg-devel-bounces@ffmpeg.org Sender: "ffmpeg-devel" Archived-At: List-Archive: List-Post: > Try running it several times using the same seed, so > "tests/checkasm/checkasm --test=sw_rgb --bench 17575157", and make sure > no power saving feature is enabled (so the CPU frequency doesn't change > based on load). That may help getting consistent results. After running "echo performance | tee /sys/devices/system/cpu/cpu*/cpufreq/scaling_governor" and recompiling ffmpeg with "--enable-linux-perf", I am seeing the below numbers: shuffle_bytes_0321_c: 56.5 ( 1.00x) shuffle_bytes_0321_ssse3: 18.0 ( 3.14x) shuffle_bytes_0321_avx2: 10.0 ( 5.65x) shuffle_bytes_0321_avx512icl: 9.0 ( 6.28x) shuffle_bytes_1230_c: 84.5 ( 1.00x) shuffle_bytes_1230_ssse3: 18.2 ( 4.63x) shuffle_bytes_1230_avx2: 22.2 ( 3.80x) shuffle_bytes_1230_avx512icl: 10.0 ( 8.45x) shuffle_bytes_2103_c: 49.8 ( 1.00x) shuffle_bytes_2103_ssse3: 21.2 ( 2.34x) shuffle_bytes_2103_avx2: 17.5 ( 2.84x) shuffle_bytes_2103_avx512icl: 7.5 ( 6.63x) shuffle_bytes_3012_c: 84.5 ( 1.00x) shuffle_bytes_3012_ssse3: 17.0 ( 4.97x) shuffle_bytes_3012_avx2: 16.0 ( 5.28x) shuffle_bytes_3012_avx512icl: 16.2 ( 5.20x) shuffle_bytes_3210_c: 92.8 ( 1.00x) shuffle_bytes_3210_ssse3: 25.8 ( 3.60x) shuffle_bytes_3210_avx2: 14.0 ( 6.62x) shuffle_bytes_3210_avx512icl: 9.0 (10.31x) Thanks, Shreesh _______________________________________________ ffmpeg-devel mailing list ffmpeg-devel@ffmpeg.org https://ffmpeg.org/mailman/listinfo/ffmpeg-devel To unsubscribe, visit link above, or email ffmpeg-devel-request@ffmpeg.org with subject "unsubscribe".