From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from ffbox0-bg.mplayerhq.hu (ffbox0-bg.ffmpeg.org [79.124.17.100]) by master.gitmailbox.com (Postfix) with ESMTP id 7B93C4B3B5 for ; Tue, 4 Jun 2024 19:37:41 +0000 (UTC) Received: from [127.0.1.1] (localhost [127.0.0.1]) by ffbox0-bg.mplayerhq.hu (Postfix) with ESMTP id EC7D768D6F8; Tue, 4 Jun 2024 22:37:38 +0300 (EEST) Received: from mail-pj1-f47.google.com (mail-pj1-f47.google.com [209.85.216.47]) by ffbox0-bg.mplayerhq.hu (Postfix) with ESMTPS id 82BA468D3A1 for ; Tue, 4 Jun 2024 22:37:32 +0300 (EEST) Received: by mail-pj1-f47.google.com with SMTP id 98e67ed59e1d1-2c2070e1579so2402379a91.3 for ; Tue, 04 Jun 2024 12:37:32 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20230601; t=1717529849; x=1718134649; darn=ffmpeg.org; h=content-transfer-encoding:in-reply-to:from:content-language :references:to:subject:user-agent:mime-version:date:message-id:from :to:cc:subject:date:message-id:reply-to; bh=/2PqpnQCPy++W1hItGSoCcso/kEDnL04iAlqe0utbR8=; b=QIPXHXlH3Z8l2PfmW2oB4SKHYz20jiyCQGw7yrAHv86TUPzB4HZqaMMpYNyOaw8GEv QI0fdEWinavWwfPvYvj6eAlLsKaVVshLO3I86Ks+YUSlLaeWkG2Mh4uSuNA6Uc3IXxVx pHBbRIrluXlSmsTrQQ3BESIXJ6CRryPEUgDW3mJbAVEuAf5NGNzp1OjX7czJewFYxyyL T1P1YppKeU6DxZgCwCuyjBx05BDMHqUJe1qPRadPetgAZCQ82aPKOY7V2JbIEUvQSy37 h3pmtJ9kKDM0tYSwtbghtrSsyaFIAgkKhb4/UH/Yeh7ayc2GEPzPEHISYUZGEtXSldNv jVDg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1717529849; x=1718134649; h=content-transfer-encoding:in-reply-to:from:content-language :references:to:subject:user-agent:mime-version:date:message-id :x-gm-message-state:from:to:cc:subject:date:message-id:reply-to; bh=/2PqpnQCPy++W1hItGSoCcso/kEDnL04iAlqe0utbR8=; b=fzUzxLtOC5eLi/rygIyQh1XWToTuPNvZ+Dbv1TDvoTl7cy+Fsy0t1JSEq+xDWMMCZu F5gD5A2r21KXq4iHSm2S5eEejHTDfNBa+n/1fqg7sCPTeBmlFYLRVi3HW7sSAb2B+Ynp arcevXBqTmlpLzxW+jqKILA8a1Fu3nDc7f4tpzHkT+F/4rJRfHHFcdhF25Uz8xUcflkT xcYDUa/mzIPcdKsptYk+TNXyxGxdsCSdxF2dRxGdIBCteyJpghxXgBoynj1ICe9veY+v VRbFHFwHJ/D2xoIzrqZiPJTUHtaxixK2Z6V9jqCQLOsSSvIZtWRGuGXwqtdk3fyyV43W bU3A== X-Gm-Message-State: AOJu0YzvaM8iG7huNnd4Dofc4OWo6FgwvOJquO6Hctq34i0gormoOyiT 3I2yeSi9hrhFHgPAkpVRPmMxKgckZNQRfRpTkjSN5bhlkxxcFDJb8pn4Fg== X-Google-Smtp-Source: AGHT+IERhsu80hkX/mAKLVWloRGmeBe0Q0Lwyy2ZApY8uwnwTX7R1DiKwrm03fZoSHpZRER+vV3hJg== X-Received: by 2002:a17:90a:c688:b0:2c2:53f9:56fd with SMTP id 98e67ed59e1d1-2c27db16364mr403717a91.16.1717529849142; Tue, 04 Jun 2024 12:37:29 -0700 (PDT) Received: from [192.168.0.16] ([190.194.167.233]) by smtp.gmail.com with ESMTPSA id 98e67ed59e1d1-2c254a78f24sm1216895a91.0.2024.06.04.12.37.28 for (version=TLS1_3 cipher=TLS_AES_128_GCM_SHA256 bits=128/128); Tue, 04 Jun 2024 12:37:28 -0700 (PDT) Message-ID: <876497f3-da0d-49b2-b307-717a2d21d5a2@gmail.com> Date: Tue, 4 Jun 2024 16:37:36 -0300 MIME-Version: 1.0 User-Agent: Mozilla Thunderbird To: ffmpeg-devel@ffmpeg.org References: <20240604190742.1742-1-jamrial@gmail.com> <7245c66d-e607-4093-9868-35132ba6a2f1@gmail.com> Content-Language: en-US From: James Almer In-Reply-To: Subject: Re: [FFmpeg-devel] [PATCH] swscale/x86/input: add AVX2 optimized RGB24 to YUV functions X-BeenThere: ffmpeg-devel@ffmpeg.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: FFmpeg development discussions and patches List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Reply-To: FFmpeg development discussions and patches Content-Transfer-Encoding: 7bit Content-Type: text/plain; charset="us-ascii"; Format="flowed" Errors-To: ffmpeg-devel-bounces@ffmpeg.org Sender: "ffmpeg-devel" Archived-At: List-Archive: List-Post: On 6/4/2024 4:32 PM, Andreas Rheinhardt wrote: > James Almer: >> On 6/4/2024 4:25 PM, Andreas Rheinhardt wrote: >>> James Almer: >>>> rgb24_to_uv_8_c: 39.3 >>>> rgb24_to_uv_8_sse2: 14.3 >>>> rgb24_to_uv_8_ssse3: 13.3 >>>> rgb24_to_uv_8_avx: 12.8 >>>> rgb24_to_uv_8_avx2: 14.3 >>> >>> Worse than avx and ssse3 >> >> Can't be disabled for small sizes. >> > > Why can't this be disabled? The function pointer is for all sizes. And adding a check + jmp in the assembly is not going to be better. Also, this being for 8 pixel wide buffers, it's not going to be anyone's bottleneck. > >>> >>>> rgb24_to_uv_128_c: 582.8 >>>> rgb24_to_uv_128_sse2: 127.3 >>>> rgb24_to_uv_128_ssse3: 107.3 >>>> rgb24_to_uv_128_avx: 111.3 >>> >>> Worse than ssse3 >> >> I know, but it's not related to this patch. >> Same for all the comments below. > > _______________________________________________ > ffmpeg-devel mailing list > ffmpeg-devel@ffmpeg.org > https://ffmpeg.org/mailman/listinfo/ffmpeg-devel > > To unsubscribe, visit link above, or email > ffmpeg-devel-request@ffmpeg.org with subject "unsubscribe". _______________________________________________ ffmpeg-devel mailing list ffmpeg-devel@ffmpeg.org https://ffmpeg.org/mailman/listinfo/ffmpeg-devel To unsubscribe, visit link above, or email ffmpeg-devel-request@ffmpeg.org with subject "unsubscribe".