From: Lynne
To: FFmpeg development discussions and patches
Date: Wed, 11 Oct 2023 05:34:57 +0200 (CEST)
Subject: Re: [FFmpeg-devel] [PATCH 3/3] nlmeans_vulkan: parallelize workgroup invocations

Oct 7, 2023, 17:08 by dev@lynne.ee:

> Removes the clever subgroup parallel prefix computation,
> and instead just computes the prefix inline.
> Cuts down the number of dispatches by a huge amount.
>
> Provides a ~12x speedup (2.5fps to 30fps on a 7900XTX,
> 2.1fps to 24fps on an Ada).
>
> Patch attached.
>

Going to push the patchset a bit later today.