From mboxrd@z Thu Jan  1 00:00:00 1970
Return-Path: <ffmpeg-devel-bounces@ffmpeg.org>
Received: from ffbox0-bg.mplayerhq.hu (ffbox0-bg.ffmpeg.org [79.124.17.100])
	by master.gitmailbox.com (Postfix) with ESMTPS id BDB654D407
	for <ffmpegdev@gitmailbox.com>; Fri, 18 Apr 2025 01:11:25 +0000 (UTC)
Received: from [127.0.1.1] (localhost [127.0.0.1])
	by ffbox0-bg.mplayerhq.hu (Postfix) with ESMTP id B04B5687DD5;
	Fri, 18 Apr 2025 04:11:20 +0300 (EEST)
Received: from vidala.pars.ee (vidala.pars.ee [116.203.72.101])
 by ffbox0-bg.mplayerhq.hu (Postfix) with ESMTPS id 7FDE5687CEA
 for <ffmpeg-devel@ffmpeg.org>; Fri, 18 Apr 2025 04:11:14 +0300 (EEST)
DKIM-Signature: v=1; a=rsa-sha256; s=202405r; d=lynne.ee; c=relaxed/relaxed;
 h=From:To:Subject:Date:Message-ID; t=1744938673; bh=lMoAxJ+lAWxDPXmJZqiMwrV
 rinBTEGbZ/eRTBem4R4M=; b=oj7nLbPd09U86h9VitQhyjr6orGsKYvewyDgzwqKzDR5tCFYyp
 J42djbKJOjqkj4ej4rU4FFvlMusZpVsJcjJnIDbfHBMyl5NDyC0K+v4Ral3q3Sxswgegnvopxga
 1u0bGtO6Dywl9vpsUmqoQQ/shgN1fqTsh0HaF+k3ITISFCplW1f+fPiESKIKKmvsG4LGZGlDNZ5
 GeFtWOO8c8e1iX4pjC3x4s9vFncnFK+/DcaoNyloX6TFVX8rCjdfR6+7S3AxiUvUSwGDm84igjA
 2lXRuo1Oo4TRQx7+RRumsJcvkSw7pfTCtz82TgvLojhZ0ZnzlDPpK6igWE4u3mSmiiw==;
DKIM-Signature: v=1; a=ed25519-sha256; s=202405e; d=lynne.ee; c=relaxed/relaxed;
 h=From:To:Subject:Date:Message-ID; t=1744938673; bh=lMoAxJ+lAWxDPXmJZqiMwrV
 rinBTEGbZ/eRTBem4R4M=; b=gLQU75vMLaSbCBHQiPQbfuuWpgF5N0NhoKBkjZBS0xhjR9Dzqd
 UkUuzR+NWCRmRcDuO7O3OlrOFtSJj9j+0xDA==;
Message-ID: <ecffacc5-958d-47f2-939d-7822fc7c51db@lynne.ee>
Date: Fri, 18 Apr 2025 03:11:13 +0200
MIME-Version: 1.0
User-Agent: Mozilla Thunderbird
To: ffmpeg-devel@ffmpeg.org
References: <20250417235543.227108-1-47210458+raphaelthegreat@users.noreply.github.com>
 <20250417235543.227108-5-47210458+raphaelthegreat@users.noreply.github.com>
Content-Language: en-US
From: Lynne <dev@lynne.ee>
In-Reply-To: <20250417235543.227108-5-47210458+raphaelthegreat@users.noreply.github.com>
Subject: Re: [FFmpeg-devel] [PATCH v3 5/5] lavc: implement a Vulkan-based
 VC-2 encoder Implements a Vulkan based dirac encoder. Supports Haar and
 Legall wavelets and should work with all wavelet depths.
X-BeenThere: ffmpeg-devel@ffmpeg.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: FFmpeg development discussions and patches <ffmpeg-devel.ffmpeg.org>
List-Unsubscribe: <https://ffmpeg.org/mailman/options/ffmpeg-devel>,
 <mailto:ffmpeg-devel-request@ffmpeg.org?subject=unsubscribe>
List-Archive: <https://ffmpeg.org/pipermail/ffmpeg-devel>
List-Post: <mailto:ffmpeg-devel@ffmpeg.org>
List-Help: <mailto:ffmpeg-devel-request@ffmpeg.org?subject=help>
List-Subscribe: <https://ffmpeg.org/mailman/listinfo/ffmpeg-devel>,
 <mailto:ffmpeg-devel-request@ffmpeg.org?subject=subscribe>
Reply-To: FFmpeg development discussions and patches <ffmpeg-devel@ffmpeg.org>
Content-Transfer-Encoding: 7bit
Content-Type: text/plain; charset="us-ascii"; Format="flowed"
Errors-To: ffmpeg-devel-bounces@ffmpeg.org
Sender: "ffmpeg-devel" <ffmpeg-devel-bounces@ffmpeg.org>
Archived-At: <https://master.gitmailbox.com/ffmpegdev/ecffacc5-958d-47f2-939d-7822fc7c51db@lynne.ee/>
List-Archive: <https://master.gitmailbox.com/ffmpegdev/>
List-Post: <mailto:ffmpegdev@gitmailbox.com>

On 18/04/2025 01:55, IndecisiveTurtle wrote:
> From: IndecisiveTurtle <geoster3d@gmail.com>
> 
> Performance wise, encoding a 1080p 1-minute video is performed in about 2.5 minutes with the cpu encoder running on my Ryzen 5 4600H, while it takes about 30 seconds on my NVIDIA GTX 1650
> 
> Haar shader has a subgroup optimized variant that applies when configured wavelet depth allows it
> ---
>   configure                                    |   1 +
>   libavcodec/Makefile                          |   3 +
>   libavcodec/allcodecs.c                       |   1 +
>   libavcodec/vc2enc_vulkan.c                   | 959 +++++++++++++++++++
>   libavcodec/vulkan/vc2_dwt_haar.comp          |  82 ++
>   libavcodec/vulkan/vc2_dwt_haar_subgroup.comp |  75 ++
>   libavcodec/vulkan/vc2_dwt_hor_legall.comp    |  82 ++
>   libavcodec/vulkan/vc2_dwt_upload.comp        |  96 ++
>   libavcodec/vulkan/vc2_dwt_ver_legall.comp    |  78 ++
>   libavcodec/vulkan/vc2_encode.comp            | 169 ++++
>   libavcodec/vulkan/vc2_slice_sizes.comp       | 170 ++++
>   11 files changed, 1716 insertions(+)
>   create mode 100644 libavcodec/vc2enc_vulkan.c
>   create mode 100644 libavcodec/vulkan/vc2_dwt_haar.comp
>   create mode 100644 libavcodec/vulkan/vc2_dwt_haar_subgroup.comp
>   create mode 100644 libavcodec/vulkan/vc2_dwt_hor_legall.comp
>   create mode 100644 libavcodec/vulkan/vc2_dwt_upload.comp
>   create mode 100644 libavcodec/vulkan/vc2_dwt_ver_legall.comp
>   create mode 100644 libavcodec/vulkan/vc2_encode.comp
>   create mode 100644 libavcodec/vulkan/vc2_slice_sizes.comp

LGTM.
Planning to push this tomorrow unless there are objections.
_______________________________________________
ffmpeg-devel mailing list
ffmpeg-devel@ffmpeg.org
https://ffmpeg.org/mailman/listinfo/ffmpeg-devel

To unsubscribe, visit link above, or email
ffmpeg-devel-request@ffmpeg.org with subject "unsubscribe".