From mboxrd@z Thu Jan  1 00:00:00 1970
Return-Path: <ffmpeg-devel-bounces@ffmpeg.org>
Received: from ffbox0-bg.mplayerhq.hu (ffbox0-bg.ffmpeg.org [79.124.17.100])
	by master.gitmailbox.com (Postfix) with ESMTPS id 805854D6C0
	for <ffmpegdev@gitmailbox.com>; Fri, 18 Apr 2025 09:33:10 +0000 (UTC)
Received: from [127.0.1.1] (localhost [127.0.0.1])
	by ffbox0-bg.mplayerhq.hu (Postfix) with ESMTP id B468568BB3A;
	Fri, 18 Apr 2025 12:33:06 +0300 (EEST)
Received: from EUR03-VI1-obe.outbound.protection.outlook.com
 (mail-vi1eur03olkn2040.outbound.protection.outlook.com [40.92.57.40])
 by ffbox0-bg.mplayerhq.hu (Postfix) with ESMTPS id C81A868BABF
 for <ffmpeg-devel@ffmpeg.org>; Fri, 18 Apr 2025 12:32:59 +0300 (EEST)
ARC-Seal: i=1; a=rsa-sha256; s=arcselector10001; d=microsoft.com; cv=none;
 b=Vav/tqKbq6graMD9E4rZxRlqlo4tZezVnWrxclijy7+kcUfAHjbZ1vtgQdyFxnGYgofQuANbQOkoIPRyZWPnTzPDT+h3B41XMw7WlIndZC4ngIM0IJ6+iN+CNc3Ol6J5Y/jnTcLmUgAAWQPVKlmXgXK8mZHMNWhPMvCGWoqTnxKVphSELL7tTs4QiElLkisH2DqOCOEzQd9XgHjffenxVXJyeYGjcX/KZQmQyR7xwlPhDQdcei+NDJP7PgxM03gnUnWbvEgLiqaM0D/4puXQGnkh+bOVmka6nD+gOvevWC4dF1xmAxNYiowVob0oyVeJML+Ax0mHPf6CxCwuoOKbbQ==
ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; 
 s=arcselector10001;
 h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-AntiSpam-MessageData-ChunkCount:X-MS-Exchange-AntiSpam-MessageData-0:X-MS-Exchange-AntiSpam-MessageData-1;
 bh=81IsYwti8ZPVdJQd1h9s44bSrBqS9I5gajRbhHJctj4=;
 b=gX6WBygb2tlQ7CsSBnK7Y5O1SzGG3HgLFJmaxfDgxUS5mmLBRl+xexqmbJEbkzGniB4ynR8PTjnKvX4OG4XKEsGWKM/ISJdYUOdBnuPo0jy3E0/pweOR506MsFJozYiprFX8tZSYYZWHjToW96K4hMZfxUl6ONtYQ6J12DscPgvOiUzHznX6nniuc79cdUXjRyVQkX/AiofOdrPkv7bdD09Edz8Uq2/N+8Cm5P7gjnulOAXTeHmMXuDPdyt9bNbl5xLZXK+9dEeq+R4JBemlcBaM9QuS0YdL1e8sQLFibKrQ9UfIRtHNtRNAZJjYHe2prI9f2+Wfdh68aYs/mrrhCQ==
ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=none; dmarc=none;
 dkim=none; arc=none
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=outlook.com;
 s=selector1;
 h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck;
 bh=81IsYwti8ZPVdJQd1h9s44bSrBqS9I5gajRbhHJctj4=;
 b=K4+6jYQUo5mK2scHDhtdwInI80h5nnfrESBzqW3QhpjE/lK637Ohm0/cxRc/6HrQJFtuYdu+6k3KgY0ED02t9OvIFUYgQa8wQvd6Ph9Rs/mU1RiCLnXqCbIa+YMGSuhNMr1BNxFQYj7vKKqI7wXKM4LUfH3yEu42xRQEc90iRFU+f3BLqb3ebG46Zn2xvFQVv606QpqlRorwHGqqhhnCDz8cf6AzjsSMwNQoES1Zev37fV3JShBzRO8BcmLhpE4kTf6gGfFlcGQUWTuw/P4GQfsfg6ly9b5n21JuXVJ7HJX5GBT0XG64eTa8KpJlRM2elUJTVeZeHw/1WhAHBRWy4Q==
Received: from GV1P250MB0737.EURP250.PROD.OUTLOOK.COM (2603:10a6:150:8e::17)
 by DB4P250MB1029.EURP250.PROD.OUTLOOK.COM (2603:10a6:10:3cf::5) with
 Microsoft SMTP Server (version=TLS1_2,
 cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.8655.18; Fri, 18 Apr
 2025 09:32:58 +0000
Received: from GV1P250MB0737.EURP250.PROD.OUTLOOK.COM
 ([fe80::d6a1:e3af:a5f1:b614]) by GV1P250MB0737.EURP250.PROD.OUTLOOK.COM
 ([fe80::d6a1:e3af:a5f1:b614%5]) with mapi id 15.20.8655.012; Fri, 18 Apr 2025
 09:32:57 +0000
Message-ID: <GV1P250MB0737969C52577FF08254F7F58FBF2@GV1P250MB0737.EURP250.PROD.OUTLOOK.COM>
Date: Fri, 18 Apr 2025 11:32:54 +0200
User-Agent: Mozilla Thunderbird
To: ffmpeg-devel@ffmpeg.org
References: <20250417235543.227108-1-47210458+raphaelthegreat@users.noreply.github.com>
 <20250417235543.227108-5-47210458+raphaelthegreat@users.noreply.github.com>
Content-Language: en-US
From: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>
In-Reply-To: <20250417235543.227108-5-47210458+raphaelthegreat@users.noreply.github.com>
X-ClientProxiedBy: FR2P281CA0055.DEUP281.PROD.OUTLOOK.COM
 (2603:10a6:d10:93::14) To GV1P250MB0737.EURP250.PROD.OUTLOOK.COM
 (2603:10a6:150:8e::17)
X-Microsoft-Original-Message-ID: <c8f52ce5-d4f1-4059-9436-e5ad021f91c4@outlook.com>
MIME-Version: 1.0
X-MS-Exchange-MessageSentRepresentingType: 1
X-MS-PublicTrafficType: Email
X-MS-TrafficTypeDiagnostic: GV1P250MB0737:EE_|DB4P250MB1029:EE_
X-MS-Office365-Filtering-Correlation-Id: b102ba3b-78ac-4d69-cb5b-08dd7e5c00bf
X-Microsoft-Antispam: BCL:0;
 ARA:14566002|7092599003|5072599009|15080799006|6090799003|19110799003|461199028|8060799006|440099028|3412199025|41001999003;
X-Microsoft-Antispam-Message-Info: =?utf-8?B?TnZzMkg4dVJwQktJTnBmbUlrR3lrZVNxRW13SGxrQVQzYXBORittWmx2Qm82?=
 =?utf-8?B?S0ltaUJiNmxqMU5BY1dyQVVKMHQ2WXp6bHM2V2RZZlV6K3BocnhSeDVFVld0?=
 =?utf-8?B?MjYyTXhKbDJjYnU3c2g2ZmVkaHRJYkVLOXBsaWdhN3phc09ZUFlTalljSnZV?=
 =?utf-8?B?d0xURldOY1Q2V2hOcHhlQjZBaEVQdTFPYVhWM09mTVlic25WVXFaalhVSXE5?=
 =?utf-8?B?bFE0cVF0MkxDVnJ6UloyUXJseG55eWQvZFJPOW94dlFCUjh3WWVBbHhFbWRC?=
 =?utf-8?B?SUl6N0VYRDRMUDVOdXRlK1I3NEIrWldmMUxmNHdUczdOcFF5VjFrU3o1dnZY?=
 =?utf-8?B?YzZjaEdQK0lIc3BsbGFhN0NiWTVBMXUxVjI2UU1uaDlYOGxqSzNaYmNVZ0U0?=
 =?utf-8?B?WUk4MkxxVUVXbUR5SzYxOTlqS0RTZjNrc0lJYWZTWldHYzAyb09RV2l1eDhi?=
 =?utf-8?B?NzhyUUdYd1dYRlZWcFFKWTBFVTNNUTltclQ1VEJXUkxXOXAyWXFucjh6d3Ri?=
 =?utf-8?B?LzJTZXZPNmtoYUpUSlNiUVdZUFJaV1FFZnJGOGEzd1hjdVNhMnJ0bThrVGNL?=
 =?utf-8?B?Vys5Y2tkU2NORDF3ZzA4azkvR1h0bkJzL0l1KzlsRGFaUUJmOExOdkN0bnpQ?=
 =?utf-8?B?eHlDOFZZdmI0eUltWk9mRkMwRzhQc1JobGxxeEZuMmtEcllBWkpOVXZaZmFT?=
 =?utf-8?B?YXBPa2dTMnlETDRVS0I2bmtmV05DNm9pZktra0dkS3ZhdXU2ZVVtMnlpaEhz?=
 =?utf-8?B?VUR6VTdsb0tIWEZXOWlzd3c4R1hHNHBqVnhPTUFZSDZvZ0o3T3Y0T2haTnh1?=
 =?utf-8?B?bDFhSG1Qd3orUCtHYml6NnNTTTUxcmdSOVV1TzFvbVNwdGRoR3FqekpsSm11?=
 =?utf-8?B?RUh2RGgxU01iSFdRTGliK1Z2TkhBTDZJeWQzL1p1TFlZM29wZm1DT2lBMGx1?=
 =?utf-8?B?R3QrcVkxQy9EUXUyYzZEUjFNY0xJTXg2bUw1Zkw5aXcvNUsrSXdVL1YzTWpO?=
 =?utf-8?B?NG9HYnhFZkxzaGdoY3REcUtlNVlhYlZCK3ViWk94S3dpK1F1TkRHOHo4UENr?=
 =?utf-8?B?czdmblJMT0xBUFFkYnNHSS8rNm51dDJJK3RqZnVZSmNPei9SaDZhenBDZ3Jr?=
 =?utf-8?B?cUo0K2dDcnBZUGFlbWpsS0xHU3dWeW45eldodFcvOUtIcXRBTjFvV3hOU2JD?=
 =?utf-8?B?a3dSR2o5K2o5NW5vQWltemh5VTliaHQ1c3Z2bTl1c1lleFhib2xNOFBTaVli?=
 =?utf-8?B?aTBnNlpqTEVCMDhtTHNZM21FaHFsaE1JR0o3RHVjdUU0ZHpnMnRWRmJpSVR0?=
 =?utf-8?B?NDZTRHJJanhsakFCYjFDVHFXQmdZajVLZVhvZXZ2OWFLamI2Si8xYVgzMk4r?=
 =?utf-8?B?cTNUR1RPNGFVaVFwYlBsc0lsZzQyKzk4d29IRzRvR2xPenFIaVE4M281bnNm?=
 =?utf-8?B?OU5DWXA4Mk4vdys0WEdEdlM4SU9Sc2FaMHBTcmh3WGN4TmpRVUlBSUdnVitI?=
 =?utf-8?B?NjhaUlV3MSsrOHBsZUxBcXR6eGZJOWFSMVVRczk2M3UreGR5Tno0YzY3cEJ3?=
 =?utf-8?Q?elw+gVZEwXZ6KHuUq5lWIGWmk=3D?=
X-MS-Exchange-AntiSpam-MessageData-ChunkCount: 1
X-MS-Exchange-AntiSpam-MessageData-0: =?utf-8?B?QU1qVEt1aG5MUDlhOTFpS2JlZkFTTXgzbXlSR0YxRllDUUVsTFgrdTdpRDNo?=
 =?utf-8?B?VUFvQnlaQVNpY294SStQaWlSNVlOOU1ZV2Z0eGRtTkFNaUp1WktHeE1sU2dZ?=
 =?utf-8?B?dWRmWnk4WW5Ia1FJV0ZqcWtiRk90S2hETjJ6TFNMMmdDMlNoMXJIbXA0RFZE?=
 =?utf-8?B?M05tRWg5ZmJ6WjA0QWdGLzYxSTM1b0Jla2dsOFQ1Q1F6Q2RlU2pxT2t5UXpE?=
 =?utf-8?B?cFpmaThJajhBSGpLblgxR29OTlBWMzFpNkpwZmpDaXorcE84TDV5dVpCUkRk?=
 =?utf-8?B?WThtVmY2UHRUZEtZekNDOGdPV0pKS2JDSVpOZUl4a0VBZTE4eTQrYVJFZ1NU?=
 =?utf-8?B?NXovdmNtZ2VOUW9MSllZcGRaWExpaEJLNkMvWmM1U1VyR0ZzQVFXc295emQ1?=
 =?utf-8?B?ZE85bVNPMzhHQ1hEZWtPcFJyMkg2L0hGWXpQaEt6REtxNTR6RkNZWVVmNXRF?=
 =?utf-8?B?b0xLeEJ3bHRKOHA5MmVhSkdpRFlSVjFkOEp4UmovN3krTVZrbWQxWEQ2MDVz?=
 =?utf-8?B?elRUNkV6akdKQm5xVWp4UXByWFpNZUR3d1FqcmRaUXVUOTh3K0pBWUZYMkJp?=
 =?utf-8?B?RTRVL2dGL2Z0UGw5YlEwOWpjR1ZwZXU2c09ScGE0cHByZmpvOHZLaDFmZ2k4?=
 =?utf-8?B?N2tlNHR2RE5JVEZVK1JncGNadUdSb1ZlSW50RUdZVVNkeU8zYU1BS0tGNlRr?=
 =?utf-8?B?bnh3ZmNtaFoyQUdHNkVPYUtyK3F1L3diOUlpNk16d0l6V01nOUxJU21uVkR3?=
 =?utf-8?B?Sy9iOGxyd1N0UGozanhBdDNlUGZENDJZT3I1Y3hqRFVBc0hEWUM0aVdVTlJ5?=
 =?utf-8?B?RXJXNVY3OWk3RGJFdEFoWUxZWFRPaVJPV2xYdm9jNzVvQVVjZkZ6QmNLWE9P?=
 =?utf-8?B?RjV3K3VaMCtnT2tqMDRid0xvOC9DWnBKOHM1QkgwQmZDejk1cm9HbUVSL0U1?=
 =?utf-8?B?dGtDZjloNnVmUUdTOVdQampNU1did1JqT1RPMytlbjZFckJLeFd4TjdVdENw?=
 =?utf-8?B?WVdxTnRhdldwL1JGbUFqZzdPVmFkRHEwUjg0cE5mYXNjUko4SGt6bnp2V3hO?=
 =?utf-8?B?eCtaYlFIOU55eEJRTnFoU2JvR0JGbG1aWTduVzNMemx3THVMN2dQaUR2dytM?=
 =?utf-8?B?Znh3M2F3RVNaQU1iSmNUdzI0L1picmZMeVJ1ZUhRTDQ3amFXWTZlSFRGYUYw?=
 =?utf-8?B?S28vdmQvVk9BK3FXZXJCWWpVWXZiVG5tbGtHaENsTUJucGlSWUxreVF4NTJ0?=
 =?utf-8?B?YzFjTy9WdG95MWtld2V2OEVDc0ttZ05YTzcrSXFlbGpWdnBhZWJOUmR0K3hC?=
 =?utf-8?B?UkxvblRUOWJYZUtmMURYV2xsZkUrcGhTN3ZkUXZYNlAyOTNXQ2V1U0ljTkJw?=
 =?utf-8?B?anIwbnIzNm14VFowVWovY2JSaUw5eS9jZlN6ZEs1cE5SUGdxZ2hneWZpSzhy?=
 =?utf-8?B?MXZTc3ZjWmN6a0swZkdyNDdYTVpacUhDeE5Cc28zdmNtQldIUlJxUURaWnli?=
 =?utf-8?B?eS9UcWRWbWx4Vmd0N1E5VTkxMzFTazh0NUNRRlMxMytLeWNEOFIySTNwNDBO?=
 =?utf-8?B?WnkvS1NRa2pWY1VDU2l4VUFaVWxFbC9CaE9BZXFvMmJCeEMyUjhrY0lvdnJK?=
 =?utf-8?B?ZWpGbk1pMlNPMnJpQVdBR3N0Q0t3ZlZ1LzdMamRzOEsvQmNlaW84Ny9tZWo5?=
 =?utf-8?B?TlQrb3RrNGFaUUtiL0tYQUI2NDQvTmVXeFpsWFd1Nnh1SlphRmZiU3M1ank4?=
 =?utf-8?B?OHZid0lwTmsyY0NmcVBMVHlIYzlVRmZvaG04ZC83ZDJvUm1UWGhDT3M1NTZR?=
 =?utf-8?B?Q0diSXpVcTRzZ3BxVitsUT09?=
X-OriginatorOrg: outlook.com
X-MS-Exchange-CrossTenant-Network-Message-Id: b102ba3b-78ac-4d69-cb5b-08dd7e5c00bf
X-MS-Exchange-CrossTenant-AuthSource: GV1P250MB0737.EURP250.PROD.OUTLOOK.COM
X-MS-Exchange-CrossTenant-AuthAs: Internal
X-MS-Exchange-CrossTenant-OriginalArrivalTime: 18 Apr 2025 09:32:57.5056 (UTC)
X-MS-Exchange-CrossTenant-FromEntityHeader: Hosted
X-MS-Exchange-CrossTenant-Id: 84df9e7f-e9f6-40af-b435-aaaaaaaaaaaa
X-MS-Exchange-CrossTenant-RMS-PersistedConsumerOrg: 00000000-0000-0000-0000-000000000000
X-MS-Exchange-Transport-CrossTenantHeadersStamped: DB4P250MB1029
Subject: Re: [FFmpeg-devel] [PATCH v3 5/5] lavc: implement a Vulkan-based
 VC-2 encoder Implements a Vulkan based dirac encoder. Supports Haar and
 Legall wavelets and should work with all wavelet depths.
X-BeenThere: ffmpeg-devel@ffmpeg.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: FFmpeg development discussions and patches <ffmpeg-devel.ffmpeg.org>
List-Unsubscribe: <https://ffmpeg.org/mailman/options/ffmpeg-devel>,
 <mailto:ffmpeg-devel-request@ffmpeg.org?subject=unsubscribe>
List-Archive: <https://ffmpeg.org/pipermail/ffmpeg-devel>
List-Post: <mailto:ffmpeg-devel@ffmpeg.org>
List-Help: <mailto:ffmpeg-devel-request@ffmpeg.org?subject=help>
List-Subscribe: <https://ffmpeg.org/mailman/listinfo/ffmpeg-devel>,
 <mailto:ffmpeg-devel-request@ffmpeg.org?subject=subscribe>
Reply-To: FFmpeg development discussions and patches <ffmpeg-devel@ffmpeg.org>
Content-Type: text/plain; charset="us-ascii"
Content-Transfer-Encoding: 7bit
Errors-To: ffmpeg-devel-bounces@ffmpeg.org
Sender: "ffmpeg-devel" <ffmpeg-devel-bounces@ffmpeg.org>
Archived-At: <https://master.gitmailbox.com/ffmpegdev/GV1P250MB0737969C52577FF08254F7F58FBF2@GV1P250MB0737.EURP250.PROD.OUTLOOK.COM/>
List-Archive: <https://master.gitmailbox.com/ffmpegdev/>
List-Post: <mailto:ffmpegdev@gitmailbox.com>

IndecisiveTurtle:
> From: IndecisiveTurtle <geoster3d@gmail.com>
> 
> Performance wise, encoding a 1080p 1-minute video is performed in about 2.5 minutes with the cpu encoder running on my Ryzen 5 4600H, while it takes about 30 seconds on my NVIDIA GTX 1650
> 
> Haar shader has a subgroup optimized variant that applies when configured wavelet depth allows it
> ---



> +
> +void put_vc2_ue_uint(inout PutBitContext pb, uint val)
> +{
> +    int pbits = 0, topbit = 1, maxval = 1, bits = 0;
> +    if (val == 0)
> +    {
> +        put_bits(pb, 1, 1);
> +        return;
> +    }
> +    val++;
> +
> +    while (val > maxval)
> +    {
> +        topbit <<= 1;
> +        bits++;
> +        maxval <<= 1;
> +        maxval |=  1;
> +    }
> +
> +    for (int i = 0; i < bits; i++)
> +    {
> +        topbit >>= 1;
> +        pbits <<= 2;
> +        if ((val & topbit) != 0)
> +            pbits |= 1;
> +    }
> +
> +    put_bits(pb, bits * 2 + 1, (pbits << 1) | 1);
> +}
> +

You are still using the old and inefficient way to write VC-2
exponential coded integers. Improving this gave a nice speed boost to
the software encoder.

- Andreas

_______________________________________________
ffmpeg-devel mailing list
ffmpeg-devel@ffmpeg.org
https://ffmpeg.org/mailman/listinfo/ffmpeg-devel

To unsubscribe, visit link above, or email
ffmpeg-devel-request@ffmpeg.org with subject "unsubscribe".