From mboxrd@z Thu Jan 1 00:00:00 1970
Message-ID: <GV1P250MB0737969C52577FF08254F7F58FBF2@GV1P250MB0737.EURP250.PROD.OUTLOOK.COM>
Date: Fri, 18 Apr 2025 11:32:54 +0200
User-Agent: Mozilla Thunderbird
To: ffmpeg-devel@ffmpeg.org
References: <20250417235543.227108-1-47210458+raphaelthegreat@users.noreply.github.com>
 <20250417235543.227108-5-47210458+raphaelthegreat@users.noreply.github.com>
Content-Language: en-US
From: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>
In-Reply-To: <20250417235543.227108-5-47210458+raphaelthegreat@users.noreply.github.com>
MIME-Version: 1.0
Subject: Re: [FFmpeg-devel] [PATCH v3 5/5] lavc: implement a Vulkan-based VC-2 encoder
 Implements a Vulkan based dirac encoder. Supports Haar and Legall wavelets
 and should work with all wavelet depths.
X-BeenThere: ffmpeg-devel@ffmpeg.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: FFmpeg development discussions and patches <ffmpeg-devel.ffmpeg.org>
List-Unsubscribe: <https://ffmpeg.org/mailman/options/ffmpeg-devel>,
 <mailto:ffmpeg-devel-request@ffmpeg.org?subject=unsubscribe>
List-Archive: <https://ffmpeg.org/pipermail/ffmpeg-devel>
List-Post: <mailto:ffmpeg-devel@ffmpeg.org>
List-Help: <mailto:ffmpeg-devel-request@ffmpeg.org?subject=help>
List-Subscribe: <https://ffmpeg.org/mailman/listinfo/ffmpeg-devel>,
 <mailto:ffmpeg-devel-request@ffmpeg.org?subject=subscribe>
Reply-To: FFmpeg development discussions and patches <ffmpeg-devel@ffmpeg.org>
Content-Type: text/plain; charset="us-ascii"
Content-Transfer-Encoding: 7bit
Errors-To: ffmpeg-devel-bounces@ffmpeg.org
Sender: "ffmpeg-devel" <ffmpeg-devel-bounces@ffmpeg.org>
Archived-At: <https://master.gitmailbox.com/ffmpegdev/GV1P250MB0737969C52577FF08254F7F58FBF2@GV1P250MB0737.EURP250.PROD.OUTLOOK.COM/>
List-Archive: <https://master.gitmailbox.com/ffmpegdev/>
List-Post: <mailto:ffmpegdev@gitmailbox.com>

IndecisiveTurtle:
> From: IndecisiveTurtle <geoster3d@gmail.com>
>
> Performance wise, encoding a 1080p 1-minute video is performed in about
> 2.5 minutes with the cpu encoder running on my Ryzen 5 4600H, while it
> takes about 30 seconds on my NVIDIA GTX 1650
>
> Haar shader has a subgroup optimized variant that applies when configured
> wavelet depth allows it
> ---
> +
> +void put_vc2_ue_uint(inout PutBitContext pb, uint val)
> +{
> +    int pbits = 0, topbit = 1, maxval = 1, bits = 0;
> +    if (val == 0)
> +    {
> +        put_bits(pb, 1, 1);
> +        return;
> +    }
> +    val++;
> +
> +    while (val > maxval)
> +    {
> +        topbit <<= 1;
> +        bits++;
> +        maxval <<= 1;
> +        maxval |= 1;
> +    }
> +
> +    for (int i = 0; i < bits; i++)
> +    {
> +        topbit >>= 1;
> +        pbits <<= 2;
> +        if ((val & topbit) != 0)
> +            pbits |= 1;
> +    }
> +
> +    put_bits(pb, bits * 2 + 1, (pbits << 1) | 1);
> +}
> +

You are still using the old and inefficient way to write VC-2
exponential coded integers. Improving this gave a nice speed boost to
the software encoder.

- Andreas
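
For illustration, the code word that the quoted function emits can be built in a single pass over the bits of val + 1, without first searching for the top bit. The following is a minimal C sketch of that idea. It is not taken from the software encoder, and whether it matches the improvement referred to above is an assumption. The names VC2Code, vc2_ue_code and vc2_ue_code_examples are hypothetical, and the sketch only computes the code word and its length, leaving the actual bit writing to the caller.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical helper type: a right-aligned code word plus its bit length. */
typedef struct VC2Code {
    uint64_t bits; /* code word, least significant bit is written last */
    unsigned len;  /* number of bits to write */
} VC2Code;

/*
 * Build the interleaved exp-Golomb code for val in one loop.
 * With n = val + 1, every bit of n below its leading 1 is emitted as the
 * pair "0, bit", and a single terminating 1 follows. Walking n from the
 * least significant bit upwards, data bit i lands at position 2*i + 1;
 * the 0 "follow" bits are simply the positions that are never set.
 * len is at most 63 for val < UINT32_MAX and 65 for val == UINT32_MAX,
 * so a caller must handle (or exclude) that last case separately.
 */
static VC2Code vc2_ue_code(uint32_t val)
{
    uint64_t n = (uint64_t)val + 1; /* 64-bit so that val + 1 cannot wrap */
    VC2Code c = { 1, 1 };           /* start with the terminating 1 bit */
    unsigned pos = 1;               /* position of the next data bit */

    while (n > 1) {
        c.bits |= (n & 1) << pos;   /* place data bit, follow bit stays 0 */
        pos   += 2;
        c.len += 2;
        n    >>= 1;
    }
    return c;
}

/* Worked values, checked by hand against the two-loop version quoted above. */
static void vc2_ue_code_examples(void)
{
    VC2Code c;

    c = vc2_ue_code(0); assert(c.len == 1 && c.bits == 0x1); /* 1     */
    c = vc2_ue_code(1); assert(c.len == 3 && c.bits == 0x1); /* 001   */
    c = vc2_ue_code(2); assert(c.len == 3 && c.bits == 0x3); /* 011   */
    c = vc2_ue_code(3); assert(c.len == 5 && c.bits == 0x1); /* 00001 */
    c = vc2_ue_code(4); assert(c.len == 5 && c.bits == 0x3); /* 00011 */
    c = vc2_ue_code(5); assert(c.len == 5 && c.bits == 0x9); /* 01001 */
}
```

The val == 0 case needs no special branch here because the loop body is skipped. The result would then be handed to whatever bit writer is in use, in one call when the length fits that writer's limit.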