From: IndecisiveTurtle <47210458+raphaelthegreat@users.noreply.github.com>
To: ffmpeg-devel@ffmpeg.org
Cc: IndecisiveTurtle <47210458+raphaelthegreat@users.noreply.github.com>
Date: Sat, 8 Mar 2025 17:33:56 +0200
Message-ID: <20250308153441.100573-2-47210458+raphaelthegreat@users.noreply.github.com>
In-Reply-To: <20250308153441.100573-1-47210458+raphaelthegreat@users.noreply.github.com>
References: <20250308153441.100573-1-47210458+raphaelthegreat@users.noreply.github.com>
Subject: [FFmpeg-devel] [PATCH v2 2/4] libavcodec/vulkan: Add modifications to common shader for VC2 vulkan encoder

---
 libavcodec/vulkan/common.comp | 58 +++++++++++++++++++++++++++--------
 1 file changed, 46 insertions(+), 12 deletions(-)

diff --git a/libavcodec/vulkan/common.comp b/libavcodec/vulkan/common.comp
index e4e983b3e2..3dc1527529 100644
--- a/libavcodec/vulkan/common.comp
+++ b/libavcodec/vulkan/common.comp
@@ -18,6 +18,9 @@
  * Foundation, Inc., 51 Franklin Street, Fifth Floor, Boston, MA 02110-1301 USA
  */
 
+#extension GL_EXT_buffer_reference : require
+#extension GL_EXT_buffer_reference2 : require
+
 layout(buffer_reference, buffer_reference_align = 1) buffer u8buf {
     uint8_t v;
 };
@@ -57,22 +60,20 @@ layout(buffer_reference, buffer_reference_align = 8) buffer u64buf {
 #define mid_pred(a, b, c) \
     max(min((a), (b)), min(max((a), (b)), (c)))
 
-/* TODO: optimize */
+
 uint align(uint src, uint a)
 {
-    uint res = src % a;
-    if (res == 0)
-        return src;
-    return src + a - res;
+    return (src + a - 1) & ~(a - 1);
+}
+
+int align(int src, int a)
+{
+    return (src + a - 1) & ~(a - 1);
 }
 
-/* TODO: optimize */
 uint64_t align64(uint64_t src, uint64_t a)
 {
-    uint64_t res = src % a;
-    if (res == 0)
-        return src;
-    return src + a - res;
+    return (src + a - 1) & ~(a - 1);
 }
 
 #define reverse4(src) \
@@ -163,6 +164,39 @@ uint32_t flush_put_bits(inout PutBitContext pb)
     return uint32_t(pb.buf - pb.buf_start);
 }
 
+void skip_put_bytes(inout PutBitContext pb, int n)
+{
+    int bytes_left = pb.bit_left >> 3;
+    if (n < bytes_left)
+    {
+        int n_bits = n << 3;
+        int mask = (1 << n_bits) - 1;
+        pb.bit_buf <<= n_bits;
+        pb.bit_buf |= mask;
+        pb.bit_left -= uint8_t(n_bits);
+        return;
+    }
+    if (pb.bit_left < BUF_BITS)
+    {
+        int mask = (1 << pb.bit_left) - 1;
+        pb.bit_buf <<= pb.bit_left;
+        pb.bit_buf |= mask;
+        u32vec2buf(pb.buf).v = BUF_REVERSE(pb.bit_buf);
+        pb.buf += BUF_BYTES;
+        n -= pb.bit_left >> 3;
+    }
+    int skip_dwords = n >> 2;
+    while (skip_dwords > 0)
+    {
+        u32buf(pb.buf).v = 0xFFFFFFFF;
+        pb.buf += 4;
+        skip_dwords--;
+    }
+    int skip_bits = (n & 3) << 3;
+    pb.bit_buf = (1 << skip_bits) - 1;
+    pb.bit_left = uint8_t(BUF_BITS - skip_bits);
+}
+
 void init_put_bits(out PutBitContext pb, u8buf data, uint64_t len)
 {
     pb.buf_start = uint64_t(data);
@@ -177,8 +211,8 @@ uint64_t put_bits_count(in PutBitContext pb)
     return (pb.buf - pb.buf_start)*8 + BUF_BITS - pb.bit_left;
 }
 
-uint32_t put_bytes_count(in PutBitContext pb)
+int32_t put_bytes_count(in PutBitContext pb)
 {
     uint64_t num_bytes = (pb.buf - pb.buf_start) + ((BUF_BITS - pb.bit_left) >> 3);
-    return uint32_t(num_bytes);
+    return int32_t(num_bytes);
 }
-- 
2.48.1
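
A note on the align() change in this patch, offered as a sketch rather than a definitive analysis. The mask form `(src + a - 1) & ~(a - 1)` gives the same result as the removed modulo form only when `a` is a power of two. The C port below compares the two forms; the names `align_mod`, `align_mask`, `is_pow2` and `count_mismatches` are hypothetical and exist only for this illustration. Whether every caller of align() in the shaders passes a power-of-two alignment is an assumption that has not been checked here.

```c
#include <stdint.h>

/* Port of the removed modulo-based form; valid for any a > 0. */
static uint32_t align_mod(uint32_t src, uint32_t a)
{
    uint32_t res = src % a;
    return res == 0 ? src : src + a - res;
}

/* Port of the mask form from the patch; assumes a is a power of two. */
static uint32_t align_mask(uint32_t src, uint32_t a)
{
    return (src + a - 1) & ~(a - 1);
}

/* Hypothetical helper: nonzero when a is a power of two. */
static int is_pow2(uint32_t a)
{
    return a != 0 && (a & (a - 1)) == 0;
}

/* Count the inputs in [0, limit) for which the two forms disagree. */
static uint32_t count_mismatches(uint32_t a, uint32_t limit)
{
    uint32_t n = 0;
    for (uint32_t src = 0; src < limit; src++)
        if (align_mod(src, a) != align_mask(src, a))
            n++;
    return n;
}
```

With a = 8, both forms round 13 up to 16. With a = 12, the modulo form returns 24 while the mask form returns 16, so a guard such as `is_pow2(a)` or a comment documenting the power-of-two requirement may be worth adding if any caller can pass another alignment.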