From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from ffbox0-bg.mplayerhq.hu (ffbox0-bg.ffmpeg.org [79.124.17.100]) by master.gitmailbox.com (Postfix) with ESMTPS id 4805D4E2D0 for ; Mon, 10 Mar 2025 19:49:51 +0000 (UTC) Received: from [127.0.1.1] (localhost [127.0.0.1]) by ffbox0-bg.mplayerhq.hu (Postfix) with ESMTP id 8A68968DF92; Mon, 10 Mar 2025 21:49:43 +0200 (EET) Received: from mail-wr1-f43.google.com (mail-wr1-f43.google.com [209.85.221.43]) by ffbox0-bg.mplayerhq.hu (Postfix) with ESMTPS id 13B3868DF94 for ; Mon, 10 Mar 2025 21:49:42 +0200 (EET) Received: by mail-wr1-f43.google.com with SMTP id ffacd0b85a97d-3912c09bea5so3975161f8f.1 for ; Mon, 10 Mar 2025 12:49:41 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20230601; t=1741636181; x=1742240981; darn=ffmpeg.org; h=thread-index:content-language:content-transfer-encoding :mime-version:message-id:date:subject:to:from:from:to:cc:subject :date:message-id:reply-to; bh=CnoYIIr4cRQTINqeYwqtiX6VKaF4FavicuNiCo1Dr3g=; b=Fv+kNHHKbmRWvLJMel8atTAxqo3IXTmuFpWsAOEwU3BUdb3oCQaS/TAVe2Iq6k7SCJ gj2236RH5d+fmCIPx5bVCyuu6T4E1M7VnA+QAYb9zxExu2rP55V1EMRIZohZkKrSKrgx 1Uvt2Y5x+sKotpPhY9xpLwzRW0xtLwu4XY995cXlglUMm/K0s7tVPHH0tsDEvwWSC/pi 5eOuSrq6bMIZFRB0LxQe33EX1YNKROYLp6lBoS4NhidsvWGWYbOQBnaXJOxvxVGwmEUk MH2lK1L0AHIsv/Q9Mhm04699J/L7YQm+o5Xws0EP1XOirQ4227x8GJRS6cfJGgry4PrR IVAQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1741636181; x=1742240981; h=thread-index:content-language:content-transfer-encoding :mime-version:message-id:date:subject:to:from:x-gm-message-state :from:to:cc:subject:date:message-id:reply-to; bh=CnoYIIr4cRQTINqeYwqtiX6VKaF4FavicuNiCo1Dr3g=; b=W+IPJh2Ws0//Y2bLqglIqxDS/HzbmwaGkYl0Mr5Vw4mHr98Buqk0SRmB0ZcfcmhZON i+up0p0cAnx5mb/+xbPF+PfXcoIQqs3CQAhDxTbU9R9VFTV4fMDWa2iJ7/rj+KvzF03D /ogy4xsdA9Xv5XuiLcpExihhqkPxVDTouEzEM4KslxVq8VYIT7bdOlufkDZq4JShkMDB elO7WJPYtzTkvIZ0N4LRdZOU5ZU1NqejVLbKIMilNmo8ubhoHycBDRxbTHm4/5OMRXxD 3NWMN58xPApOR+0wOlpbBdHctECc2fD9eT7p9UJCl5vueRbCdBxd5Rs5n7S7qvfezKRJ igBA== X-Gm-Message-State: AOJu0Yxtwtfd8XgsnsmNlv/+1K80xU5x2P2djv+RtQ7ie95yWf+agT3Z hTXgDwCzol3+5Zh506qXnkc/BhKqbK5nGFPoGUJecGtApTar8862yS0jDg== X-Gm-Gg: ASbGncu3FWd+DfA/uO/AEffV7yfUg0JXvR9DT1bXsOQlxCH8ljssn3g2Zpw9jjG5MsF BD3RZCS5C0BqBLlNQbhuCRnPz+3gH9YEGYrpLyewirRYvnfNPbn2RFEoNhACxpCEvwrFRgtIJ+u ofakai6VqERxnnE0Eru763oRmmZdFS8SMIYfT0VC3lDnFDf/XOSEcGZnjN9NYep+HDdjs3kpYh0 pxNi3lhB1aE2VXyxFcbp+VKrqq5HXUjLr//l2mWdSqCj9FLnihDPtn8X3I21EQfuqXRsTXRqxp5 xKsQGTzd9Np95sEQBsmA0oQwOZIY0CoRSeSDDbjvUdfCdl/OCh8U0DiKnKfyi4sI7vXLBD+fYgn JQ+719MrqStZDm533 X-Google-Smtp-Source: AGHT+IEw3Hi68blyIfGqfyeH1YD6mD3iYRGXt31oWhLIpo0DTuS4f+Itq+kBlp3TFx1FHXWGgYjFXQ== X-Received: by 2002:a5d:5f84:0:b0:391:487f:2828 with SMTP id ffacd0b85a97d-39263b01e2fmr1439200f8f.10.1741636180740; Mon, 10 Mar 2025 12:49:40 -0700 (PDT) Received: from MK2 (80-108-16-220.cable.dynamic.surfer.at. [80.108.16.220]) by smtp.gmail.com with ESMTPSA id 5b1f17b1804b1-43cf6d05234sm48561685e9.2.2025.03.10.12.49.40 for (version=TLS1_2 cipher=ECDHE-ECDSA-AES128-GCM-SHA256 bits=128/128); Mon, 10 Mar 2025 12:49:40 -0700 (PDT) From: To: Date: Mon, 10 Mar 2025 20:49:40 +0100 Message-ID: <003601db91f5$901b12b0$b0513810$@gmail.com> MIME-Version: 1.0 X-Mailer: Microsoft Outlook 16.0 Content-Language: en-at Thread-Index: AduR9Y/DHDVuegH1SfqcpFFh9B5zlw== Subject: [FFmpeg-devel] [PATCH v2 FFmpeg 2/20] libavfilter/dnn_filter_common: batch tokenizer implementation X-BeenThere: ffmpeg-devel@ffmpeg.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: FFmpeg development discussions and patches List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Reply-To: FFmpeg development discussions and patches Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Errors-To: ffmpeg-devel-bounces@ffmpeg.org Sender: "ffmpeg-devel" Archived-At: List-Archive: List-Post: Signed-off-by: MaximilianKaindl --- libavfilter/dnn_filter_common.c | 154 ++++++++++++++++++++++++++++++++ libavfilter/dnn_filter_common.h | 19 ++++ 2 files changed, 173 insertions(+) diff --git a/libavfilter/dnn_filter_common.c b/libavfilter/dnn_filter_common.c index 6b9c6f8d7f..c4ad000409 100644 --- a/libavfilter/dnn_filter_common.c +++ b/libavfilter/dnn_filter_common.c @@ -20,6 +20,11 @@ #include "libavutil/avstring.h" #include "libavutil/mem.h" #include "libavutil/opt.h" +#include "libavformat/avio.h" + +#if (CONFIG_LIBTOKENIZERS == 1) +#include "tokenizers_c.h" +#endif #define MAX_SUPPORTED_OUTPUTS_NB 4 @@ -217,3 +222,152 @@ void ff_dnn_uninit(DnnContext *ctx) av_freep(&ctx->model_outputnames); } } + +static int load_file_content(const char *path, char **data, size_t *data_size, void *log_ctx) { + AVIOContext *avio_ctx = NULL; + int ret; + int64_t size; + + ret = avio_open(&avio_ctx, path, AVIO_FLAG_READ); + if (ret < 0) { + if (log_ctx) + av_log(log_ctx, AV_LOG_ERROR, "Cannot open file: %s\n", path); + return ret; + } + + size = avio_size(avio_ctx); + if (size < 0) { + if (log_ctx) + av_log(log_ctx, AV_LOG_ERROR, "Failed to determine file size: %s\n", path); + avio_closep(&avio_ctx); + return size; + } + + *data = av_malloc(size + 1); + if (!*data) { + avio_closep(&avio_ctx); + return AVERROR(ENOMEM); + } + + ret = avio_read(avio_ctx, (unsigned char *)*data, size); + avio_closep(&avio_ctx); + + if (ret < 0) { + if (log_ctx) + av_log(log_ctx, AV_LOG_ERROR, "Failed to read file: %s\n", path); + av_freep(data); + return ret; + } + + if (ret != size) { + if (log_ctx) + av_log(log_ctx, AV_LOG_ERROR, "Incomplete read: %s\n", path); + av_freep(data); + return AVERROR(EIO); + } + + // Null-terminate the data + (*data)[size] = '\0'; + *data_size = size; + + return 0; +} + +#if (CONFIG_LIBTOKENIZERS == 1) +TokenizerHandle ff_dnn_tokenizer_create(const char *path, void *log_ctx) +{ + char *blob = NULL; + size_t blob_size = 0; + TokenizerHandle handle = NULL; + int ret; + + if (!path) { + if (log_ctx) + av_log(log_ctx, AV_LOG_ERROR, "Tokenizer path is NULL\n"); + return NULL; + } + + ret = load_file_content(path, &blob, &blob_size, log_ctx); + if (ret < 0) + return NULL; + + handle = tokenizers_new_from_str(blob, blob_size); + av_freep(&blob); + + if (!handle && log_ctx) + av_log(log_ctx, AV_LOG_ERROR, "Error creating tokenizer\n"); + + return handle; +} + +int ff_dnn_tokenizer_encode_batch(TokenizerHandle tokenizer, const char **texts, int text_count, + TokenizerEncodeResult **results, void *log_ctx) +{ + size_t *lengths = NULL; + int ret = 0; + + if (!tokenizer) { + if (log_ctx) + av_log(log_ctx, AV_LOG_ERROR, "Tokenizer is NULL\n"); + return AVERROR(EINVAL); + } + + if (!texts || text_count <= 0 || !results) { + if (log_ctx) + av_log(log_ctx, AV_LOG_ERROR, "Invalid parameters\n"); + return AVERROR(EINVAL); + } + + *results = av_calloc(text_count, sizeof(**results)); + if (!*results) { + ret = AVERROR(ENOMEM); + goto fail; + } + + lengths = av_calloc(text_count, sizeof(*lengths)); + if (!lengths) { + ret = AVERROR(ENOMEM); + goto fail; + } + + // Calculate text lengths + for (int i = 0; i < text_count; i++) { + lengths[i] = texts[i] ? strlen(texts[i]) : 0; + } + + // Tokenize all texts in batch - directly store results in the output array + tokenizers_encode_batch(tokenizer, texts, lengths, text_count, 1, *results); + + av_freep(&lengths); + return 0; + +fail: + av_freep(results); + av_freep(&lengths); + return ret; +} + +int ff_dnn_create_tokenizer_and_encode_batch(const char *path, const char **texts, int text_count, + TokenizerEncodeResult **results, void *log_ctx) +{ + int ret; + + // Create tokenizer + TokenizerHandle tokenizer = ff_dnn_tokenizer_create(path, log_ctx); + if (!tokenizer) { + av_log(log_ctx, AV_LOG_ERROR, "Error creating tokenizer\n"); + return AVERROR(EINVAL); + } + + // Tokenize batch + ret = ff_dnn_tokenizer_encode_batch(tokenizer, texts, text_count, results, log_ctx); + + if (ret < 0) { + av_log(log_ctx, AV_LOG_ERROR, "Failed to tokenize batch text\n"); + } + + // Clean up tokenizer + ff_dnn_tokenizer_free(tokenizer); + return ret; +} +#endif \ No newline at end of file diff --git a/libavfilter/dnn_filter_common.h b/libavfilter/dnn_filter_common.h index 42a4719997..fffa676a9e 100644 --- a/libavfilter/dnn_filter_common.h +++ b/libavfilter/dnn_filter_common.h @@ -25,6 +25,9 @@ #define AVFILTER_DNN_FILTER_COMMON_H #include "dnn_interface.h" +#if(CONFIG_LIBTOKENIZERS == 1) +#include "tokenizers_c.h" +#endif #define DNN_FILTER_CHILD_CLASS_ITERATE(name, backend_mask) \ static const AVClass *name##_child_class_iterate(void **iter) \ @@ -63,4 +66,20 @@ DNNAsyncStatusType ff_dnn_get_result(DnnContext *ctx, AVFrame **in_frame, AVFram int ff_dnn_flush(DnnContext *ctx); void ff_dnn_uninit(DnnContext *ctx); +#if(CONFIG_LIBTOKENIZERS == 1) +TokenizerHandle ff_dnn_tokenizer_create(const char *path, void *log_ctx); +int ff_dnn_tokenizer_encode_batch(TokenizerHandle tokenizer, const char **texts, int text_count, TokenizerEncodeResult **results, void *log_ctx); +int ff_dnn_create_tokenizer_and_encode_batch(const char *path, const char **texts, int text_count, TokenizerEncodeResult **results, void *log_ctx); + +inline void ff_dnn_tokenizer_free(TokenizerHandle tokenizer) { + if (tokenizer) + tokenizers_free(tokenizer); +} +inline void ff_dnn_tokenizer_free_results(TokenizerEncodeResult *results, int count) { + if (results) { + tokenizers_free_encode_results(results, count); + } +} +#endif + #endif -- 2.34.1 _______________________________________________ ffmpeg-devel mailing list ffmpeg-devel@ffmpeg.org https://ffmpeg.org/mailman/listinfo/ffmpeg-devel To unsubscribe, visit link above, or email ffmpeg-devel-request@ffmpeg.org with subject "unsubscribe".