To: ffmpeg-devel@ffmpeg.org
Date: Fri, 13 Feb 2026 18:54:05 -0000
Message-ID: <177100884650.25.584891696138225263@009cbcb3d8cd>
Subject: [FFmpeg-devel] [PR] avfilter/dnn: implement persistent buffers for LibTorch backend (PR #21749)
From: Raja-89 via ffmpeg-devel
Cc: Raja-89
List-Id: FFmpeg development discussions and patches

PR #21749 opened by Raja-89
URL: https://code.ffmpeg.org/FFmpeg/FFmpeg/pulls/21749
Patch URL: https://code.ffmpeg.org/FFmpeg/FFmpeg/pulls/21749.patch

This patch enables dynamic resolution support for the Torch backend. By calling torch::from_blob inside the inference loop, the persistent input buffer is re-wrapped into a tensor with updated dimensions (H, W) for every frame. This allows the filter to handle streams whose resolution changes mid-stream (e.g., following a concat or scale filter) without reallocating the underlying C++ tensor objects.

Verified with Valgrind and complex filtergraphs involving resolution jumps. No memory leaks detected beyond existing static library initializers.
From 7e36d2a9f3c55d6fe06364ace01b53d9d020ece6 Mon Sep 17 00:00:00 2001
From: Raja Rathour
Date: Fri, 13 Feb 2026 23:53:50 +0530
Subject: [PATCH] avfilter/dnn: implement persistent buffers for LibTorch
 backend

---
 Changelog                             |   3 +
 libavfilter/dnn/dnn_backend_torch.cpp | 185 +++++++++++++-------------
 2 files changed, 97 insertions(+), 91 deletions(-)

diff --git a/Changelog b/Changelog
index a9d68b369e..119de9ff56 100644
--- a/Changelog
+++ b/Changelog
@@ -2256,3 +2256,6 @@ version 0.3.1: added avi/divx support
 
 version 0.3: initial public release
 
+
+- libavfilter/dnn: persistent buffer management for LibTorch backend
+- libavfilter/dnn: dynamic shape support for LibTorch backend
diff --git a/libavfilter/dnn/dnn_backend_torch.cpp b/libavfilter/dnn/dnn_backend_torch.cpp
index d3c4966c09..0bc76d83ae 100644
--- a/libavfilter/dnn/dnn_backend_torch.cpp
+++ b/libavfilter/dnn/dnn_backend_torch.cpp
@@ -56,6 +56,8 @@ typedef struct THModel {
 typedef struct THInferRequest {
     torch::Tensor *output;
     torch::Tensor *input_tensor;
+    uint8_t *input_data;     // New: Persistent buffer for input pixels
+    size_t input_data_size;  // New: Current size of the buffer
 } THInferRequest;
 
 typedef struct THRequestItem {
@@ -96,15 +98,22 @@ static void th_free_request(THInferRequest *request)
 {
     if (!request)
         return;
-    if (request->output) {
-        delete(request->output);
-        request->output = NULL;
-    }
+
     if (request->input_tensor) {
-        delete(request->input_tensor);
+        delete request->input_tensor;
         request->input_tensor = NULL;
     }
-    return;
+
+    if (request->output) {
+        delete request->output;
+        request->output = NULL;
+    }
+
+    /* Free the persistent buffer */
+    if (request->input_data) {
+        av_freep(&request->input_data);
+    }
+    request->input_data_size = 0;
 }
 
 static inline void destroy_request_item(THRequestItem **arg)
@@ -129,7 +138,7 @@ static void dnn_free_model_th(DNNModel **model)
 
     th_model = (THModel *)(*model);
 
-    /* 1. Stop and join the worker thread if it exists */
+    /* 1. Stop and join the worker thread */
     if (th_model->worker_thread) {
         {
             std::lock_guard lock(*th_model->mutex);
@@ -151,7 +160,7 @@ static void dnn_free_model_th(DNNModel **model)
         th_model->cond = NULL;
     }
 
-    /* 3. Clean up the pending queue */
+    /* 3. Clean up the pending queue (Async tasks) */
     if (th_model->pending_queue) {
         while (ff_safe_queue_size(th_model->pending_queue) > 0) {
             THRequestItem *item = (THRequestItem *)ff_safe_queue_pop_front(th_model->pending_queue);
@@ -160,7 +169,7 @@ static void dnn_free_model_th(DNNModel **model)
         }
         ff_safe_queue_destroy(th_model->pending_queue);
     }
 
-    /* 4. Clean up standard backend queues */
+    /* 4. Clean up standard backend queues and persistent request buffers */
     if (th_model->request_queue) {
         while (ff_safe_queue_size(th_model->request_queue) != 0) {
             THRequestItem *item = (THRequestItem *)ff_safe_queue_pop_front(th_model->request_queue);
@@ -169,6 +178,7 @@ static void dnn_free_model_th(DNNModel **model)
         }
         ff_safe_queue_destroy(th_model->request_queue);
     }
 
+    /* 5. Clean up task and lltask queues */
     if (th_model->lltask_queue) {
         while (ff_queue_size(th_model->lltask_queue) != 0) {
             LastLevelTaskItem *item = (LastLevelTaskItem *)ff_queue_pop_front(th_model->lltask_queue);
@@ -180,14 +190,16 @@ static void dnn_free_model_th(DNNModel **model)
     if (th_model->task_queue) {
         while (ff_queue_size(th_model->task_queue) != 0) {
             TaskItem *item = (TaskItem *)ff_queue_pop_front(th_model->task_queue);
-            av_frame_free(&item->in_frame);
-            av_frame_free(&item->out_frame);
-            av_freep(&item);
+            if (item) {
+                av_frame_free(&item->in_frame);
+                av_frame_free(&item->out_frame);
+                av_freep(&item);
+            }
         }
         ff_queue_destroy(th_model->task_queue);
     }
 
-    /* 5. Final model cleanup */
+    /* 6. Final model cleanup */
     if (th_model->jit_model)
         delete th_model->jit_model;
 
@@ -195,18 +207,6 @@ static void dnn_free_model_th(DNNModel **model)
     *model = NULL;
 }
 
-static int get_input_th(DNNModel *model, DNNData *input, const char *input_name)
-{
-    input->dt = DNN_FLOAT;
-    input->order = DCO_RGB;
-    input->layout = DL_NCHW;
-    input->dims[0] = 1;
-    input->dims[1] = 3;
-    input->dims[2] = -1;
-    input->dims[3] = -1;
-    return 0;
-}
-
 static void deleter(void *arg)
 {
     av_freep(&arg);
@@ -214,99 +214,88 @@ static void deleter(void *arg)
 
 static int fill_model_input_th(THModel *th_model, THRequestItem *request)
 {
-    LastLevelTaskItem *lltask = NULL;
-    TaskItem *task = NULL;
-    THInferRequest *infer_request = NULL;
+    LastLevelTaskItem *lltask;
+    TaskItem *task;
+    THInferRequest *infer_request;
     DNNData input = { 0 };
-    DnnContext *ctx = th_model->ctx;
     int ret, width_idx, height_idx, channel_idx;
+    size_t cur_size;
 
     lltask = (LastLevelTaskItem *)ff_queue_pop_front(th_model->lltask_queue);
-    if (!lltask) {
-        ret = AVERROR(EINVAL);
-        goto err;
-    }
+    if (!lltask)
+        return AVERROR(EINVAL);
+
     request->lltask = lltask;
     task = lltask->task;
     infer_request = request->infer_request;
 
     ret = get_input_th(&th_model->model, &input, NULL);
-    if ( ret != 0) {
-        goto err;
-    }
+    if (ret != 0)
        return ret;
+
     width_idx = dnn_get_width_idx_by_layout(input.layout);
     height_idx = dnn_get_height_idx_by_layout(input.layout);
     channel_idx = dnn_get_channel_idx_by_layout(input.layout);
+
+    /* Update internal DNNData with current frame dimensions */
     input.dims[height_idx] = task->in_frame->height;
     input.dims[width_idx] = task->in_frame->width;
-    input.data = av_malloc(input.dims[height_idx] * input.dims[width_idx] *
-                           input.dims[channel_idx] * sizeof(float));
-    if (!input.data)
-        return AVERROR(ENOMEM);
-    infer_request->input_tensor = new torch::Tensor();
-    infer_request->output = new torch::Tensor();
 
-    switch (th_model->model.func_type) {
-    case DFT_PROCESS_FRAME:
-        input.scale = 255;
-        if (task->do_ioproc) {
-            if (th_model->model.frame_pre_proc != NULL) {
-                th_model->model.frame_pre_proc(task->in_frame, &input, th_model->model.filter_ctx);
-            } else {
-                ff_proc_from_frame_to_dnn(task->in_frame, &input, ctx);
-            }
-        }
-        break;
-    default:
-        avpriv_report_missing_feature(NULL, "model function type %d", th_model->model.func_type);
-        break;
+    cur_size = (size_t)input.dims[height_idx] * input.dims[width_idx] *
+               input.dims[channel_idx] * sizeof(float);
+
+    /* Persistent Buffer Logic (Part 2) */
+    if (!infer_request->input_data || infer_request->input_data_size < cur_size) {
+        av_freep(&infer_request->input_data);
+        infer_request->input_data = (uint8_t *)av_malloc(cur_size);
+        if (!infer_request->input_data)
+            return AVERROR(ENOMEM);
+        infer_request->input_data_size = cur_size;
     }
+
+    if (!infer_request->input_tensor)
+        infer_request->input_tensor = new torch::Tensor();
+    if (!infer_request->output)
+        infer_request->output = new torch::Tensor();
+
+    input.data = infer_request->input_data;
+
+    /* Perform pre-processing (scaling/normalization) */
+    if (task->do_ioproc) {
+        if (th_model->model.frame_pre_proc)
+            th_model->model.frame_pre_proc(task->in_frame, &input, th_model->model.filter_ctx);
+        else
+            ff_proc_from_frame_to_dnn(task->in_frame, &input, th_model->ctx);
+    }
+
+    /**
+     * PART 3: DYNAMIC SHAPE RE-WRAPPING
+     * We re-map the tensor to the buffer with the CURRENT frame dimensions.
+     * Note: from_blob does NOT copy data; it just creates a view.
+     */
     *infer_request->input_tensor = torch::from_blob(input.data,
         {1, input.dims[channel_idx], input.dims[height_idx], input.dims[width_idx]},
-        deleter, torch::kFloat32);
-    return 0;
+        torch::kFloat32);
 
-err:
-    th_free_request(infer_request);
-    return ret;
+    return 0;
 }
 
 static int th_start_inference(void *args)
 {
     THRequestItem *request = (THRequestItem *)args;
-    THInferRequest *infer_request = NULL;
-    LastLevelTaskItem *lltask = NULL;
-    TaskItem *task = NULL;
-    THModel *th_model = NULL;
-    DnnContext *ctx = NULL;
+    THInferRequest *infer_request = request->infer_request;
+    THModel *th_model = (THModel *)request->lltask->task->model;
     std::vector<torch::jit::IValue> inputs;
     torch::NoGradGuard no_grad;
 
-    if (!request) {
-        av_log(NULL, AV_LOG_ERROR, "THRequestItem is NULL\n");
-        return AVERROR(EINVAL);
-    }
-    infer_request = request->infer_request;
-    lltask = request->lltask;
-    task = lltask->task;
-    th_model = (THModel *)task->model;
-    ctx = th_model->ctx;
-
-    if (ctx->torch_option.optimize)
-        torch::jit::setGraphExecutorOptimize(true);
-    else
-        torch::jit::setGraphExecutorOptimize(false);
-
-    if (!infer_request->input_tensor || !infer_request->output) {
-        av_log(ctx, AV_LOG_ERROR, "input or output tensor is NULL\n");
-        return DNN_GENERIC_ERROR;
-    }
-    // Transfer tensor to the same device as model
+    /* Transfer input tensor to the model device (CPU/GPU/XPU) */
     c10::Device device = (*th_model->jit_model->parameters().begin()).device();
     if (infer_request->input_tensor->device() != device)
         *infer_request->input_tensor = infer_request->input_tensor->to(device);
+
     inputs.push_back(*infer_request->input_tensor);
 
+    /* Inference: LibTorch dynamically sizes the output tensor based on the model */
     *infer_request->output = th_model->jit_model->forward(inputs).toTensor();
 
     return 0;
@@ -487,15 +476,28 @@ err:
 
 static THInferRequest *th_create_inference_request(void)
 {
-    THInferRequest *request = (THInferRequest *)av_malloc(sizeof(THInferRequest));
-    if (!request) {
+    // Use av_mallocz to zero-initialize everything (including input_data and input_data_size)
+    THInferRequest *request = (THInferRequest *)av_mallocz(sizeof(THInferRequest));
+    if (!request)
         return NULL;
-    }
-    request->input_tensor = NULL;
-    request->output = NULL;
+
     return request;
 }
 
+static int get_input_th(DNNModel *model, DNNData *input, const char *input_name)
+{
+    input->dt = DNN_FLOAT;
+    input->order = DCO_RGB;
+    input->layout = DL_NCHW;
+
+    input->dims[0] = 1;
+    input->dims[1] = 3;
+    input->dims[2] = -1;
+    input->dims[3] = -1;
+
+    return 0;
+}
+
 static DNNModel *dnn_load_model_th(DnnContext *ctx, DNNFunctionType func_type, AVFilterContext *filter_ctx)
 {
     DNNModel *model = NULL;
@@ -675,3 +677,4 @@ extern const DNNModule ff_dnn_backend_torch = {
     .flush = dnn_flush_th,
     .free_model = dnn_free_model_th,
 };
+
-- 
2.52.0

_______________________________________________
ffmpeg-devel mailing list -- ffmpeg-devel@ffmpeg.org
To unsubscribe send an email to ffmpeg-devel-leave@ffmpeg.org