From: "Chen, Wenbin"
To: FFmpeg development discussions and patches
Date: Wed, 21 Feb 2024 03:08:04 +0000
References: <20240220044824.1439205-1-wenbin.chen@intel.com>
Subject: Re: [FFmpeg-devel] [PATCH v3] libavfi/dnn: add LibTorch as one of DNN backend
List-Id: FFmpeg development discussions and patches
Content-Type: text/plain; charset="us-ascii"

> Hello,
>
> On Tue, 20 Feb 2024, at 05:48, wenbin.chen-at-intel.com@ffmpeg.org wrote:
> > From: Wenbin Chen
> >
> > PyTorch is an open source machine learning framework that accelerates
>
> OK for me
>
> > the path from research prototyping to production deployment. Official
> > websit: https://pytorch.org/. We call the C++ library of PyTorch as
>
> websitE

Fixed in Patch v4.

Thanks
Wenbin

>
> > LibTorch, the same below.
> >
> > To build FFmpeg with LibTorch, please take following steps as reference:
> > 1. download LibTorch C++ library in https://pytorch.org/get-started/locally/,
> >    please select C++/Java for language, and other options as your need.
> > 2. unzip the file to your own dir, with command
> >    unzip libtorch-shared-with-deps-latest.zip -d your_dir
> > 3. export libtorch_root/libtorch/include and
> >    libtorch_root/libtorch/include/torch/csrc/api/include to $PATH
> >    export libtorch_root/libtorch/lib/ to $LD_LIBRARY_PATH
> > 4. config FFmpeg with ../configure --enable-libtorch
> >    --extra-cflag=-I/libtorch_root/libtorch/include
> >    --extra-cflag=-I/libtorch_root/libtorch/include/torch/csrc/api/include
> >    --extra-ldflags=-L/libtorch_root/libtorch/lib/
> > 5. make
> >
> > To run FFmpeg DNN inference with LibTorch backend:
> > ./ffmpeg -i input.jpg -vf dnn_processing=dnn_backend=torch:model=LibTorch_model.pt -y output.jpg
> > The LibTorch_model.pt can be generated by Python with torch.jit.script() api.
> > Please note, torch.jit.trace() is not recommanded, since it does not support
> > ambiguous input size.
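For readers new to the export step, a minimal script along these lines produces
a scripted LibTorch_model.pt (the Identity module here is only a placeholder
for a real network):

    import torch

    class Identity(torch.nn.Module):
        def forward(self, x: torch.Tensor) -> torch.Tensor:
            return x

    model = Identity()
    model.eval()
    # torch.jit.script() keeps Python control flow and accepts arbitrary
    # input sizes; torch.jit.trace() would bake in the example input shape.
    scripted = torch.jit.script(model)
    scripted.save("LibTorch_model.pt")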
> > Signed-off-by: Ting Fu
> > Signed-off-by: Wenbin Chen
> > ---
> >  configure                             |   5 +-
> >  libavfilter/dnn/Makefile              |   1 +
> >  libavfilter/dnn/dnn_backend_torch.cpp | 597 ++++++++++++++++++++++++++
> >  libavfilter/dnn/dnn_interface.c       |   5 +
> >  libavfilter/dnn_filter_common.c       |  15 +-
> >  libavfilter/dnn_interface.h           |   2 +-
> >  libavfilter/vf_dnn_processing.c       |   3 +
> >  7 files changed, 624 insertions(+), 4 deletions(-)
> >  create mode 100644 libavfilter/dnn/dnn_backend_torch.cpp
> >
> > diff --git a/configure b/configure
> > index 2c635043dd..450ef54a80 100755
> > --- a/configure
> > +++ b/configure
> > @@ -279,6 +279,7 @@ External library support:
> >    --enable-libtheora       enable Theora encoding via libtheora [no]
> >    --enable-libtls          enable LibreSSL (via libtls), needed for https support
> >                             if openssl, gnutls or mbedtls is not used [no]
> > +  --enable-libtorch        enable Torch as one DNN backend [no]
> >    --enable-libtwolame      enable MP2 encoding via libtwolame [no]
> >    --enable-libuavs3d       enable AVS3 decoding via libuavs3d [no]
> >    --enable-libv4l2         enable libv4l2/v4l-utils [no]
> > @@ -1901,6 +1902,7 @@ EXTERNAL_LIBRARY_LIST="
> >      libtensorflow
> >      libtesseract
> >      libtheora
> > +    libtorch
> >      libtwolame
> >      libuavs3d
> >      libv4l2
> > @@ -2781,7 +2783,7 @@ cbs_vp9_select="cbs"
> >  deflate_wrapper_deps="zlib"
> >  dirac_parse_select="golomb"
> >  dovi_rpu_select="golomb"
> > -dnn_suggest="libtensorflow libopenvino"
> > +dnn_suggest="libtensorflow libopenvino libtorch"
> >  dnn_deps="avformat swscale"
> >  error_resilience_select="me_cmp"
> >  evcparse_select="golomb"
> > @@ -6886,6 +6888,7 @@ enabled libtensorflow && require libtensorflow tensorflow/c/c_api.h TF_Versi
> >  enabled libtesseract      && require_pkg_config libtesseract tesseract tesseract/capi.h TessBaseAPICreate
> >  enabled libtheora         && require libtheora theora/theoraenc.h th_info_init -ltheoraenc -ltheoradec -logg
> >  enabled libtls            && require_pkg_config libtls libtls tls.h tls_configure
> > +enabled libtorch          && check_cxxflags -std=c++14 && require_cpp libtorch torch/torch.h "torch::Tensor" -ltorch -lc10 -ltorch_cpu -lstdc++ -lpthread
> >  enabled libtwolame        && require libtwolame twolame.h twolame_init -ltwolame &&
> >                               { check_lib libtwolame twolame.h twolame_encode_buffer_float32_interleaved -ltwolame ||
> >                                 die "ERROR: libtwolame must be installed and version must be >= 0.3.10"; }
> > diff --git a/libavfilter/dnn/Makefile b/libavfilter/dnn/Makefile
> > index 5d5697ea42..3d09927c98 100644
> > --- a/libavfilter/dnn/Makefile
> > +++ b/libavfilter/dnn/Makefile
> > @@ -6,5 +6,6 @@ OBJS-$(CONFIG_DNN) += dnn/dnn_backend_common.o
> >
> >  DNN-OBJS-$(CONFIG_LIBTENSORFLOW) += dnn/dnn_backend_tf.o
> >  DNN-OBJS-$(CONFIG_LIBOPENVINO)   += dnn/dnn_backend_openvino.o
> > +DNN-OBJS-$(CONFIG_LIBTORCH)      += dnn/dnn_backend_torch.o
> >
> >  OBJS-$(CONFIG_DNN)               += $(DNN-OBJS-yes)
> > diff --git a/libavfilter/dnn/dnn_backend_torch.cpp
> b/libavfilter/dnn/dnn_backend_torch.cpp > > new file mode 100644 > > index 0000000000..54d3b309a1 > > --- /dev/null > > +++ b/libavfilter/dnn/dnn_backend_torch.cpp > > @@ -0,0 +1,597 @@ > > +/* > > + * Copyright (c) 2024 > > + * > > + * This file is part of FFmpeg. > > + * > > + * FFmpeg is free software; you can redistribute it and/or > > + * modify it under the terms of the GNU Lesser General Public > > + * License as published by the Free Software Foundation; either > > + * version 2.1 of the License, or (at your option) any later version. > > + * > > + * FFmpeg is distributed in the hope that it will be useful, > > + * but WITHOUT ANY WARRANTY; without even the implied warranty of > > + * MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the > GNU > > + * Lesser General Public License for more details. > > + * > > + * You should have received a copy of the GNU Lesser General Public > > + * License along with FFmpeg; if not, write to the Free Software > > + * Foundation, Inc., 51 Franklin Street, Fifth Floor, Boston, MA > > 02110-1301 USA > > + */ > > + > > +/** > > + * @file > > + * DNN Torch backend implementation. > > + */ > > + > > +#include > > +#include > > + > > +extern "C" { > > +#include "../internal.h" > > +#include "dnn_io_proc.h" > > +#include "dnn_backend_common.h" > > +#include "libavutil/opt.h" > > +#include "queue.h" > > +#include "safe_queue.h" > > +} > > + > > +typedef struct THOptions{ > > + char *device_name; > > + int optimize; > > +} THOptions; > > + > > +typedef struct THContext { > > + const AVClass *c_class; > > + THOptions options; > > +} THContext; > > + > > +typedef struct THModel { > > + THContext ctx; > > + DNNModel *model; > > + torch::jit::Module *jit_model; > > + SafeQueue *request_queue; > > + Queue *task_queue; > > + Queue *lltask_queue; > > +} THModel; > > + > > +typedef struct THInferRequest { > > + torch::Tensor *output; > > + torch::Tensor *input_tensor; > > +} THInferRequest; > > + > > +typedef struct THRequestItem { > > + THInferRequest *infer_request; > > + LastLevelTaskItem *lltask; > > + DNNAsyncExecModule exec_module; > > +} THRequestItem; > > + > > + > > +#define OFFSET(x) offsetof(THContext, x) > > +#define FLAGS AV_OPT_FLAG_FILTERING_PARAM > > +static const AVOption dnn_th_options[] = { > > + { "device", "device to run model", OFFSET(options.device_name), > > AV_OPT_TYPE_STRING, { .str = "cpu" }, 0, 0, FLAGS }, > > + { "optimize", "turn on graph executor optimization", > > OFFSET(options.optimize), AV_OPT_TYPE_INT, { .i64 = 0 }, 0, 1, FLAGS}, > > + { NULL } > > +}; > > + > > +AVFILTER_DEFINE_CLASS(dnn_th); > > + > > +static int extract_lltask_from_task(TaskItem *task, Queue > > *lltask_queue) > > +{ > > + THModel *th_model = (THModel *)task->model; > > + THContext *ctx = &th_model->ctx; > > + LastLevelTaskItem *lltask = (LastLevelTaskItem > > *)av_malloc(sizeof(*lltask)); > > + if (!lltask) { > > + av_log(ctx, AV_LOG_ERROR, "Failed to allocate memory for > > LastLevelTaskItem\n"); > > + return AVERROR(ENOMEM); > > + } > > + task->inference_todo = 1; > > + task->inference_done = 0; > > + lltask->task = task; > > + if (ff_queue_push_back(lltask_queue, lltask) < 0) { > > + av_log(ctx, AV_LOG_ERROR, "Failed to push back > > lltask_queue.\n"); > > + av_freep(&lltask); > > + return AVERROR(ENOMEM); > > + } > > + return 0; > > +} > > + > > +static void th_free_request(THInferRequest *request) > > +{ > > + if (!request) > > + return; > > + if (request->output) { > > + delete(request->output); > > + request->output = NULL; > > + } > > + if 
(request->input_tensor) { > > + delete(request->input_tensor); > > + request->input_tensor = NULL; > > + } > > + return; > > +} > > + > > +static inline void destroy_request_item(THRequestItem **arg) > > +{ > > + THRequestItem *item; > > + if (!arg || !*arg) { > > + return; > > + } > > + item = *arg; > > + th_free_request(item->infer_request); > > + av_freep(&item->infer_request); > > + av_freep(&item->lltask); > > + ff_dnn_async_module_cleanup(&item->exec_module); > > + av_freep(arg); > > +} > > + > > +static void dnn_free_model_th(DNNModel **model) > > +{ > > + THModel *th_model; > > + if (!model || !*model) > > + return; > > + > > + th_model = (THModel *) (*model)->model; > > + while (ff_safe_queue_size(th_model->request_queue) != 0) { > > + THRequestItem *item = (THRequestItem > > *)ff_safe_queue_pop_front(th_model->request_queue); > > + destroy_request_item(&item); > > + } > > + ff_safe_queue_destroy(th_model->request_queue); > > + > > + while (ff_queue_size(th_model->lltask_queue) != 0) { > > + LastLevelTaskItem *item = (LastLevelTaskItem > > *)ff_queue_pop_front(th_model->lltask_queue); > > + av_freep(&item); > > + } > > + ff_queue_destroy(th_model->lltask_queue); > > + > > + while (ff_queue_size(th_model->task_queue) != 0) { > > + TaskItem *item = (TaskItem > > *)ff_queue_pop_front(th_model->task_queue); > > + av_frame_free(&item->in_frame); > > + av_frame_free(&item->out_frame); > > + av_freep(&item); > > + } > > + ff_queue_destroy(th_model->task_queue); > > + delete th_model->jit_model; > > + av_opt_free(&th_model->ctx); > > + av_freep(&th_model); > > + av_freep(model); > > +} > > + > > +static int get_input_th(void *model, DNNData *input, const char > > *input_name) > > +{ > > + input->dt = DNN_FLOAT; > > + input->order = DCO_RGB; > > + input->layout = DL_NCHW; > > + input->dims[0] = 1; > > + input->dims[1] = 3; > > + input->dims[2] = -1; > > + input->dims[3] = -1; > > + return 0; > > +} > > + > > +static void deleter(void *arg) > > +{ > > + av_freep(&arg); > > +} > > + > > +static int fill_model_input_th(THModel *th_model, THRequestItem > > *request) > > +{ > > + LastLevelTaskItem *lltask = NULL; > > + TaskItem *task = NULL; > > + THInferRequest *infer_request = NULL; > > + DNNData input = { 0 }; > > + THContext *ctx = &th_model->ctx; > > + int ret, width_idx, height_idx, channel_idx; > > + > > + lltask = (LastLevelTaskItem > > *)ff_queue_pop_front(th_model->lltask_queue); > > + if (!lltask) { > > + ret = AVERROR(EINVAL); > > + goto err; > > + } > > + request->lltask = lltask; > > + task = lltask->task; > > + infer_request = request->infer_request; > > + > > + ret = get_input_th(th_model, &input, NULL); > > + if ( ret != 0) { > > + goto err; > > + } > > + width_idx = dnn_get_width_idx_by_layout(input.layout); > > + height_idx = dnn_get_height_idx_by_layout(input.layout); > > + channel_idx = dnn_get_channel_idx_by_layout(input.layout); > > + input.dims[height_idx] = task->in_frame->height; > > + input.dims[width_idx] = task->in_frame->width; > > + input.data = av_malloc(input.dims[height_idx] * > > input.dims[width_idx] * > > + input.dims[channel_idx] * sizeof(float)); > > + if (!input.data) > > + return AVERROR(ENOMEM); > > + infer_request->input_tensor = new torch::Tensor(); > > + infer_request->output = new torch::Tensor(); > > + > > + switch (th_model->model->func_type) { > > + case DFT_PROCESS_FRAME: > > + input.scale = 255; > > + if (task->do_ioproc) { > > + if (th_model->model->frame_pre_proc != NULL) { > > + th_model->model->frame_pre_proc(task->in_frame, > > &input, 
th_model->model->filter_ctx); > > + } else { > > + ff_proc_from_frame_to_dnn(task->in_frame, &input, ctx); > > + } > > + } > > + break; > > + default: > > + avpriv_report_missing_feature(NULL, "model function type %d", > > th_model->model->func_type); > > + break; > > + } > > + *infer_request->input_tensor = torch::from_blob(input.data, > > + {1, 1, input.dims[channel_idx], input.dims[height_idx], > > input.dims[width_idx]}, > > + deleter, torch::kFloat32); > > + return 0; > > + > > +err: > > + th_free_request(infer_request); > > + return ret; > > +} > > + > > +static int th_start_inference(void *args) > > +{ > > + THRequestItem *request = (THRequestItem *)args; > > + THInferRequest *infer_request = NULL; > > + LastLevelTaskItem *lltask = NULL; > > + TaskItem *task = NULL; > > + THModel *th_model = NULL; > > + THContext *ctx = NULL; > > + std::vector inputs; > > + torch::NoGradGuard no_grad; > > + > > + if (!request) { > > + av_log(NULL, AV_LOG_ERROR, "THRequestItem is NULL\n"); > > + return AVERROR(EINVAL); > > + } > > + infer_request = request->infer_request; > > + lltask = request->lltask; > > + task = lltask->task; > > + th_model = (THModel *)task->model; > > + ctx = &th_model->ctx; > > + > > + if (ctx->options.optimize) > > + torch::jit::setGraphExecutorOptimize(true); > > + else > > + torch::jit::setGraphExecutorOptimize(false); > > + > > + if (!infer_request->input_tensor || !infer_request->output) { > > + av_log(ctx, AV_LOG_ERROR, "input or output tensor is NULL\n"); > > + return DNN_GENERIC_ERROR; > > + } > > + inputs.push_back(*infer_request->input_tensor); > > + > > + *infer_request->output = > > th_model->jit_model->forward(inputs).toTensor(); > > + > > + return 0; > > +} > > + > > +static void infer_completion_callback(void *args) { > > + THRequestItem *request = (THRequestItem*)args; > > + LastLevelTaskItem *lltask = request->lltask; > > + TaskItem *task = lltask->task; > > + DNNData outputs = { 0 }; > > + THInferRequest *infer_request = request->infer_request; > > + THModel *th_model = (THModel *)task->model; > > + torch::Tensor *output = infer_request->output; > > + > > + c10::IntArrayRef sizes = output->sizes(); > > + outputs.order = DCO_RGB; > > + outputs.layout = DL_NCHW; > > + outputs.dt = DNN_FLOAT; > > + if (sizes.size() == 5) { > > + // 5 dimensions: [batch_size, frame_nubmer, channel, height, > > width] > > + // this format of data is normally used for video frame SR > > + outputs.dims[0] = sizes.at(0); // N > > + outputs.dims[1] = sizes.at(2); // C > > + outputs.dims[2] = sizes.at(3); // H > > + outputs.dims[3] = sizes.at(4); // W > > + } else { > > + avpriv_report_missing_feature(&th_model->ctx, "Support of this > > kind of model"); > > + goto err; > > + } > > + > > + switch (th_model->model->func_type) { > > + case DFT_PROCESS_FRAME: > > + if (task->do_ioproc) { > > + outputs.scale = 255; > > + outputs.data = output->data_ptr(); > > + if (th_model->model->frame_post_proc != NULL) { > > + th_model->model->frame_post_proc(task->out_frame, > > &outputs, th_model->model->filter_ctx); > > + } else { > > + ff_proc_from_dnn_to_frame(task->out_frame, &outputs, > > &th_model->ctx); > > + } > > + } else { > > + task->out_frame->width = > > outputs.dims[dnn_get_width_idx_by_layout(outputs.layout)]; > > + task->out_frame->height = > > outputs.dims[dnn_get_height_idx_by_layout(outputs.layout)]; > > + } > > + break; > > + default: > > + avpriv_report_missing_feature(&th_model->ctx, "model function > > type %d", th_model->model->func_type); > > + goto err; > > + } > > + 
task->inference_done++; > > + av_freep(&request->lltask); > > +err: > > + th_free_request(infer_request); > > + > > + if (ff_safe_queue_push_back(th_model->request_queue, request) < 0) > > { > > + destroy_request_item(&request); > > + av_log(&th_model->ctx, AV_LOG_ERROR, "Unable to push back > > request_queue when failed to start inference.\n"); > > + } > > +} > > + > > +static int execute_model_th(THRequestItem *request, Queue > > *lltask_queue) > > +{ > > + THModel *th_model = NULL; > > + LastLevelTaskItem *lltask; > > + TaskItem *task = NULL; > > + int ret = 0; > > + > > + if (ff_queue_size(lltask_queue) == 0) { > > + destroy_request_item(&request); > > + return 0; > > + } > > + > > + lltask = (LastLevelTaskItem *)ff_queue_peek_front(lltask_queue); > > + if (lltask == NULL) { > > + av_log(NULL, AV_LOG_ERROR, "Failed to get > > LastLevelTaskItem\n"); > > + ret = AVERROR(EINVAL); > > + goto err; > > + } > > + task = lltask->task; > > + th_model = (THModel *)task->model; > > + > > + ret = fill_model_input_th(th_model, request); > > + if ( ret != 0) { > > + goto err; > > + } > > + if (task->async) { > > + avpriv_report_missing_feature(&th_model->ctx, "LibTorch > > async"); > > + } else { > > + ret = th_start_inference((void *)(request)); > > + if (ret != 0) { > > + goto err; > > + } > > + infer_completion_callback(request); > > + return (task->inference_done == task->inference_todo) ? 0 : > > DNN_GENERIC_ERROR; > > + } > > + > > +err: > > + th_free_request(request->infer_request); > > + if (ff_safe_queue_push_back(th_model->request_queue, request) < 0) > > { > > + destroy_request_item(&request); > > + } > > + return ret; > > +} > > + > > +static int get_output_th(void *model, const char *input_name, int > > input_width, int input_height, > > + const char *output_name, int > > *output_width, int *output_height) > > +{ > > + int ret = 0; > > + THModel *th_model = (THModel*) model; > > + THContext *ctx = &th_model->ctx; > > + TaskItem task = { 0 }; > > + THRequestItem *request = NULL; > > + DNNExecBaseParams exec_params = { > > + .input_name = input_name, > > + .output_names = &output_name, > > + .nb_output = 1, > > + .in_frame = NULL, > > + .out_frame = NULL, > > + }; > > + ret = ff_dnn_fill_gettingoutput_task(&task, &exec_params, > > th_model, input_height, input_width, ctx); > > + if ( ret != 0) { > > + goto err; > > + } > > + > > + ret = extract_lltask_from_task(&task, th_model->lltask_queue); > > + if ( ret != 0) { > > + av_log(ctx, AV_LOG_ERROR, "unable to extract last level task > > from task.\n"); > > + goto err; > > + } > > + > > + request = (THRequestItem*) > > ff_safe_queue_pop_front(th_model->request_queue); > > + if (!request) { > > + av_log(ctx, AV_LOG_ERROR, "unable to get infer request.\n"); > > + ret = AVERROR(EINVAL); > > + goto err; > > + } > > + > > + ret = execute_model_th(request, th_model->lltask_queue); > > + *output_width = task.out_frame->width; > > + *output_height = task.out_frame->height; > > + > > +err: > > + av_frame_free(&task.out_frame); > > + av_frame_free(&task.in_frame); > > + return ret; > > +} > > + > > +static THInferRequest *th_create_inference_request(void) > > +{ > > + THInferRequest *request = (THInferRequest > > *)av_malloc(sizeof(THInferRequest)); > > + if (!request) { > > + return NULL; > > + } > > + request->input_tensor = NULL; > > + request->output = NULL; > > + return request; > > +} > > + > > +static DNNModel *dnn_load_model_th(const char *model_filename, > > DNNFunctionType func_type, const char *options, AVFilterContext > > *filter_ctx) > > 
+{ > > + DNNModel *model = NULL; > > + THModel *th_model = NULL; > > + THRequestItem *item = NULL; > > + THContext *ctx; > > + > > + model = (DNNModel *)av_mallocz(sizeof(DNNModel)); > > + if (!model) { > > + return NULL; > > + } > > + > > + th_model = (THModel *)av_mallocz(sizeof(THModel)); > > + if (!th_model) { > > + av_freep(&model); > > + return NULL; > > + } > > + th_model->model = model; > > + model->model = th_model; > > + th_model->ctx.c_class = &dnn_th_class; > > + ctx = &th_model->ctx; > > + //parse options > > + av_opt_set_defaults(ctx); > > + if (av_opt_set_from_string(ctx, options, NULL, "=", "&") < 0) { > > + av_log(ctx, AV_LOG_ERROR, "Failed to parse options \"%s\"\n", > > options); > > + return NULL; > > + } > > + > > + c10::Device device = c10::Device(ctx->options.device_name); > > + if (!device.is_cpu()) { > > + av_log(ctx, AV_LOG_ERROR, "Not supported device:\"%s\"\n", > > ctx->options.device_name); > > + goto fail; > > + } > > + > > + try { > > + th_model->jit_model = new torch::jit::Module; > > + (*th_model->jit_model) = torch::jit::load(model_filename); > > + } catch (const c10::Error& e) { > > + av_log(ctx, AV_LOG_ERROR, "Failed to load torch model\n"); > > + goto fail; > > + } > > + > > + th_model->request_queue = ff_safe_queue_create(); > > + if (!th_model->request_queue) { > > + goto fail; > > + } > > + > > + item = (THRequestItem *)av_mallocz(sizeof(THRequestItem)); > > + if (!item) { > > + goto fail; > > + } > > + item->lltask = NULL; > > + item->infer_request = th_create_inference_request(); > > + if (!item->infer_request) { > > + av_log(NULL, AV_LOG_ERROR, "Failed to allocate memory for > > Torch inference request\n"); > > + goto fail; > > + } > > + item->exec_module.start_inference = &th_start_inference; > > + item->exec_module.callback = &infer_completion_callback; > > + item->exec_module.args = item; > > + > > + if (ff_safe_queue_push_back(th_model->request_queue, item) < 0) { > > + goto fail; > > + } > > + item = NULL; > > + > > + th_model->task_queue = ff_queue_create(); > > + if (!th_model->task_queue) { > > + goto fail; > > + } > > + > > + th_model->lltask_queue = ff_queue_create(); > > + if (!th_model->lltask_queue) { > > + goto fail; > > + } > > + > > + model->get_input = &get_input_th; > > + model->get_output = &get_output_th; > > + model->options = NULL; > > + model->filter_ctx = filter_ctx; > > + model->func_type = func_type; > > + return model; > > + > > +fail: > > + if (item) { > > + destroy_request_item(&item); > > + av_freep(&item); > > + } > > + dnn_free_model_th(&model); > > + return NULL; > > +} > > + > > +static int dnn_execute_model_th(const DNNModel *model, > > DNNExecBaseParams *exec_params) > > +{ > > + THModel *th_model = (THModel *)model->model; > > + THContext *ctx = &th_model->ctx; > > + TaskItem *task; > > + THRequestItem *request; > > + int ret = 0; > > + > > + ret = ff_check_exec_params(ctx, DNN_TH, model->func_type, > > exec_params); > > + if (ret != 0) { > > + av_log(ctx, AV_LOG_ERROR, "exec parameter checking fail.\n"); > > + return ret; > > + } > > + > > + task = (TaskItem *)av_malloc(sizeof(TaskItem)); > > + if (!task) { > > + av_log(ctx, AV_LOG_ERROR, "unable to alloc memory for task > > item.\n"); > > + return AVERROR(ENOMEM); > > + } > > + > > + ret = ff_dnn_fill_task(task, exec_params, th_model, 0, 1); > > + if (ret != 0) { > > + av_freep(&task); > > + av_log(ctx, AV_LOG_ERROR, "unable to fill task.\n"); > > + return ret; > > + } > > + > > + ret = ff_queue_push_back(th_model->task_queue, task); > > + if (ret < 0) { > > 
+ av_freep(&task); > > + av_log(ctx, AV_LOG_ERROR, "unable to push back task_queue.\n"); > > + return ret; > > + } > > + > > + ret = extract_lltask_from_task(task, th_model->lltask_queue); > > + if (ret != 0) { > > + av_log(ctx, AV_LOG_ERROR, "unable to extract last level task > > from task.\n"); > > + return ret; > > + } > > + > > + request = (THRequestItem > > *)ff_safe_queue_pop_front(th_model->request_queue); > > + if (!request) { > > + av_log(ctx, AV_LOG_ERROR, "unable to get infer request.\n"); > > + return AVERROR(EINVAL); > > + } > > + > > + return execute_model_th(request, th_model->lltask_queue); > > +} > > + > > +static DNNAsyncStatusType dnn_get_result_th(const DNNModel *model, > > AVFrame **in, AVFrame **out) > > +{ > > + THModel *th_model = (THModel *)model->model; > > + return ff_dnn_get_result_common(th_model->task_queue, in, out); > > +} > > + > > +static int dnn_flush_th(const DNNModel *model) > > +{ > > + THModel *th_model = (THModel *)model->model; > > + THRequestItem *request; > > + > > + if (ff_queue_size(th_model->lltask_queue) == 0) > > + // no pending task need to flush > > + return 0; > > + > > + request = (THRequestItem > > *)ff_safe_queue_pop_front(th_model->request_queue); > > + if (!request) { > > + av_log(&th_model->ctx, AV_LOG_ERROR, "unable to get infer > > request.\n"); > > + return AVERROR(EINVAL); > > + } > > + > > + return execute_model_th(request, th_model->lltask_queue); > > +} > > + > > +extern const DNNModule ff_dnn_backend_torch = { > > + .load_model = dnn_load_model_th, > > + .execute_model = dnn_execute_model_th, > > + .get_result = dnn_get_result_th, > > + .flush = dnn_flush_th, > > + .free_model = dnn_free_model_th, > > +}; > > diff --git a/libavfilter/dnn/dnn_interface.c > > b/libavfilter/dnn/dnn_interface.c > > index e843826aa6..b9f71aea53 100644 > > --- a/libavfilter/dnn/dnn_interface.c > > +++ b/libavfilter/dnn/dnn_interface.c > > @@ -28,6 +28,7 @@ > > > > extern const DNNModule ff_dnn_backend_openvino; > > extern const DNNModule ff_dnn_backend_tf; > > +extern const DNNModule ff_dnn_backend_torch; > > > > const DNNModule *ff_get_dnn_module(DNNBackendType backend_type, > void > > *log_ctx) > > { > > @@ -40,6 +41,10 @@ const DNNModule > *ff_get_dnn_module(DNNBackendType > > backend_type, void *log_ctx) > > case DNN_OV: > > return &ff_dnn_backend_openvino; > > #endif > > + #if (CONFIG_LIBTORCH == 1) > > + case DNN_TH: > > + return &ff_dnn_backend_torch; > > + #endif > > default: > > av_log(log_ctx, AV_LOG_ERROR, > > "Module backend_type %d is not supported or > > enabled.\n", > > diff --git a/libavfilter/dnn_filter_common.c > > b/libavfilter/dnn_filter_common.c > > index f012d450a2..7d194c9ade 100644 > > --- a/libavfilter/dnn_filter_common.c > > +++ b/libavfilter/dnn_filter_common.c > > @@ -53,12 +53,22 @@ static char **separate_output_names(const char > > *expr, const char *val_sep, int * > > > > int ff_dnn_init(DnnContext *ctx, DNNFunctionType func_type, > > AVFilterContext *filter_ctx) > > { > > + DNNBackendType backend = ctx->backend_type; > > + > > if (!ctx->model_filename) { > > av_log(filter_ctx, AV_LOG_ERROR, "model file for network is > > not specified\n"); > > return AVERROR(EINVAL); > > } > > > > - if (ctx->backend_type == DNN_TF) { > > + if (backend == DNN_TH) { > > + if (ctx->model_inputname) > > + av_log(filter_ctx, AV_LOG_WARNING, "LibTorch backend do > > not require inputname, "\ > > + "inputname will be > > ignored.\n"); > > + if (ctx->model_outputnames) > > + av_log(filter_ctx, AV_LOG_WARNING, "LibTorch backend do > > not 
require outputname(s), "\ > > + "all outputname(s) will > > be ignored.\n"); > > + ctx->nb_outputs = 1; > > + } else if (backend == DNN_TF) { > > if (!ctx->model_inputname) { > > av_log(filter_ctx, AV_LOG_ERROR, "input name of the model > > network is not specified\n"); > > return AVERROR(EINVAL); > > @@ -115,7 +125,8 @@ int ff_dnn_get_input(DnnContext *ctx, DNNData > > *input) > > > > int ff_dnn_get_output(DnnContext *ctx, int input_width, int > > input_height, int *output_width, int *output_height) > > { > > - char * output_name = ctx->model_outputnames ? > > ctx->model_outputnames[0] : NULL; > > + char * output_name = ctx->model_outputnames && ctx->backend_type > > != DNN_TH ? > > + ctx->model_outputnames[0] : NULL; > > return ctx->model->get_output(ctx->model->model, > > ctx->model_inputname, input_width, input_height, > > (const char *)output_name, > > output_width, output_height); > > } > > diff --git a/libavfilter/dnn_interface.h b/libavfilter/dnn_interface.h > > index 852d88baa8..63f492e690 100644 > > --- a/libavfilter/dnn_interface.h > > +++ b/libavfilter/dnn_interface.h > > @@ -32,7 +32,7 @@ > > > > #define DNN_GENERIC_ERROR FFERRTAG('D','N','N','!') > > > > -typedef enum {DNN_TF = 1, DNN_OV} DNNBackendType; > > +typedef enum {DNN_TF = 1, DNN_OV, DNN_TH} DNNBackendType; > > > > typedef enum {DNN_FLOAT = 1, DNN_UINT8 = 4} DNNDataType; > > > > diff --git a/libavfilter/vf_dnn_processing.c > > b/libavfilter/vf_dnn_processing.c > > index e7d21eef32..fdac31665e 100644 > > --- a/libavfilter/vf_dnn_processing.c > > +++ b/libavfilter/vf_dnn_processing.c > > @@ -50,6 +50,9 @@ static const AVOption dnn_processing_options[] = { > > #endif > > #if (CONFIG_LIBOPENVINO == 1) > > { "openvino", "openvino backend flag", 0, > > AV_OPT_TYPE_CONST, { .i64 = DNN_OV }, 0, 0, FLAGS, .unit = > > "backend" }, > > +#endif > > +#if (CONFIG_LIBTORCH == 1) > > + { "torch", "torch backend flag", 0, > > AV_OPT_TYPE_CONST, { .i64 = DNN_TH }, 0, 0, FLAGS, > > "backend" }, > > #endif > > DNN_COMMON_OPTIONS > > { NULL } > > -- > > 2.34.1 > > > > _______________________________________________ > > ffmpeg-devel mailing list > > ffmpeg-devel@ffmpeg.org > > https://ffmpeg.org/mailman/listinfo/ffmpeg-devel > > > > To unsubscribe, visit link above, or email > > ffmpeg-devel-request@ffmpeg.org with subject "unsubscribe". > > -- > Jean-Baptiste Kempf - President > +33 672 704 734 > _______________________________________________ > ffmpeg-devel mailing list > ffmpeg-devel@ffmpeg.org > https://ffmpeg.org/mailman/listinfo/ffmpeg-devel > > To unsubscribe, visit link above, or email > ffmpeg-devel-request@ffmpeg.org with subject "unsubscribe". _______________________________________________ ffmpeg-devel mailing list ffmpeg-devel@ffmpeg.org https://ffmpeg.org/mailman/listinfo/ffmpeg-devel To unsubscribe, visit link above, or email ffmpeg-devel-request@ffmpeg.org with subject "unsubscribe".