From: Michael Niedermayer <michael@niedermayer.cc>
To: FFmpeg development discussions and patches <ffmpeg-devel@ffmpeg.org>
Subject: Re: [FFmpeg-devel] [PATCH 000/279 v2] New channel layout API
Date: Sun, 19 Dec 2021 13:51:45 +0100
Message-ID: <20211219125145.GO2829255@pb2> (raw)
In-Reply-To: <31397d1-438-8bfd-5a79-a07a67982559@passwd.hu>
[-- Attachment #1.1: Type: text/plain, Size: 6348 bytes --]
On Sun, Dec 19, 2021 at 12:35:11PM +0100, Marton Balint wrote:
>
>
> On Sat, 18 Dec 2021, Michael Niedermayer wrote:
>
> > On Sat, Dec 18, 2021 at 02:36:12PM +0100, Michael Niedermayer wrote:
> > > On Fri, Dec 17, 2021 at 07:04:08PM +0100, Marton Balint wrote:
> > > >
> > > >
> > > > On Fri, 17 Dec 2021, Michael Niedermayer wrote:
> > > >
> > > > > On Fri, Dec 17, 2021 at 01:04:19AM +0100, Marton Balint wrote:
> > > > > >
> > > > > >
> > > > > > On Thu, 16 Dec 2021, James Almer wrote:
> > > > > >
> > > > > > > Resending the first two patches only, since this is meant to
> > > > > > > show the implementation of one of the several suggestions made
> > > > > > > in the previous set that need to be discussed and hopefully
> > > > > > > resolved in a call.
> > > > > >
> > > > > > Can you push the full branch somewhere?
> > > > > >
> > > > > > >
> > > > > > > The proposals so far to extend the API to support either custom
> > > > > > > labels for channels are, or some form of extra user information.
> > > > > > >
> > > > > > > - Fixed array of bytes to hold a label. Simple solution, but
> > > > > > > the labels will have a hard limit that can only be extended
> > > > > > > with a major bump. This is what i implemented in this version.
> > > > > > > - "char *name" per channel that the user may allocate and the
> > > > > > > API will manage, duplicate and free. Simple solution, and the
> > > > > > > name can be arbitrarily long, but inefficient (av_strdup() per
> > > > > > > channel with a custom label on layout copy).
> > > > > > > - "const char *name" per channel for compile time constants, or
> > > > > > > that the user may allocate and free. Very efficient, but for
> > > > > > > non compile time strings ensuring they outlive the layout can
> > > > > > > be tricky.
> > > > > > > - Refcounted AVChannelCustom with a dictionary. This can't be
> > > > > > > done with AVBufferRef, so it would require some other form
> > > > > > > of reference counting. And a dictionary may add quite a bit of
> > > > > > > complexity to the API, as you can set anything on them.
> > > > > >
> > > > > > Until we have proper refcounting API we can make the AVBufferRef in
> > > > > > AVChannelLayout a void *, and only allow channel_layout functions to
> > > > > > dereference it as an AVBufferRef. This would mean adding some extra helper
> > > > > > functions to channel layout, but overall it is not unsolvable.
> > > > > >
> > > > > > The real question is that if you want to use refcounting and add helpers to
> > > > > > query / replace per-channel metadata, or you find the idea too heavy weight
> > > > > > and would like to stick to flat structs.
> > > > >
> > > > > what is the advantage of refcounting for channel metadata ?
> > > > > is it about the used memory, about the reduced need to copy ?
> > > >
> > > > Basicly it is the ability to store per-channel metadata in avdictionary,
> > > > because otherwise it would have to be copyed, and avdictionary is very
> > > > ineffective at copying because of many mallocs.
> > > >
> > > > >
> > > > > what kind of metadata and what size do you expect ?
> > > > > bytes, kilobytes, megabytes, gigabytes per channel ?
> > > >
> > > > Usually, nothing, because most format don't have support for per-channel
> > > > metadata. In some cases it is going to be a couple of textual metadata
> > > > key-value pairs, such as language, label, group, speaker, positon, so 4-5
> > > > dynamically allocated string pairs, plus the AVDictionary itself, multiplied
> > > > by the number of channels in a layout.
> > > >
> > > > >
> > > > > what is the overhead for dynamic allocation and ref counting?
> > > > > that is at which point does it even make sense ?
> > > >
> > > > I don't have exact measurements. It is generally felt that copying
> > > > AVDictionary per-channel is a huge overhead for something as lightweight as
> > > > an audio frame which is a 2-4 kB per channel at most and only a couple of
> > > > allocs usually not dependant on the number of channels. That's why
> > > > refcounting was proposed.
> > >
> > > I was thinking more at a AVStream / AVCodecParameters level.
> >
> > > How will a demuxer transport such metadata over a AVPacket into a decoder
> > > outputting metadata-filled AVFrames?
> >
> > or is this never needed ?
>
> I am not sure I understand. Usually metadata is passed from demuxer to
> decoder by avcodec_parameters_to_context(), this is used for all metadata
> which is in AVCodecParameters.
>
> For per-packet metadata ff_decode_frame_props() has some automatic packet
> side data -> frame side data transfer.
>
> AVStream side data may be transferred to AVPacket side data if
> av_format_inject_global_side_data() is used, but it is not enabled by
> default.
The sidedata in AVPacket is not channel specific, the data in AVFrames new
channels is.
The later is as you wrote expensive to copy/alloc so it needs ref counting
what i was trying to point to was that if we need a way to transfer this
data on a per packet base from demuxer forward then we need a flat format.
At least with the current AVPackets
And a flat format is much lighter to copy around so then one is pushed
toward the question, "should that be used for AVFrames too?"
a small <10kb audio frame with 8 audio channels and a ref counted dictionary
or whatever per each of the 8 channels vs. a flat description.
an extra is that a flat format is very simple to memcmp() to check if it
changed.
I have no position in this above, iam just wanting to make sure nothing
is missed in this somewhat rushed design.
also theres yet another orthogonal aspect which has been missed i think
This per channel description whatever its form will be is usefull for another
case. And that are the video/image channels. generally this is
red/green/blue/(alpha) but there is material with other channels,
IR/NIR/UV/... and i imagine probably depth from some lidar/radar system
Ideally such a future image channel API can share parts with the audio one
thx
[...]
--
Michael GnuPG fingerprint: 9FF2128B147EF6730BADF133611EC787040B0FAB
Opposition brings concord. Out of discord comes the fairest harmony.
-- Heraclitus
[-- Attachment #1.2: signature.asc --]
[-- Type: application/pgp-signature, Size: 195 bytes --]
[-- Attachment #2: Type: text/plain, Size: 251 bytes --]
_______________________________________________
ffmpeg-devel mailing list
ffmpeg-devel@ffmpeg.org
https://ffmpeg.org/mailman/listinfo/ffmpeg-devel
To unsubscribe, visit link above, or email
ffmpeg-devel-request@ffmpeg.org with subject "unsubscribe".
prev parent reply other threads:[~2021-12-19 12:51 UTC|newest]
Thread overview: 20+ messages / expand[flat|nested] mbox.gz Atom feed top
2021-12-16 13:21 James Almer
2021-12-16 13:21 ` [FFmpeg-devel] [PATCH 001/279 v2] Add a new " James Almer
2021-12-16 17:20 ` Paul B Mahol
2021-12-16 18:27 ` James Almer
2021-12-16 18:31 ` Paul B Mahol
2021-12-16 19:14 ` James Almer
2021-12-16 23:27 ` Marton Balint
2021-12-17 2:34 ` James Almer
2021-12-17 12:43 ` James Almer
2021-12-16 13:21 ` [FFmpeg-devel] [PATCH 002/279 v2] fate: add a channel_layout API test James Almer
2021-12-17 0:04 ` [FFmpeg-devel] [PATCH 000/279 v2] New channel layout API Marton Balint
2021-12-17 2:37 ` James Almer
2021-12-17 19:20 ` Marton Balint
2021-12-17 19:32 ` James Almer
2021-12-17 11:24 ` Michael Niedermayer
2021-12-17 18:04 ` Marton Balint
2021-12-18 13:36 ` Michael Niedermayer
2021-12-18 14:15 ` Michael Niedermayer
2021-12-19 11:35 ` Marton Balint
2021-12-19 12:51 ` Michael Niedermayer [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20211219125145.GO2829255@pb2 \
--to=michael@niedermayer.cc \
--cc=ffmpeg-devel@ffmpeg.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Git Inbox Mirror of the ffmpeg-devel mailing list - see https://ffmpeg.org/mailman/listinfo/ffmpeg-devel
This inbox may be cloned and mirrored by anyone:
git clone --mirror https://master.gitmailbox.com/ffmpegdev/0 ffmpegdev/git/0.git
# If you have public-inbox 1.1+ installed, you may
# initialize and index your mirror using the following commands:
public-inbox-init -V2 ffmpegdev ffmpegdev/ https://master.gitmailbox.com/ffmpegdev \
ffmpegdev@gitmailbox.com
public-inbox-index ffmpegdev
Example config snippet for mirrors.
AGPL code for this site: git clone https://public-inbox.org/public-inbox.git