From: "\"zhilizhao(赵志立)\"" <quinkblack@foxmail.com> To: FFmpeg development discussions and patches <ffmpeg-devel@ffmpeg.org> Subject: Re: [FFmpeg-devel] [PATCH v2 1/8] avformat/movenc: add PCM in mp4 support Date: Tue, 28 Feb 2023 12:41:22 +0800 Message-ID: <tencent_2113E9537E15C5E9196EA725561316884E05@qq.com> (raw) In-Reply-To: <CAEu79SZNoZ+k1atC8V9SOoVuo5ZE=L9SqHs0tO=aaq_suRXfMg@mail.gmail.com> > On Feb 28, 2023, at 04:24, Jan Ekström <jeebjp@gmail.com> wrote: > > On Fri, Feb 24, 2023 at 8:29 PM Zhao Zhili <quinkblack@foxmail.com> wrote: >> >> From: Zhao Zhili <zhilizhao@tencent.com> >> >> It's defined by ISO/IEC 23003-5. >> >> Fixes ticket #10185 >> >> Signed-off-by: Zhao Zhili <zhilizhao@tencent.com> > > Technically if you wanted to split these commits, then this > implementation would have to be limited to one audio channel streams, > as 23003-5:2020 > says as follows: > > - The ChannelLayout as defined in ISO/IEC 14496-12 is mandatory if the > PCM channel count is larger than one. OK. Will do a ping-pong with v3. > >> --- >> libavformat/movenc.c | 60 +++++++++++++++++++++++++++++++++++++++++++- >> 1 file changed, 59 insertions(+), 1 deletion(-) >> >> diff --git a/libavformat/movenc.c b/libavformat/movenc.c >> index c4fcb5f8b1..3315057b88 100644 >> --- a/libavformat/movenc.c >> +++ b/libavformat/movenc.c >> @@ -1179,6 +1179,47 @@ static int mov_write_btrt_tag(AVIOContext *pb, MOVTrack *track) >> return update_size(pb, pos); >> } >> >> +static int is_mp4_pcm_codec(enum AVCodecID codec) >> +{ >> + switch (codec) { >> + case AV_CODEC_ID_PCM_S16BE: >> + case AV_CODEC_ID_PCM_S16LE: >> + case AV_CODEC_ID_PCM_S24BE: >> + case AV_CODEC_ID_PCM_S24LE: >> + case AV_CODEC_ID_PCM_S32BE: >> + case AV_CODEC_ID_PCM_S32LE: >> + >> + case AV_CODEC_ID_PCM_F32BE: >> + case AV_CODEC_ID_PCM_F32LE: >> + case AV_CODEC_ID_PCM_F64BE: >> + case AV_CODEC_ID_PCM_F64LE: >> + return 1; >> + default: >> + return 0; >> + } >> +} > > This should be unnecessary if the check is switched to the codec_tags > in mov_write_audio_tag. > >> + >> +static int mov_write_pcmc_tag(AVFormatContext *s, AVIOContext *pb, MOVTrack *track) >> +{ >> + int64_t pos = avio_tell(pb); >> + int format_flags; >> + >> + avio_wb32(pb, 0); /* size */ >> + ffio_wfourcc(pb, "pcmC"); >> + avio_wb32(pb, 0); /* version & flags */ >> + >> + /* 0x01: indicates little-endian format */ >> + format_flags = (track->par->codec_id == AV_CODEC_ID_PCM_F32LE || >> + track->par->codec_id == AV_CODEC_ID_PCM_F64LE || >> + track->par->codec_id == AV_CODEC_ID_PCM_S16LE || >> + track->par->codec_id == AV_CODEC_ID_PCM_S24LE || >> + track->par->codec_id == AV_CODEC_ID_PCM_S32LE); >> + avio_w8(pb, format_flags); >> + avio_w8(pb, track->par->bits_per_raw_sample); >> + >> + return update_size(pb, pos); >> +} > > Generally looks good. I utilized av_get_exact_bits_per_sample, but > since mov_write_audio_tag also writes down bits_per_raw_sample, so I > think being consistent should be OK :) > >> + >> static int mov_write_audio_tag(AVFormatContext *s, AVIOContext *pb, MOVMuxContext *mov, MOVTrack *track) >> { >> int64_t pos = avio_tell(pb); >> @@ -1243,7 +1284,8 @@ static int mov_write_audio_tag(AVFormatContext *s, AVIOContext *pb, MOVMuxContex >> } else { /* reserved for mp4/3gp */ >> avio_wb16(pb, track->par->ch_layout.nb_channels); >> if (track->par->codec_id == AV_CODEC_ID_FLAC || >> - track->par->codec_id == AV_CODEC_ID_ALAC) { >> + track->par->codec_id == AV_CODEC_ID_ALAC || >> + is_mp4_pcm_codec(track->par->codec_id)) { > > I know why you think you should be doing this, but: > > 1. 14496-12:2022 still defines AudioSampleEntry::samplesize as > "template unsigned int(16) samplesize = 16;", so ISOBMFF itself only > lets you set 16 here. > 2. 23003-5:2020 does not allow other values to be written (the > downstream spec should explicitly allow writing of non-template > values). This is probably because pcmC itself contains PCM_sample_size > for this same use. > > So writing samplesize here is technically incorrect, even though I > know that most likely some implementations do this (technically > against these specs). Technically it’s against spec. However: 1. It lets old version of libavformat able to demux PCM in mp4. samplesize is used inside mov_parse_stsd_audio for AV_CODEC_ID_PCM. 2. samplesize is a fixed value in spec. If other mp4 demuxer implementations ignore this field, it’s fine. If they use samplesize, it’s better to set the real value. It’s a problem only if they reject samplesize != 16, which is unlikely. What do you think? > > Why channelcount was made non-template yet samplesize was not? A very > good question. Most likely the channel layout v1 and DRC boxes' > changes started requiring the former, but not the latter? > >> avio_wb16(pb, track->par->bits_per_raw_sample); >> } else { >> avio_wb16(pb, 16); >> @@ -1307,6 +1349,8 @@ static int mov_write_audio_tag(AVFormatContext *s, AVIOContext *pb, MOVMuxContex >> ret = mov_write_dmlp_tag(s, pb, track); >> else if (track->vos_len > 0) >> ret = mov_write_glbl_tag(pb, track); >> + else if (track->mode == MODE_MP4 && is_mp4_pcm_codec(track->par->codec_id)) >> + ret = mov_write_pcmc_tag(s, pb, track); > > It would seem that the generic vos_data extradata writing should be > the last else if in this logic? Could be unnecessary cargo culting, > but at least so far all new codec handling went before it? > > Personally I would have just based this check on the codec tag, a la > > else if (tag == MOV_MP4_IPCM_TAG || tag == MOV_MP4_FPCM_TAG) > // pcmC is required for these tags as per ISO/IEC 23003-5 > ret = mov_write_pcmc_tag(s, pb, track); Will update in v3. > > As that matches the specification's note: > > - Mandatory: Yes, if codingname of SampleEntry is ‘ipcm’ or ‘fpcm’. > > The codec tags are also only defined in codec_mp4_tags, so they would > only get utilized in MP4 and other formats which supposedly support > MP4 formats (ISMV, PSP), so no additional check for that should be > required. > > They already contain stuff such as AV_CODEC_ID_MPEGH_3D_AUDIO, so I > wouldn't consider this a problem :) . Mostly in the sense that if a > more limited set of codecs is wanted for ISMV/PSP, then they should > receive their own more limited listings and not share the mp4 tag > listing :) . > > If you wonder how much the codec tag listing gives you freedoms, you > can check the logic in libavformat/mux.c::validate_codec_tag. > >> >> if (ret < 0) >> return ret; >> @@ -7744,6 +7788,20 @@ static const AVCodecTag codec_mp4_tags[] = { >> { AV_CODEC_ID_MPEGH_3D_AUDIO, MKTAG('m', 'h', 'm', '1') }, >> { AV_CODEC_ID_TTML, MOV_MP4_TTML_TAG }, >> { AV_CODEC_ID_TTML, MOV_ISMV_TTML_TAG }, >> + >> + /* ISO/IEC 23003-5 integer formats */ >> + { AV_CODEC_ID_PCM_S16BE, MKTAG('i', 'p', 'c', 'm') }, >> + { AV_CODEC_ID_PCM_S16LE, MKTAG('i', 'p', 'c', 'm') }, >> + { AV_CODEC_ID_PCM_S24BE, MKTAG('i', 'p', 'c', 'm') }, >> + { AV_CODEC_ID_PCM_S24LE, MKTAG('i', 'p', 'c', 'm') }, >> + { AV_CODEC_ID_PCM_S32BE, MKTAG('i', 'p', 'c', 'm') }, >> + { AV_CODEC_ID_PCM_S32LE, MKTAG('i', 'p', 'c', 'm') }, >> + /* ISO/IEC 23003-5 floating-point formats */ >> + { AV_CODEC_ID_PCM_F32BE, MKTAG('f', 'p', 'c', 'm') }, >> + { AV_CODEC_ID_PCM_F32LE, MKTAG('f', 'p', 'c', 'm') }, >> + { AV_CODEC_ID_PCM_F64BE, MKTAG('f', 'p', 'c', 'm') }, >> + { AV_CODEC_ID_PCM_F64LE, MKTAG('f', 'p', 'c', 'm') }, >> + > > This listing seems alright. Personally I added a define for these two > identifiers, but either is fine. > >> { AV_CODEC_ID_NONE, 0 }, >> }; >> #if CONFIG_MP4_MUXER || CONFIG_PSP_MUXER >> -- >> 2.34.1 >> >> _______________________________________________ >> ffmpeg-devel mailing list >> ffmpeg-devel@ffmpeg.org >> https://ffmpeg.org/mailman/listinfo/ffmpeg-devel >> >> To unsubscribe, visit link above, or email >> ffmpeg-devel-request@ffmpeg.org with subject "unsubscribe". > _______________________________________________ > ffmpeg-devel mailing list > ffmpeg-devel@ffmpeg.org > https://ffmpeg.org/mailman/listinfo/ffmpeg-devel > > To unsubscribe, visit link above, or email > ffmpeg-devel-request@ffmpeg.org with subject "unsubscribe". _______________________________________________ ffmpeg-devel mailing list ffmpeg-devel@ffmpeg.org https://ffmpeg.org/mailman/listinfo/ffmpeg-devel To unsubscribe, visit link above, or email ffmpeg-devel-request@ffmpeg.org with subject "unsubscribe".
next prev parent reply other threads:[~2023-02-28 4:41 UTC|newest] Thread overview: 20+ messages / expand[flat|nested] mbox.gz Atom feed top [not found] <20230224182849.426345-1-quinkblack@foxmail.com> 2023-02-24 18:28 ` Zhao Zhili 2023-02-27 13:36 ` Tomas Härdin 2023-02-27 20:24 ` Jan Ekström 2023-02-28 4:41 ` "zhilizhao(赵志立)" [this message] 2023-02-24 18:28 ` [FFmpeg-devel] [PATCH v2 2/8] avformat/mov: check that pcmC box is of the expected type Zhao Zhili 2023-02-27 13:36 ` Tomas Härdin 2023-02-27 16:26 ` Jan Ekström 2023-03-05 22:00 ` Jan Ekström 2023-02-24 18:28 ` [FFmpeg-devel] [PATCH v2 3/8] avformat/mov: base the endianness on just the LSB Zhao Zhili 2023-02-27 13:36 ` Tomas Härdin 2023-03-05 22:01 ` Jan Ekström 2023-02-24 18:28 ` [FFmpeg-devel] [PATCH v2 4/8] avformat/mov: fix ISO/IEC 23003-5 support Zhao Zhili 2023-02-27 13:36 ` Tomas Härdin 2023-02-24 18:28 ` [FFmpeg-devel] [PATCH v2 5/8] avformat/isom_tags: remove ipcm from movaudio_tags Zhao Zhili 2023-02-27 13:36 ` Tomas Härdin 2023-02-24 18:28 ` [FFmpeg-devel] [PATCH v2 6/8] avformat/mov: parse ISO-14496-12 ChannelLayout Zhao Zhili 2023-02-27 13:36 ` Tomas Härdin 2023-02-24 18:28 ` [FFmpeg-devel] [PATCH v2 7/8] avformat/movenc: write ChannelLayout box for PCM Zhao Zhili 2023-02-27 13:36 ` Tomas Härdin 2023-02-24 18:28 ` [FFmpeg-devel] [PATCH v2 8/8] fate/mov: add PCM in mp4 test Zhao Zhili
Reply instructions: You may reply publicly to this message via plain-text email using any one of the following methods: * Save the following mbox file, import it into your mail client, and reply-to-all from there: mbox Avoid top-posting and favor interleaved quoting: https://en.wikipedia.org/wiki/Posting_style#Interleaved_style * Reply using the --to, --cc, and --in-reply-to switches of git-send-email(1): git send-email \ --in-reply-to=tencent_2113E9537E15C5E9196EA725561316884E05@qq.com \ --to=quinkblack@foxmail.com \ --cc=ffmpeg-devel@ffmpeg.org \ /path/to/YOUR_REPLY https://kernel.org/pub/software/scm/git/docs/git-send-email.html * If your mail client supports setting the In-Reply-To header via mailto: links, try the mailto: link
Git Inbox Mirror of the ffmpeg-devel mailing list - see https://ffmpeg.org/mailman/listinfo/ffmpeg-devel This inbox may be cloned and mirrored by anyone: git clone --mirror https://master.gitmailbox.com/ffmpegdev/0 ffmpegdev/git/0.git # If you have public-inbox 1.1+ installed, you may # initialize and index your mirror using the following commands: public-inbox-init -V2 ffmpegdev ffmpegdev/ https://master.gitmailbox.com/ffmpegdev \ ffmpegdev@gitmailbox.com public-inbox-index ffmpegdev Example config snippet for mirrors. AGPL code for this site: git clone https://public-inbox.org/public-inbox.git