From: Leo Izen <leo.izen@gmail.com>
To: ffmpeg-devel@ffmpeg.org
Subject: Re: [FFmpeg-devel] [PATCH] avcodec/jpegxl_parser: fix various memory issues
Date: Mon, 2 Oct 2023 19:41:19 -0400
Message-ID: <a83317e6-9f72-4dc4-bdc6-d87ab9d2bae6@gmail.com>
In-Reply-To: <AS8P250MB0744069D58B4F78F598043848FC5A@AS8P250MB0744.EURP250.PROD.OUTLOOK.COM>
On 10/2/23 16:40, Andreas Rheinhardt wrote:
> Leo Izen:
>> The spec caps the prefix alphabet size to 32768 (i.e. 1 << 15) so we
>> need to check for that and reject alphabets that are too large.
>
> No, we don't "need to", we can. FFmpeg is not a validator tool.
We need to because we risk over-allocating otherwise. If the signalled
value is far too large, we consume a pointlessly large amount of memory.
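
To make that concrete, here is a rough standalone sketch of the check (not
the patch itself). sketch_alphabet_size() and SKETCH_ERR_INVALIDDATA are
made-up names for illustration; the real code reads the fields with
get_bits() and returns AVERROR_INVALIDDATA. With a 4-bit n, the formula
1 + (1 << n) + extra can reach 65536, which is twice the spec limit.

```c
#include <stdint.h>

/* Hypothetical stand-in for AVERROR_INVALIDDATA, so the sketch compiles alone. */
#define SKETCH_ERR_INVALIDDATA (-1)
#define MAX_PREFIX_ALPHABET_SIZE (1u << 15)

/*
 * n is the 4-bit field (0..15) and extra is the following n-bit field,
 * both assumed to have been read from the bitstream already.
 * On success the size is stored in *out and 0 is returned; a size above
 * the cap is rejected before anything gets allocated for it.
 */
static int sketch_alphabet_size(unsigned n, uint32_t extra, uint32_t *out)
{
    uint32_t size = 1 + (UINT32_C(1) << n) + extra;

    if (size > MAX_PREFIX_ALPHABET_SIZE)
        return SKETCH_ERR_INVALIDDATA;

    *out = size;
    return 0;
}
```

Note that n == 15 can never pass, since 1 + (1 << 15) already exceeds the cap.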
>
>>
>> Additionally, there's no need to allocate buffers that are as large as
>> the maximum alphabet size as these aren't stack-allocated, they're heap
>> allocated and thus can be variable size.
>>
>> Added an overflow check as well, which fixes leaking the buffer, and
>> capping the alphabet size fixes two potential overruns as well.
>>
>> Fixes: out of array access
>> Fixes: 62089/clusterfuzz-testcase-minimized-ffmpeg_DEMUXER_fuzzer-5437089094959104.fuzz
>>
>> Found-by: continuous fuzzing process
>> https://github.com/google/oss-fuzz/tree/master/projects/ffmpeg
>> Found-by: Hardik Shah of Vehere (Dawn Treaders team)
>> Co-authored-by: Michael Niedermayer <michael@niedermayer.cc>
>> Signed-off-by: Leo Izen <leo.izen@gmail.com>
>> ---
>> libavcodec/jpegxl_parser.c | 23 +++++++++++++++++------
>> 1 file changed, 17 insertions(+), 6 deletions(-)
>>
>> diff --git a/libavcodec/jpegxl_parser.c b/libavcodec/jpegxl_parser.c
>> index d25a1b6e1d..51af0f4ed1 100644
>> --- a/libavcodec/jpegxl_parser.c
>> +++ b/libavcodec/jpegxl_parser.c
>> @@ -46,6 +46,8 @@
>> #define JXL_FLAG_USE_LF_FRAME 32
>> #define JXL_FLAG_SKIP_ADAPTIVE_LF_SMOOTH 128
>>
>> +#define MAX_PREFIX_ALPHABET_SIZE (1u << 15)
>> +
>> #define clog1p(x) (ff_log2(x) + !!(x))
>> #define unpack_signed(x) (((x) & 1 ? -(x)-1 : (x))/2)
>> #define div_ceil(x, y) (((x) - 1) / (y) + 1)
>> @@ -724,16 +726,17 @@ static int read_vlc_prefix(GetBitContext *gb, JXLEntropyDecoder *dec, JXLSymbolD
>> if (ret < 0)
>> goto end;
>>
>> - buf = av_calloc(1, 262148); // 32768 * 8 + 4
>> + buf = av_calloc(1, dist->alphabet_size * (2 * sizeof(int8_t) + sizeof(int16_t) + sizeof(uint32_t))
>> + + sizeof(uint32_t));
>
> You can avoid the multiplication by using av_calloc((2 * sizeof(int8_t)
> + sizeof(int16_t) + sizeof(uint32_t)) + sizeof(uint32_t),
> dist->alphabet_size).
That's not the same thing. This will cause us to overallocate by
4 * dist->alphabet_size - 4 bytes. Is that okay?
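
Working the two sizes out explicitly, as a sketch only: the helper names
below are hypothetical and just mirror the two av_calloc() argument forms
discussed above. The patch adds one trailing uint32_t for the whole buffer,
while the two-argument form counts that uint32_t once per symbol, so the gap
comes to 4 * alphabet_size - 4 bytes.

```c
#include <stddef.h>
#include <stdint.h>

/* Bytes per symbol: two int8_t arrays, one int16_t array, one uint32_t array. */
#define SKETCH_PER_SYMBOL \
    (2 * sizeof(int8_t) + sizeof(int16_t) + sizeof(uint32_t))

/* Size as written in the patch: per-symbol bytes plus one extra uint32_t. */
static size_t sketch_size_patch(size_t alphabet_size)
{
    return alphabet_size * SKETCH_PER_SYMBOL + sizeof(uint32_t);
}

/* Size of the suggested av_calloc(nmemb, size) form: the extra uint32_t
 * is folded into nmemb and therefore multiplied by alphabet_size too. */
static size_t sketch_size_suggested(size_t alphabet_size)
{
    return (SKETCH_PER_SYMBOL + sizeof(uint32_t)) * alphabet_size;
}
```

At the maximum alphabet size of 32768 that is 262148 bytes versus 393216
bytes; at an alphabet size of 1 the two are identical.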
>
>> if (!buf) {
>> ret = AVERROR(ENOMEM);
>> goto end;
>> }
>>
>> level2_lens = (int8_t *)buf;
>> - level2_lens_s = (int8_t *)(buf + 32768);
>> - level2_syms = (int16_t *)(buf + 65536);
>> - level2_codecounts = (uint32_t *)(buf + 131072);
>> + level2_lens_s = (int8_t *)(buf + dist->alphabet_size * sizeof(int8_t));
>> + level2_syms = (int16_t *)(buf + dist->alphabet_size * (2 * sizeof(int8_t)));
>> + level2_codecounts = (uint32_t *)(buf + dist->alphabet_size * (2 * sizeof(int8_t) + sizeof(int16_t)));
>>
>> total_code = 0;
>> for (int i = 0; i < dist->alphabet_size; i++) {
>> @@ -742,6 +745,10 @@ static int read_vlc_prefix(GetBitContext *gb, JXLEntropyDecoder *dec, JXLSymbolD
>> int extra = 3 + get_bits(gb, 2);
>> if (repeat_count_prev)
>> extra = 4 * (repeat_count_prev - 2) - repeat_count_prev + extra;
>> + if (i + extra > dist->alphabet_size) {
>> + ret = AVERROR_INVALIDDATA;
>> + goto end;
>> + }
>> for (int j = 0; j < extra; j++)
>> level2_lens[i + j] = prev;
>> total_code += (32768 >> prev) * extra;
>> @@ -772,8 +779,10 @@ static int read_vlc_prefix(GetBitContext *gb, JXLEntropyDecoder *dec, JXLSymbolD
>> }
>> }
>>
>> - if (total_code != 32768 && level2_codecounts[0] < dist->alphabet_size - 1)
>> - return AVERROR_INVALIDDATA;
>> + if (total_code != 32768 && level2_codecounts[0] < dist->alphabet_size - 1) {
>> + ret = AVERROR_INVALIDDATA;
>> + goto end;
>> + }
>>
>> for (int i = 1; i < dist->alphabet_size + 1; i++)
>> level2_codecounts[i] += level2_codecounts[i - 1];
>> @@ -848,6 +857,8 @@ static int read_distribution_bundle(GetBitContext *gb, JXLEntropyDecoder *dec,
>> if (get_bits1(gb)) {
>> int n = get_bits(gb, 4);
>> dist->alphabet_size = 1 + (1 << n) + get_bitsz(gb, n);
>> + if (dist->alphabet_size > MAX_PREFIX_ALPHABET_SIZE)
>> + return AVERROR_INVALIDDATA;
>> } else {
>> dist->alphabet_size = 1;
>> }
>
> _______________________________________________
> ffmpeg-devel mailing list
> ffmpeg-devel@ffmpeg.org
> https://ffmpeg.org/mailman/listinfo/ffmpeg-devel
>
> To unsubscribe, visit link above, or email
> ffmpeg-devel-request@ffmpeg.org with subject "unsubscribe".