Git Inbox Mirror of the ffmpeg-devel mailing list - see https://ffmpeg.org/mailman/listinfo/ffmpeg-devel
 help / color / mirror / Atom feed
From: Timo Rothenpieler <timo@rothenpieler.org>
To: ffmpeg-devel@ffmpeg.org
Subject: Re: [FFmpeg-devel] [PATCH] avutil/mem: limit alignment to maximum simd align
Date: Tue, 27 Feb 2024 19:45:42 +0100
Message-ID: <14a01e86-c131-4ea4-89b2-18b0a3564459@rothenpieler.org> (raw)
In-Reply-To: <bb2393d6-2a5a-438b-89f8-f6d1c4e87f20@rothenpieler.org>

On 26.02.2024 17:58, Timo Rothenpieler wrote:
> On 11/02/2024 18:40, Timo Rothenpieler wrote:
>> FFmpeg has instances of DECLARE_ALIGNED(32, ...) in a lot of structs,
>> which then end up heap-allocated.
>> By declaring any variable in a struct, or tree of structs, to be 32 byte
>> aligned, it allows the compiler to safely assume the entire struct
>> itself is also 32 byte aligned.
>>
>> This might make the compiler emit code which straight up crashes or
>> misbehaves in other ways, and at least in one instances is now
>> documented to actually do (see ticket 10549 on trac).
>> The issue there is that an unrelated variable in SingleChannelElement is
>> declared to have an alignment of 32 bytes. So if the compiler does a copy
>> in decode_cpe() with avx instructions, but ffmpeg is built with
>> --disable-avx, this results in a crash, since the memory is only 16 byte
>> aligned.
>>
>> Mind you, even if the compiler does not emit avx instructions, the code
>> is still invalid and could misbehave. It just happens not to. Declaring
>> any variable in a struct with a 32 byte alignment promises 32 byte
>> alignment of the whole struct to the compiler.
>>
>> This patch limits the maximum alignment to the maximum possible simd
>> alignment according to configure.
>> While not perfect, it at the very least gets rid of a lot of UB, by
>> matching up the maximum DECLARE_ALIGNED value with the alignment of heap
>> allocations done by lavu.
>> ---
>>   libavutil/mem.c          |  2 +-
>>   libavutil/mem_internal.h | 33 ++++++++++++++++++++++++++++-----
>>   2 files changed, 29 insertions(+), 6 deletions(-)
>>
>> diff --git a/libavutil/mem.c b/libavutil/mem.c
>> index 36b8940a0c..62163b4cb3 100644
>> --- a/libavutil/mem.c
>> +++ b/libavutil/mem.c
>> @@ -62,7 +62,7 @@ void  free(void *ptr);
>>   #endif /* MALLOC_PREFIX */
>> -#define ALIGN (HAVE_AVX512 ? 64 : (HAVE_AVX ? 32 : 16))
>> +#define ALIGN (HAVE_SIMD_ALIGN_64 ? 64 : (HAVE_SIMD_ALIGN_32 ? 32 : 16))
>>   /* NOTE: if you want to override these functions with your own
>>    * implementations (not recommended) you have to link libav* as
>> diff --git a/libavutil/mem_internal.h b/libavutil/mem_internal.h
>> index 2448c606f1..b1d89a0605 100644
>> --- a/libavutil/mem_internal.h
>> +++ b/libavutil/mem_internal.h
>> @@ -76,27 +76,50 @@
>>    */
>>   #if defined(__INTEL_COMPILER) && __INTEL_COMPILER < 1110 || 
>> defined(__SUNPRO_C)
>> -    #define DECLARE_ALIGNED(n,t,v)      t __attribute__ ((aligned 
>> (n))) v
>> +    #define DECLARE_ALIGNED_T(n,t,v)    t __attribute__ ((aligned 
>> (n))) v
>>       #define DECLARE_ASM_ALIGNED(n,t,v)  t __attribute__ ((aligned 
>> (n))) v
>>       #define DECLARE_ASM_CONST(n,t,v)    const t __attribute__ 
>> ((aligned (n))) v
>>   #elif defined(__DJGPP__)
>> -    #define DECLARE_ALIGNED(n,t,v)      t __attribute__ ((aligned 
>> (FFMIN(n, 16)))) v
>> +    #define DECLARE_ALIGNED_T(n,t,v)    t __attribute__ ((aligned 
>> (FFMIN(n, 16)))) v
>>       #define DECLARE_ASM_ALIGNED(n,t,v)  t av_used __attribute__ 
>> ((aligned (FFMIN(n, 16)))) v
>>       #define DECLARE_ASM_CONST(n,t,v)    static const t av_used 
>> __attribute__ ((aligned (FFMIN(n, 16)))) v
>>   #elif defined(__GNUC__) || defined(__clang__)
>> -    #define DECLARE_ALIGNED(n,t,v)      t __attribute__ ((aligned 
>> (n))) v
>> +    #define DECLARE_ALIGNED_T(n,t,v)    t __attribute__ ((aligned 
>> (n))) v
>>       #define DECLARE_ASM_ALIGNED(n,t,v)  t av_used __attribute__ 
>> ((aligned (n))) v
>>       #define DECLARE_ASM_CONST(n,t,v)    static const t av_used 
>> __attribute__ ((aligned (n))) v
>>   #elif defined(_MSC_VER)
>> -    #define DECLARE_ALIGNED(n,t,v)      __declspec(align(n)) t v
>> +    #define DECLARE_ALIGNED_T(n,t,v)    __declspec(align(n)) t v
>>       #define DECLARE_ASM_ALIGNED(n,t,v)  __declspec(align(n)) t v
>>       #define DECLARE_ASM_CONST(n,t,v)    __declspec(align(n)) static 
>> const t v
>>   #else
>> -    #define DECLARE_ALIGNED(n,t,v)      t v
>> +    #define DECLARE_ALIGNED_T(n,t,v)    t v
>>       #define DECLARE_ASM_ALIGNED(n,t,v)  t v
>>       #define DECLARE_ASM_CONST(n,t,v)    static const t v
>>   #endif
>> +#if HAVE_SIMD_ALIGN_64
>> +    #define ALIGN_64 64
>> +    #define ALIGN_32 32
>> +#elif HAVE_SIMD_ALIGN_32
>> +    #define ALIGN_64 32
>> +    #define ALIGN_32 32
>> +#else
>> +    #define ALIGN_64 16
>> +    #define ALIGN_32 16
>> +#endif
>> +
>> +#define DECLARE_ALIGNED(n,t,v) DECLARE_ALIGNED_V(n,t,v)
>> +
>> +// Macro needs to be double-wrapped in order to expand
>> +// possible other macros being passed for n.
>> +#define DECLARE_ALIGNED_V(n,t,v) DECLARE_ALIGNED_##n(t,v)
>> +
>> +#define DECLARE_ALIGNED_4(t,v)  DECLARE_ALIGNED_T(       4, t, v)
>> +#define DECLARE_ALIGNED_8(t,v)  DECLARE_ALIGNED_T(       8, t, v)
>> +#define DECLARE_ALIGNED_16(t,v) DECLARE_ALIGNED_T(      16, t, v)
>> +#define DECLARE_ALIGNED_32(t,v) DECLARE_ALIGNED_T(ALIGN_32, t, v)
>> +#define DECLARE_ALIGNED_64(t,v) DECLARE_ALIGNED_T(ALIGN_64, t, v)
>> +
>>   // Some broken preprocessors need a second expansion
>>   // to be forced to tokenize __VA_ARGS__
>>   #define E1(x) x
> 
> I intend to push this patch soon.

applied
_______________________________________________
ffmpeg-devel mailing list
ffmpeg-devel@ffmpeg.org
https://ffmpeg.org/mailman/listinfo/ffmpeg-devel

To unsubscribe, visit link above, or email
ffmpeg-devel-request@ffmpeg.org with subject "unsubscribe".

      reply	other threads:[~2024-02-27 18:45 UTC|newest]

Thread overview: 29+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2023-12-03 20:10 [FFmpeg-devel] [PATCH] avutil/mem: always align by at least 32 bytes Timo Rothenpieler
2023-12-06 12:27 ` Timo Rothenpieler
2023-12-06 12:31 ` James Almer
2023-12-06 12:56   ` Timo Rothenpieler
2023-12-06 12:50 ` Ronald S. Bultje
2023-12-06 12:54   ` James Almer
2023-12-06 13:25 ` Martin Storsjö
2023-12-06 13:27   ` Timo Rothenpieler
2023-12-06 13:29     ` Martin Storsjö
2023-12-08  0:15   ` Timo Rothenpieler
2023-12-08  5:57     ` Martin Storsjö
2023-12-08 10:01 ` Andreas Rheinhardt
2023-12-08 17:56   ` Timo Rothenpieler
2023-12-08 18:11     ` Nicolas George
2023-12-09  5:23     ` Andreas Rheinhardt
2024-01-12 23:10       ` Timo Rothenpieler
2024-01-13  0:57         ` [FFmpeg-devel] [PATCH] avutil/mem: limit alignment to maximum simg align Timo Rothenpieler
2024-01-13  1:00           ` Timo Rothenpieler
2024-01-13 15:24           ` Timo Rothenpieler
2024-01-13 15:46           ` [FFmpeg-devel] [PATCH v2] avutil/mem: limit alignment to maximum simd align Timo Rothenpieler
2024-02-09 19:22             ` Timo Rothenpieler
2024-02-11 14:05               ` Sam James
2024-02-11 14:22               ` Rémi Denis-Courmont
2024-02-11 15:47                 ` Timo Rothenpieler
2024-02-11 14:00             ` Andreas Rheinhardt
2024-02-11 16:06               ` Timo Rothenpieler
2024-02-11 17:40               ` [FFmpeg-devel] [PATCH] " Timo Rothenpieler
2024-02-26 16:58                 ` Timo Rothenpieler
2024-02-27 18:45                   ` Timo Rothenpieler [this message]

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=14a01e86-c131-4ea4-89b2-18b0a3564459@rothenpieler.org \
    --to=timo@rothenpieler.org \
    --cc=ffmpeg-devel@ffmpeg.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link

Git Inbox Mirror of the ffmpeg-devel mailing list - see https://ffmpeg.org/mailman/listinfo/ffmpeg-devel

This inbox may be cloned and mirrored by anyone:

	git clone --mirror https://master.gitmailbox.com/ffmpegdev/0 ffmpegdev/git/0.git

	# If you have public-inbox 1.1+ installed, you may
	# initialize and index your mirror using the following commands:
	public-inbox-init -V2 ffmpegdev ffmpegdev/ https://master.gitmailbox.com/ffmpegdev \
		ffmpegdev@gitmailbox.com
	public-inbox-index ffmpegdev

Example config snippet for mirrors.


AGPL code for this site: git clone https://public-inbox.org/public-inbox.git