From: James Almer <jamrial@gmail.com> To: ffmpeg-devel@ffmpeg.org Subject: Re: [FFmpeg-devel] [PATCHv2 1/5] lavu/float_dsp: add double-precision scalar product Date: Thu, 30 May 2024 16:10:28 -0300 Message-ID: <15eed07d-538d-407e-b85a-d1421bfd5d22@gmail.com> (raw) In-Reply-To: <20240530190659.65309-1-remi@remlab.net> On 5/30/2024 4:06 PM, Rémi Denis-Courmont wrote: > The function pointer is appended to the structure for backward binary > compatibility. Fortunately, this is allocated by libavutil, not by the > user, so increasing the structure size is safe. > --- > libavutil/float_dsp.c | 12 ++++++++++++ > libavutil/float_dsp.h | 31 ++++++++++++++++++++++++++++++- > 2 files changed, 42 insertions(+), 1 deletion(-) > > diff --git a/libavutil/float_dsp.c b/libavutil/float_dsp.c > index e9fb023466..08bbc85e3e 100644 > --- a/libavutil/float_dsp.c > +++ b/libavutil/float_dsp.c > @@ -132,6 +132,17 @@ float avpriv_scalarproduct_float_c(const float *v1, const float *v2, int len) > return p; > } > > +double ff_scalarproduct_double_c(const double *v1, const double *v2, > + size_t len) > +{ > + double p = 0.0; > + > + for (size_t i = 0; i < len; i++) > + p += v1[i] * v2[i]; > + > + return p; > +} > + > av_cold AVFloatDSPContext *avpriv_float_dsp_alloc(int bit_exact) > { > AVFloatDSPContext *fdsp = av_mallocz(sizeof(AVFloatDSPContext)); > @@ -149,6 +160,7 @@ av_cold AVFloatDSPContext *avpriv_float_dsp_alloc(int bit_exact) > fdsp->vector_fmul_reverse = vector_fmul_reverse_c; > fdsp->butterflies_float = butterflies_float_c; > fdsp->scalarproduct_float = avpriv_scalarproduct_float_c; > + fdsp->scalarproduct_double = ff_scalarproduct_double_c; > > #if ARCH_AARCH64 > ff_float_dsp_init_aarch64(fdsp); > diff --git a/libavutil/float_dsp.h b/libavutil/float_dsp.h > index 342a8715c5..5053aa240d 100644 > --- a/libavutil/float_dsp.h > +++ b/libavutil/float_dsp.h > @@ -19,6 +19,8 @@ > #ifndef AVUTIL_FLOAT_DSP_H > #define AVUTIL_FLOAT_DSP_H > > +#include <stddef.h> > + > typedef struct AVFloatDSPContext { > /** > * Calculate the entry wise product of two vectors of floats and store the result in > @@ -187,19 +189,46 @@ typedef struct AVFloatDSPContext { > */ > void (*vector_dmul)(double *dst, const double *src0, const double *src1, > int len); > + > + /** > + * Calculate the scalar product of two vectors of doubles. > + * > + * @param v1 first vector > + * @param v2 second vector > + * @param len length of vectors > + * > + * @return inner product of the vectors > + */ > + double (*scalarproduct_double)(const double *v1, const double *v2, > + size_t len); > } AVFloatDSPContext; > > /** > - * Return the scalar product of two vectors. > + * Return the scalar product of two vectors of floats. > * > * @param v1 first input vector > + * constraints: 32-byte aligned > * @param v2 first input vector > + * constraints: 32-byte aligned > * @param len number of elements > + * constraints: multiple of 16 Why are you adding this to the doxy for scalarproduct_float()? Those constrains are not correct for it. They are for scalarproduct_double() which you're adding now. > * > * @return sum of elementwise products > */ > float avpriv_scalarproduct_float_c(const float *v1, const float *v2, int len); > > +/** > + * Return the scalar product of two vectors of doubles. > + * > + * @param v1 first input vector > + * @param v2 first input vector > + * @param len number of elements > + * > + * @return inner product of the vectors > + */ > +double ff_scalarproduct_double_c(const double *v1, const double *v2, > + size_t len); > + > void ff_float_dsp_init_aarch64(AVFloatDSPContext *fdsp); > void ff_float_dsp_init_arm(AVFloatDSPContext *fdsp); > void ff_float_dsp_init_ppc(AVFloatDSPContext *fdsp, int strict); _______________________________________________ ffmpeg-devel mailing list ffmpeg-devel@ffmpeg.org https://ffmpeg.org/mailman/listinfo/ffmpeg-devel To unsubscribe, visit link above, or email ffmpeg-devel-request@ffmpeg.org with subject "unsubscribe".
next prev parent reply other threads:[~2024-05-30 19:10 UTC|newest] Thread overview: 10+ messages / expand[flat|nested] mbox.gz Atom feed top 2024-05-30 19:06 Rémi Denis-Courmont 2024-05-30 19:06 ` [FFmpeg-devel] [PATCH 2/5] lavu/lls: use ff_scalarproduct_double_c() Rémi Denis-Courmont 2024-05-30 19:06 ` [FFmpeg-devel] [PATCH 3/5] lavfi: get rid of bespoke double scalar products Rémi Denis-Courmont 2024-05-30 19:06 ` [FFmpeg-devel] [PATCH 4/5] checkasm/float_dsp: add double-precision scalar product Rémi Denis-Courmont 2024-05-30 19:06 ` [FFmpeg-devel] [PATCH 5/5] lavu/float_dsp: R-V V scalarproduct_double Rémi Denis-Courmont 2024-05-30 19:10 ` James Almer [this message] 2024-05-30 19:21 ` [FFmpeg-devel] [PATCHv2 1/5] lavu/float_dsp: add double-precision scalar product Rémi Denis-Courmont 2024-05-30 19:28 ` James Almer 2024-05-30 19:31 ` Rémi Denis-Courmont 2024-05-30 19:33 ` James Almer
Reply instructions: You may reply publicly to this message via plain-text email using any one of the following methods: * Save the following mbox file, import it into your mail client, and reply-to-all from there: mbox Avoid top-posting and favor interleaved quoting: https://en.wikipedia.org/wiki/Posting_style#Interleaved_style * Reply using the --to, --cc, and --in-reply-to switches of git-send-email(1): git send-email \ --in-reply-to=15eed07d-538d-407e-b85a-d1421bfd5d22@gmail.com \ --to=jamrial@gmail.com \ --cc=ffmpeg-devel@ffmpeg.org \ /path/to/YOUR_REPLY https://kernel.org/pub/software/scm/git/docs/git-send-email.html * If your mail client supports setting the In-Reply-To header via mailto: links, try the mailto: link
Git Inbox Mirror of the ffmpeg-devel mailing list - see https://ffmpeg.org/mailman/listinfo/ffmpeg-devel This inbox may be cloned and mirrored by anyone: git clone --mirror https://master.gitmailbox.com/ffmpegdev/0 ffmpegdev/git/0.git # If you have public-inbox 1.1+ installed, you may # initialize and index your mirror using the following commands: public-inbox-init -V2 ffmpegdev ffmpegdev/ https://master.gitmailbox.com/ffmpegdev \ ffmpegdev@gitmailbox.com public-inbox-index ffmpegdev Example config snippet for mirrors. AGPL code for this site: git clone https://public-inbox.org/public-inbox.git