Git Inbox Mirror of the ffmpeg-devel mailing list - see https://ffmpeg.org/mailman/listinfo/ffmpeg-devel
 help / color / mirror / Atom feed
* [FFmpeg-devel] [PATCH] libavfilter/signature_lookup: fix jaccard distance
@ 2024-05-29 12:06 Gerion Entrup
  2024-05-31 20:17 ` Michael Niedermayer
  0 siblings, 1 reply; 4+ messages in thread
From: Gerion Entrup @ 2024-05-29 12:06 UTC (permalink / raw)
  To: ffmpeg-devel; +Cc: Sachin Tilloo

Actually, the jaccard distance is defined as D = 1 - intersect / union.
Additionally, the distance value is compared against a constant that
must be between 0 and 1, which is not the case here. Both facts together
has led to the fact, that the function always returned a matching course
signature. To leave the constant intact and to avoid floating point
computation, this commit multiplies with 1 << 16 making the constant
effectively 9000 / (1<<16) =~ 0.14.

Reported-by: Sachin Tilloo <sachin.tilloo@gmail.com>
Reviewed-by: Sachin Tilloo <sachin.tilloo@gmail.com>
Tested-by: Sachin Tilloo <sachin.tilloo@gmail.com>
---
 libavfilter/signature_lookup.c | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/libavfilter/signature_lookup.c b/libavfilter/signature_lookup.c
index b39a3e225b..b90b63f3f2 100644
--- a/libavfilter/signature_lookup.c
+++ b/libavfilter/signature_lookup.c
@@ -127,9 +127,10 @@ static int get_jaccarddist(SignatureContext *sc, CoarseSignature *first, CoarseS
 {
     int jaccarddist, i, composdist = 0, cwthcount = 0;
     for (i = 0; i < 5; i++) {
-        if ((jaccarddist = intersection_word(first->data[i], second->data[i])) > 0) {
+        if ((jaccarddist = (1 << 16) * intersection_word(first->data[i], second->data[i])) > 0) {
             jaccarddist /= FFMAX(union_word(first->data[i], second->data[i]), 1);
         }
+        jaccarddist = (1 << 16) - jaccarddist;
         if (jaccarddist >= sc->thworddist) {
             if (++cwthcount > 2) {
                 /* more than half (5/2) of distances are too wide */
-- 
2.43.2

_______________________________________________
ffmpeg-devel mailing list
ffmpeg-devel@ffmpeg.org
https://ffmpeg.org/mailman/listinfo/ffmpeg-devel

To unsubscribe, visit link above, or email
ffmpeg-devel-request@ffmpeg.org with subject "unsubscribe".

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: [FFmpeg-devel] [PATCH] libavfilter/signature_lookup: fix jaccard distance
  2024-05-29 12:06 [FFmpeg-devel] [PATCH] libavfilter/signature_lookup: fix jaccard distance Gerion Entrup
@ 2024-05-31 20:17 ` Michael Niedermayer
  2024-06-02 12:02   ` Gerion Entrup
  0 siblings, 1 reply; 4+ messages in thread
From: Michael Niedermayer @ 2024-05-31 20:17 UTC (permalink / raw)
  To: FFmpeg development discussions and patches


[-- Attachment #1.1: Type: text/plain, Size: 2405 bytes --]

On Wed, May 29, 2024 at 02:06:07PM +0200, Gerion Entrup wrote:
> Actually, the jaccard distance is defined as D = 1 - intersect / union.
> Additionally, the distance value is compared against a constant that
> must be between 0 and 1, which is not the case here. Both facts together
> has led to the fact, that the function always returned a matching course
> signature. To leave the constant intact and to avoid floating point
> computation, this commit multiplies with 1 << 16 making the constant
> effectively 9000 / (1<<16) =~ 0.14.
> 
> Reported-by: Sachin Tilloo <sachin.tilloo@gmail.com>
> Reviewed-by: Sachin Tilloo <sachin.tilloo@gmail.com>
> Tested-by: Sachin Tilloo <sachin.tilloo@gmail.com>
> ---
>  libavfilter/signature_lookup.c | 3 ++-
>  1 file changed, 2 insertions(+), 1 deletion(-)
> 
> diff --git a/libavfilter/signature_lookup.c b/libavfilter/signature_lookup.c
> index b39a3e225b..b90b63f3f2 100644
> --- a/libavfilter/signature_lookup.c
> +++ b/libavfilter/signature_lookup.c
> @@ -127,9 +127,10 @@ static int get_jaccarddist(SignatureContext *sc, CoarseSignature *first, CoarseS
>  {
>      int jaccarddist, i, composdist = 0, cwthcount = 0;
>      for (i = 0; i < 5; i++) {
> -        if ((jaccarddist = intersection_word(first->data[i], second->data[i])) > 0) {
> +        if ((jaccarddist = (1 << 16) * intersection_word(first->data[i], second->data[i])) > 0) {
>              jaccarddist /= FFMAX(union_word(first->data[i], second->data[i]), 1);
>          }
> +        jaccarddist = (1 << 16) - jaccarddist;
>          if (jaccarddist >= sc->thworddist) {
>              if (++cwthcount > 2) {
>                  /* more than half (5/2) of distances are too wide */

this seems not to apply, maybe iam missing something ?

Applying: libavfilter/signature_lookup: fix jaccard distance
Using index info to reconstruct a base tree...
M	libavfilter/signature_lookup.c
Falling back to patching base and 3-way merge...
Auto-merging libavfilter/signature_lookup.c
CONFLICT (content): Merge conflict in libavfilter/signature_lookup.c
error: Failed to merge in the changes.



[...]

-- 
Michael     GnuPG fingerprint: 9FF2128B147EF6730BADF133611EC787040B0FAB

Homeopathy is like voting while filling the ballot out with transparent ink.
Sometimes the outcome one wanted occurs. Rarely its worse than filling out
a ballot properly.

[-- Attachment #1.2: signature.asc --]
[-- Type: application/pgp-signature, Size: 195 bytes --]

[-- Attachment #2: Type: text/plain, Size: 251 bytes --]

_______________________________________________
ffmpeg-devel mailing list
ffmpeg-devel@ffmpeg.org
https://ffmpeg.org/mailman/listinfo/ffmpeg-devel

To unsubscribe, visit link above, or email
ffmpeg-devel-request@ffmpeg.org with subject "unsubscribe".

^ permalink raw reply	[flat|nested] 4+ messages in thread

* [FFmpeg-devel] [PATCH] libavfilter/signature_lookup: fix jaccard distance
  2024-05-31 20:17 ` Michael Niedermayer
@ 2024-06-02 12:02   ` Gerion Entrup
  2024-06-07 10:19     ` Michael Niedermayer
  0 siblings, 1 reply; 4+ messages in thread
From: Gerion Entrup @ 2024-06-02 12:02 UTC (permalink / raw)
  To: ffmpeg-devel; +Cc: Sachin Tilloo

Actually, the jaccard distance is defined as D = 1 - intersect / union.
Additionally, the distance value is compared against a constant that
must be between 0 and 1, which is not the case here. Both facts together
has led to the fact, that the function always returned a matching course
signature. To leave the constant intact and to avoid floating point
computation, this commit multiplies with 1 << 16 making the constant
effectively 9000 / (1<<16) =~ 0.14.

Reported-by: Sachin Tilloo <sachin.tilloo@gmail.com>
Reviewed-by: Sachin Tilloo <sachin.tilloo@gmail.com>
Tested-by: Sachin Tilloo <sachin.tilloo@gmail.com>
---
Sorry, it should apply clean now.

 libavfilter/signature_lookup.c | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/libavfilter/signature_lookup.c b/libavfilter/signature_lookup.c
index a0ca818a9b..46602874de 100644
--- a/libavfilter/signature_lookup.c
+++ b/libavfilter/signature_lookup.c
@@ -127,9 +127,10 @@ static int get_jaccarddist(SignatureContext *sc, CoarseSignature *first, CoarseS
 {
     int jaccarddist, i, composdist = 0, cwthcount = 0;
     for (i = 0; i < 5; i++) {
-        if ((jaccarddist = intersection_word(first->data[i], second->data[i])) > 0) {
+        if ((jaccarddist = (1 << 16) * intersection_word(first->data[i], second->data[i])) > 0) {
             jaccarddist /= union_word(first->data[i], second->data[i]);
         }
+        jaccarddist = (1 << 16) - jaccarddist;
         if (jaccarddist >= sc->thworddist) {
             if (++cwthcount > 2) {
                 /* more than half (5/2) of distances are too wide */
-- 
2.43.2

_______________________________________________
ffmpeg-devel mailing list
ffmpeg-devel@ffmpeg.org
https://ffmpeg.org/mailman/listinfo/ffmpeg-devel

To unsubscribe, visit link above, or email
ffmpeg-devel-request@ffmpeg.org with subject "unsubscribe".

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: [FFmpeg-devel] [PATCH] libavfilter/signature_lookup: fix jaccard distance
  2024-06-02 12:02   ` Gerion Entrup
@ 2024-06-07 10:19     ` Michael Niedermayer
  0 siblings, 0 replies; 4+ messages in thread
From: Michael Niedermayer @ 2024-06-07 10:19 UTC (permalink / raw)
  To: FFmpeg development discussions and patches


[-- Attachment #1.1: Type: text/plain, Size: 1112 bytes --]

On Sun, Jun 02, 2024 at 02:02:53PM +0200, Gerion Entrup wrote:
> Actually, the jaccard distance is defined as D = 1 - intersect / union.
> Additionally, the distance value is compared against a constant that
> must be between 0 and 1, which is not the case here. Both facts together
> has led to the fact, that the function always returned a matching course
> signature. To leave the constant intact and to avoid floating point
> computation, this commit multiplies with 1 << 16 making the constant
> effectively 9000 / (1<<16) =~ 0.14.
> 
> Reported-by: Sachin Tilloo <sachin.tilloo@gmail.com>
> Reviewed-by: Sachin Tilloo <sachin.tilloo@gmail.com>
> Tested-by: Sachin Tilloo <sachin.tilloo@gmail.com>
> ---
> Sorry, it should apply clean now.
> 
>  libavfilter/signature_lookup.c | 3 ++-
>  1 file changed, 2 insertions(+), 1 deletion(-)

will apply

thx

[...]

-- 
Michael     GnuPG fingerprint: 9FF2128B147EF6730BADF133611EC787040B0FAB

Its not that you shouldnt use gotos but rather that you should write
readable code and code with gotos often but not always is less readable

[-- Attachment #1.2: signature.asc --]
[-- Type: application/pgp-signature, Size: 195 bytes --]

[-- Attachment #2: Type: text/plain, Size: 251 bytes --]

_______________________________________________
ffmpeg-devel mailing list
ffmpeg-devel@ffmpeg.org
https://ffmpeg.org/mailman/listinfo/ffmpeg-devel

To unsubscribe, visit link above, or email
ffmpeg-devel-request@ffmpeg.org with subject "unsubscribe".

^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2024-06-07 10:19 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2024-05-29 12:06 [FFmpeg-devel] [PATCH] libavfilter/signature_lookup: fix jaccard distance Gerion Entrup
2024-05-31 20:17 ` Michael Niedermayer
2024-06-02 12:02   ` Gerion Entrup
2024-06-07 10:19     ` Michael Niedermayer

Git Inbox Mirror of the ffmpeg-devel mailing list - see https://ffmpeg.org/mailman/listinfo/ffmpeg-devel

This inbox may be cloned and mirrored by anyone:

	git clone --mirror https://master.gitmailbox.com/ffmpegdev/0 ffmpegdev/git/0.git

	# If you have public-inbox 1.1+ installed, you may
	# initialize and index your mirror using the following commands:
	public-inbox-init -V2 ffmpegdev ffmpegdev/ https://master.gitmailbox.com/ffmpegdev \
		ffmpegdev@gitmailbox.com
	public-inbox-index ffmpegdev

Example config snippet for mirrors.


AGPL code for this site: git clone https://public-inbox.org/public-inbox.git