From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from ffbox0-bg.mplayerhq.hu (ffbox0-bg.ffmpeg.org [79.124.17.100]) by master.gitmailbox.com (Postfix) with ESMTP id 72D954881B for ; Fri, 19 Jan 2024 15:30:12 +0000 (UTC) Received: from [127.0.1.1] (localhost [127.0.0.1]) by ffbox0-bg.mplayerhq.hu (Postfix) with ESMTP id C699868C254; Fri, 19 Jan 2024 17:30:09 +0200 (EET) Received: from EUR04-DB3-obe.outbound.protection.outlook.com (mail-db3eur04on2073.outbound.protection.outlook.com [40.107.6.73]) by ffbox0-bg.mplayerhq.hu (Postfix) with ESMTPS id DB80B68CC9D for ; Fri, 19 Jan 2024 17:30:02 +0200 (EET) ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=JwKKobxOwMi72LK59kgX9JI3DxjX+bWNdAkfbHGx/6t712ukz5/o+bZcjXxvr/eLrjKvdE4RTMYMLytfFzC+DK6fCbQ1lgisyxtVvWvpPxrXuY7L2YUROvBVNMzHVrKguXQBrtEYkYx2mx0ZYKSZPhRwK40u/w98UNwhr3wV+mXa26kcAr5DgkXUIJ0POf9XHVKLTnpMo6O7eKoDPzvuUPDkq1jwQfL/PLG2sJIhlDqPf/bCATxMU4N580wp5M4Nl7aEwHaHoHisxyr2q7EPM8Z0sQzgwRJLN4umFobeVvSMtHFLPpGvxR/f7daZJdgNMTTKz2ak7lxeIDunwRW5tQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-AntiSpam-MessageData-ChunkCount:X-MS-Exchange-AntiSpam-MessageData-0:X-MS-Exchange-AntiSpam-MessageData-1; bh=DlrVPpanR98U8+PfmB7hsCYXOELxkxe+lJptSNKNtms=; b=K8Y94EVaQZNPa6ncs5EjD71YMBRiA+gPx3dmPXyLs31ZXHNsf3siGkRpbKICG8ERSxB9PswwAwB8iZIx3AfzQrCVRZ/kCnTPdBOXXMjWt+Udc2fNDvVxznsPiQrL6W8QFVyijddgUFWpWJYTmOLn/bFHjbU5JsEt9zQyK4+94hzxUy6cEoAzhFqjbKm5s9rlGuiMSzF5ET0vVUdLUeHuoBPeBirdVoafuFqmm0Tsi6W7pIcD3Rpv/4T8QyLVTOBGLPkPwVWtuHxZx7fA/Tz9BRdgudQrSr69wDvfJutN5IBakJ4VFH7pZjhIN8wgd452yDhlf5G4KyY4XwEYCa9CgA== ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=axelera.ai; dmarc=pass action=none header.from=axelera.ai; dkim=pass header.d=axelera.ai; arc=none Received: from DB8P194MB0839.EURP194.PROD.OUTLOOK.COM (2603:10a6:10:16b::21) by AS8P194MB1286.EURP194.PROD.OUTLOOK.COM (2603:10a6:20b:3d2::10) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.7228.9; Fri, 19 Jan 2024 15:30:00 +0000 Received: from DB8P194MB0839.EURP194.PROD.OUTLOOK.COM ([fe80::10ba:9021:161e:c004]) by DB8P194MB0839.EURP194.PROD.OUTLOOK.COM ([fe80::10ba:9021:161e:c004%5]) with mapi id 15.20.7228.009; Fri, 19 Jan 2024 15:30:00 +0000 To: "ffmpeg-devel@ffmpeg.org" Thread-Topic: RISC-V vector DSP functions: Motivation for commit 446b009 Thread-Index: AdpK68+SrJKoEYZBRD+92ap362gqHw== Date: Fri, 19 Jan 2024 15:30:00 +0000 Message-ID: Accept-Language: en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: x-ms-publictraffictype: Email x-ms-traffictypediagnostic: DB8P194MB0839:EE_|AS8P194MB1286:EE_ x-ms-office365-filtering-correlation-id: cae5b3a4-da6d-4174-5d22-08dc19038000 x-ms-exchange-senderadcheck: 1 x-ms-exchange-antispam-relay: 0 x-microsoft-antispam: BCL:0; x-microsoft-antispam-message-info: EK1xfsBXa+Wvnte+euIDfsLECLQQrzqNmn78A+iyZCviKWrUrszjFCVrN56wmxZBRy3hgJQvyGpOAvCDiXwmhPFGr5UX7rqJUmA/Rr7p9cQ3a8z7PAezKi/WFl+EMhpLEFswwkAjKqm/Oo/wVVlwvNI8jT7R0++7CtzTxBwhMVi0KkCJPoDK0oYoP7di5J6we71LXNAmxRmAyF6uJ2z1KHQulJ1t9ggQUcULu4Sp8FkNrW8+AfEfI/qDViiejb3503ytfSPhKQhOwI9oihJNjmIaFXGRhg+qBC3ACF5Dujp21ewpk/trMclIv/PukVex1MjaC6lXFxxsfrYHnpTMSYxy2eSXi3sdaEQJ8HQtXV0816J1i0G22kNU54beZx+K61NgrPm/4sCsUmQuajDHZEMwk/tZorgybk7iTO1TwHeIHYhV0Kok9B0hdwCNcRUGzl7pk8C2iOEEePsmZrepinvmZzPznynEUJEDo8iI8x/Qm1PvqQpOCw8exk+6RVD8sX5rpxPE6hMmHeAuo7cVzv5rHfa61/icyiN3lAVL65/FS/SzROXbd7J78RLN3H1M4VszcSW4yl9QEj2vL3yYuy18uI0lKk87EUvaudVvvm6DKgZp8k0cBVrxMLoCvDLi x-forefront-antispam-report: CIP:255.255.255.255; CTRY:; LANG:en; SCL:1; SRV:; IPV:NLI; SFV:NSPM; H:DB8P194MB0839.EURP194.PROD.OUTLOOK.COM; PTR:; CAT:NONE; SFS:(13230031)(346002)(136003)(39840400004)(396003)(376002)(366004)(230922051799003)(64100799003)(1800799012)(451199024)(186009)(26005)(83380400001)(71200400001)(478600001)(9686003)(7696005)(6506007)(41300700001)(52536014)(6916009)(122000001)(5660300002)(2906002)(44832011)(55016003)(316002)(64756008)(86362001)(8676002)(4326008)(8936002)(38100700002)(4744005)(33656002)(38070700009)(66476007)(76116006)(66446008)(66556008)(66946007); DIR:OUT; SFP:1101; x-ms-exchange-antispam-messagedata-chunkcount: 1 x-ms-exchange-antispam-messagedata-0: =?utf-8?B?TTZDcmpFVExscEhOaGdqREVKOGJQNkNzZ096a3dJNGtHMXozWXE5a0crb0Fk?= =?utf-8?B?RlRyOXBJdkl2MHZJdkNaWUNtaEo3NnhXWURkUGJMakNkbFpGNWY5S040ZXFu?= =?utf-8?B?Qkg4Yk1GK3gvSnQrUVV6T1VIeVpzOCtVR25KVElWTUxjRFUxVnBWSUZRM2RF?= =?utf-8?B?SW9QZ21VMCtmVW02MXQ5dW5Nc3dodExRV3J5eGtuRHJMU1hEem9DaE54U3dn?= =?utf-8?B?TExpRzFRdWJQOHhmeFl1SGRTb01WbXlYUURldnRiZXZDUnQ3NnBkS2lKdEFD?= =?utf-8?B?dEFtQk4zR3ZNNWE3WGhjR3pqWGxkVU00dFdzckVyV3N1dnBjVGs3S2pRQnhu?= =?utf-8?B?MlNOd2RYTy9RWEhiZ2ErU2xSR0dxYUdoUFNLaSt4MmdtKzBoQTByUUx3MzBt?= =?utf-8?B?bWhOb1UzbDR6NU80a1c1aWxjbHJJZnJCVW5pbnhyMFpwbCtLZC9qTklvZWw4?= =?utf-8?B?bDF4MlRkMmxVR0VkOWNieEZERjFUZWtuS3ZSa3ZNTHNtaWt0ejdDNWlKZHEv?= =?utf-8?B?YWgrcHRvTlkwUW4rK01Jck5Gd1N0WEFQbjJZWEFmM2lqc0pVcGkyQTNTSWdF?= =?utf-8?B?OXJlQjVxRW5nZk5jZ1lFTVU4REkyTjdBYmdoRFlJVXZocGZkdFk2RE53eDNH?= =?utf-8?B?UGE4TzV3eXJtcE8yaDFucWtxRFo1WUpqeDR2ODIwZkV4azhaQUJnRkhlS2JU?= =?utf-8?B?Q09qMEs1YzBrZmlTdW9RODVINWxNSDY2eWhCK3pEdytoSWF0L21FUTlCUDJp?= =?utf-8?B?eC92SThvMFoyZkRDRmdoelEzWElMblZ4aHlQTnhFZzFWa01hMnU0QUlpeGFp?= =?utf-8?B?akI2VW5SYkU3WHhWV3ZPZzVYbEFuclZJcTJQbTdEb0k2LzBrQ2RBcW5uZVZ5?= =?utf-8?B?MXNwVHBBQytTcXdvSVlrNlZDalVPOTVnS3NhNnVaalVPWVJ1U3YzY1RSSEl5?= =?utf-8?B?VnlQdHZRc1JRS0phQUk3UW1DaS81OXdhZUhsUFJZR042UXRxK1BEYllDMStK?= =?utf-8?B?T1ozNndveTVvZ2pmaWthZGhFbFdFM1pFT1RmVkpnejRhSG1OZTBqTW16aGIv?= =?utf-8?B?WUpwSzF5azZNU08xdEpqeDVuMTZNa2RLNnM4eVVXZVpRcC9DU2IydUY3eXZx?= =?utf-8?B?Z2JFUVlCaWsraVZDY2hRS0h0UytRa25oeGpZVkRnUUxHdm5yY2hUZlczbTBB?= =?utf-8?B?MHRsc0RsR2Q3ZEtqbWlnNlZtR3poUEhQOXBEbkZtVnlqb0FyZktMTU1ReXN1?= =?utf-8?B?MXYxdnhVRlc3cUZvNmhqNXR6TlJmZFl2dXRmYTY1eFFlQ3RVak9HM2hjaWow?= =?utf-8?B?Mm1rNktzaWExZUpHbGo5ZHk0T1laaUNHS1F4ZVVpQ1pRTm9pR0Y3bm00L3RI?= =?utf-8?B?d2JxNWlaNmxXMUU2WGpNdDdwcnF4YUw5dDRhS0ltVTJFUWllM0dVc0tXTzlt?= =?utf-8?B?WXJPYzlMQVNjK3g1N1VYemszNkhaUzBvbm5veERpT0d0ZEJ2MTJrUUptY3hW?= =?utf-8?B?WWpLMUlRM3k4b0tGSTlpOXRXUmlPV3ZUajBReHRiWDZ0a1F4bC9sUUJyTnRT?= =?utf-8?B?eGtiTm53MjJyVFZoc3B0cTZCUmhYRk1SRHpjRTA0bXowQXI3SWtnWWlmN1o4?= =?utf-8?B?b24ySFlXWGM1QnZUTmhNR1ZlS0Z2L0pERjNpQ0NCU1JXQVNPZWJSUkZmTnZM?= =?utf-8?B?Nm9MZnhxc1o4NDJDNHNhckxmTHFkMkVvckhaOHRvakx0Y1JtZ1JJaHNZMlNX?= =?utf-8?B?cDFNem1oTVU2MnB1alNrSFI5RnV0Mk5FWEhYMXZSa2I0ZXZuSGRzVW14TGo1?= =?utf-8?B?ZVczYU0ycURDcFJnNiswSUZYMXp3aUxoM0dZRnMyeTIvSTlmMnRKZC9jU25H?= =?utf-8?B?bWZZK01aam0yMnltNGo0WkJYMUV6QVlyY3VTQXc2TDNMU1Y0aGFoZU1yRUNn?= =?utf-8?B?TnUzQXRwZjh3bmp0Qk9ybm53U0drVWp2RlJXSDlqSlBIUlhkWE1zaTA0bm82?= =?utf-8?B?cHplU0E3K05jY2QvZUx3dFg2aUxRRkFDWTlxTnhFLzdQc3g3Zm5BRmU5dlFD?= =?utf-8?B?SXJCVGQrZGNGWnlxSGg5b1ErU2g3YnU4RjkzQWtQYlJ6SWRXOVYvanU1ZXJS?= =?utf-8?B?dEdESEtmZVBpRUg1QW9NYWlIcnNzSGh3WmZpSi9EUFhER21zODdpUllHRDZt?= =?utf-8?B?d2c9PQ==?= MIME-Version: 1.0 X-OriginatorOrg: axelera.ai X-MS-Exchange-CrossTenant-AuthAs: Internal X-MS-Exchange-CrossTenant-AuthSource: DB8P194MB0839.EURP194.PROD.OUTLOOK.COM X-MS-Exchange-CrossTenant-Network-Message-Id: cae5b3a4-da6d-4174-5d22-08dc19038000 X-MS-Exchange-CrossTenant-originalarrivaltime: 19 Jan 2024 15:30:00.2612 (UTC) X-MS-Exchange-CrossTenant-fromentityheader: Hosted X-MS-Exchange-CrossTenant-id: 9c838ba7-c38c-416d-a793-1de07a190ebd X-MS-Exchange-CrossTenant-mailboxtype: HOSTED X-MS-Exchange-CrossTenant-userprincipalname: /ayV4W2H8CnGa/vDgp4Iw5bN7+JXDxRkiOrbRve9pYhYeOtY8L1HSQYxUhNWcEDmYcXEuGBeWRMu8hDp4S8tX1hgLnzBkWsQ1dDI+HlPkmU= X-MS-Exchange-Transport-CrossTenantHeadersStamped: AS8P194MB1286 X-Content-Filtered-By: Mailman/MimeDel 2.1.29 Subject: [FFmpeg-devel] RISC-V vector DSP functions: Motivation for commit 446b009 X-BeenThere: ffmpeg-devel@ffmpeg.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: FFmpeg development discussions and patches List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , From: Michael Platzer via ffmpeg-devel Reply-To: FFmpeg development discussions and patches Cc: Michael Platzer , "remi@remlab.net" Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Errors-To: ffmpeg-devel-bounces@ffmpeg.org Sender: "ffmpeg-devel" Archived-At: List-Archive: List-Post: Hi, Commit 446b0090cbb66ee614dcf6ca79c78dc8eb7f0e37 by Remi Denis-Courmont has replaced RISC-V vector loads and stores with negative stride with vrgather (generalized permutation within vector registers) instructions in order to reverse the elements in a vector register. The commit message explains that this change was done, but it does not explain why. I fail to see what could possibly have motivated this change. The RISC-V vector loads and stores support negative stride values for use cases such as this one. Using vrgather instead replaces the more specific operation with a more generic one, which is likely to be less performant on most HW architectures. In addition, it requires to setup an index vector, thus raising dynamic instruction count. Could someone familiar with this change (perhaps Remi himself) please explain the motivation for this change? Thanks, Michael _______________________________________________ ffmpeg-devel mailing list ffmpeg-devel@ffmpeg.org https://ffmpeg.org/mailman/listinfo/ffmpeg-devel To unsubscribe, visit link above, or email ffmpeg-devel-request@ffmpeg.org with subject "unsubscribe".