Performance Analysis and Optimization of the Vector-Kronecker Product Multiplication - Recherche en informatique (CRI)
 Accéder directement au contenu
Communication Dans Un Congrès Année : 2020

Performance Analysis and Optimization of the Vector-Kronecker Product Multiplication

Résumé

The Kronecker product, also called tensor product, is a fundamental matrix algebra operation, used to model complex systems using structured descriptions. This operation needs to be computed efficiently, since it is a critical kernel for iterative algorithms. In this work, we focus on the vector-kronecker product operation, where we present an in-depth performance analysis of a sequential and a parallel algorithm previously proposed. Based on this analysis, we proposed three optimizations: changing the memory access pattern, reducing load imbalance and manually vectorizing some portions of the code with Intel SSE4.2 intrinsics. The obtained results show better cache usage and load balance, thus improving the performance, especially for larger matrices.
Fichier principal
Vignette du fichier
A-734.pdf (457.09 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-02958782 , version 1 (29-10-2020)

Identifiants

Citer

Alexandre Azevedo, Cristiana Bentes, Maria Clicia Castro, Claude Tadonki. Performance Analysis and Optimization of the Vector-Kronecker Product Multiplication. 11th Workshop on Applications for Multi-Core Architectures (WAMCA 2020), in conjunction the 2020 IEEE 32nd International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD), Sep 2020, Porto, Portugal. ⟨10.1109/SBAC-PAD49847.2020.00044⟩. ⟨hal-02958782⟩
25 Consultations
400 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More