QR Factorization of Block Low-Rank Matrices on Multi-instance GPU

Satoshi Ohshima, Akihiro Ida, Rio Yokota, Ichitaro Yamazaki

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

1 Citation (Scopus)

Abstract

QR factorization is a fundamental operation in linear algebra and is used extensively in scientific simulations, so reducing its execution time and memory footprint is an important research target. QR factorization using block low-rank matrices (BLR-QR) has previously been proposed to address both. In this study, we consider its implementation on a GPU. Current CPUs and GPUs contain numerous computational cores, and their overall performance is the aggregate performance of those cores. Therefore, the degree of parallelism of the target calculation is important for obtaining high performance. However, many applications, including BLR-QR, do not have sufficient parallelism. Batched computation has attracted attention as a way to achieve high performance in such calculations, but adopting it requires major code rewriting and is extremely laborious. We therefore propose using the multi-instance GPU (MIG) feature of current GPUs. Using MIG, we obtained a 53.3% reduction in execution time relative to the CPU and a 77.6% reduction relative to the GPU without MIG. These results demonstrate a rapid implementation of BLR-QR on MIG and the usefulness of MIG.
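
For comparison with results reported as speedups, a time reduction of r percent corresponds to a speedup factor of 1 / (1 - r/100). The sketch below applies that standard conversion to the two percentages quoted in the abstract. The function name is hypothetical, and the resulting factors are derived here rather than quoted from the paper.

```python
def speedup_from_time_reduction(reduction_percent: float) -> float:
    """Convert a percentage reduction in run time into a speedup factor.

    If the new run time is (1 - r/100) times the old one, the speedup
    (old time / new time) is 1 / (1 - r/100).
    """
    if not 0.0 <= reduction_percent < 100.0:
        raise ValueError("reduction_percent must be in [0, 100)")
    return 1.0 / (1.0 - reduction_percent / 100.0)


# The two time reductions reported in the abstract.
for label, reduction in [("vs. CPU", 53.3), ("vs. GPU without MIG", 77.6)]:
    factor = speedup_from_time_reduction(reduction)
    print(f"{label}: {reduction}% less time = {factor:.2f}x speedup")
```

Under this conversion, the 53.3% reduction is roughly a 2.14x speedup and the 77.6% reduction roughly a 4.46x speedup.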

Original language: English
Title of host publication: Parallel and Distributed Computing, Applications and Technologies - 23rd International Conference, PDCAT 2022, Proceedings
Editors: Hiroyuki Takizawa, Hong Shen, Toshihiro Hanawa, Jong Hyuk Park, Hui Tian, Ryusuke Egawa
Publisher: Springer Science and Business Media Deutschland GmbH
Pages: 359-369
Number of pages: 11
ISBN (Print): 9783031299261
Publication status: Published - 2023
Externally published: Yes
Event: 23rd International Conference on Parallel and Distributed Computing, Applications, and Technologies, PDCAT 2022 - Sendai, Japan
Duration: Dec 7 2022 to Dec 9 2022

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 13798 LNCS
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 23rd International Conference on Parallel and Distributed Computing, Applications, and Technologies, PDCAT 2022
Country/Territory: Japan
City: Sendai
Period: 12/7/22 to 12/9/22

All Science Journal Classification (ASJC) codes

  • Theoretical Computer Science
  • General Computer Science
