Workshop on Approximate Computing in Numerical Linear Algebra

7-10 Oct 2025 Paris (France)

We present a novel linear-complexity bottom-up sketching-based algorithm for constructing a H2 matrix and its high performance GPU implementation. The construction algorithm requires both a black-box sketching operator and an entry evaluation function. The novelty of our GPU approach centers around the design and implementation of the above two operations in batched mode on GPU with accommodation for variable-size data structures in a batch. The batch algorithms minimize the number of kernel launches and maximize the GPU throughput.
When applied to covariance matrices, volume IE matrices and H2 update operations, our proposed GPU implementation achieves up to 13X speedup over our CPU implementation, and up to 1000X speedup over an existing GPU implementation of the top-down sketching-based algorithm from the H2Opus library. This is the first GPU implementation of the class of bottom-up sketching-based H2 construction algorithms.

Subject :	:	Talk
Topics	:	Session 5
PDF version	:	PDF version

Privacy | Accessibility