Abstract: An increase in ICT devices and services has led to a rise in carbon emissions. As a result, there is a growing demand for energy-efficient software; however, this demand remains unmet due to ...
Abstract: Code-based Distributed Matrix Multiplication (DMM) has been widely studied as an effective method for large-scale matrix computations in distributed systems. Two central challenges in ...
As software projects progress, code quality becomes paramount because it affects the reliability, maintainability, and security of the software. For this reason, static analysis tools are used in ...
CUDA-L2 is a system that combines large language models (LLMs) and reinforcement learning (RL) to automatically optimize Half-precision General Matrix Multiply (HGEMM) CUDA kernels. CUDA-L2 ...