AI RESEARCH

LUMINA: LLM-Guided GPU Architecture Exploration via Bottleneck Analysis

arXiv CS.AI

ArXi:2603.05904v1 Announce Type: cross GPU design space exploration (DSE) for modern AI workloads, such as Large-Language Model (LLM) inference, is challenging because of GPUs' vast, multi-modal design spaces, high simulation costs, and complex design optimization objectives (e.g. performance, power and area trade-offs). Existing automated DSE methods are often prohibitively expensive, either requiring an excessive number of exploration samples or depending on intricate, manually crafted analyses of interdependent critical paths guided by human heuristics.