AI RESEARCH

Fast and Flexible Audio Bandwidth Extension via Vocos

arXiv cs.LG

arXiv:2603.07285v1 Announce Type: cross We propose a Vocos-based bandwidth extension model that enhances audio at 8-48 kHz by generating the missing high-frequency content. Inputs are resampled to 48 kHz and processed by a neural vocoder backbone, enabling a single network to support arbitrary upsampling ratios. A lightweight Linkwitz-Riley-inspired refiner merges the original low band with the generated high frequencies via a smooth crossover.
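The crossover idea can be illustrated with a minimal sketch. This is not the paper's refiner: it assumes a textbook 4th-order Linkwitz-Riley magnitude response, a crossover placed at the Nyquist frequency of the original input, and a bin-by-bin blend of two spectra. The function names `crossover_hz`, `lr4_gains`, and `merge_bins` are hypothetical and chosen only for illustration.

```python
TARGET_SR = 48_000  # output sample rate stated in the abstract


def crossover_hz(input_sr):
    """Assumption: cross over at the Nyquist frequency of the original input."""
    return input_sr / 2.0


def lr4_gains(freq_hz, fc_hz):
    """Magnitude gains (low, high) of a 4th-order Linkwitz-Riley crossover.

    The low-pass magnitude is 1 / (1 + (f/fc)^4) and the high-pass magnitude
    is (f/fc)^4 / (1 + (f/fc)^4), so the two gains always sum to 1.
    """
    r4 = (freq_hz / fc_hz) ** 4
    low = 1.0 / (1.0 + r4)
    high = r4 / (1.0 + r4)
    return low, high


def merge_bins(original, generated, bin_freqs_hz, fc_hz):
    """Blend two spectra bin by bin using the complementary crossover gains.

    `original` holds bins of the resampled input (trusted below fc_hz),
    `generated` holds bins of the model output (trusted above fc_hz).
    """
    merged = []
    for x_low, x_high, f in zip(original, generated, bin_freqs_hz):
        g_low, g_high = lr4_gains(f, fc_hz)
        merged.append(g_low * x_low + g_high * x_high)
    return merged


if __name__ == "__main__":
    fc = crossover_hz(16_000)  # a 16 kHz input, resampled to TARGET_SR
    for f in (1_000.0, 8_000.0, 20_000.0):
        print(f, lr4_gains(f, fc))
```

Because the two gains are complementary, the blend keeps the original signal almost untouched well below the crossover and hands over to the generated content above it. At the crossover frequency itself both gains equal 0.5 (about -6 dB), the characteristic Linkwitz-Riley behavior.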