AI RESEARCH

WikiDBGraph: A Data Management Benchmark Suite for Collaborative Learning over Database Silos

arXiv CS.LG

ArXi:2505.16635v3 Announce Type: replace-cross Relational databases are often fragmented across organizations, creating data silos that hinder distributed data management and mining. Collaborative learning (CL) -- techniques that enable multiple parties to train models jointly without sharing raw data -- offers a principled approach to this challenge. However, existing CL frameworks (e.g., federated and split learning) remain limited in real-world deployments.