AI RESEARCH

When Slots Compete: Slot Merging in Object-Centric Learning

arXiv CS.CV

ArXi:2603.11246v1 Announce Type: new Slot-based object-centric learning represents an image as a set of latent slots with a decoder that combines them into an image or features. The decoder specifies how slots are combined into an output, but the slot set is typically fixed: the number of slots is chosen upfront and slots are only refined. This can lead to multiple slots competing for overlapping regions of the same entity rather than focusing on distinct regions. We