AI RESEARCH

PEARL: Personalized Streaming Video Understanding Model

arXiv CS.AI

arXiv:2603.20422v1 Announce Type: cross Human cognition of new concepts is inherently a streaming process: we continuously recognize new objects or identities and update our memories over time. However, current multimodal personalization methods are largely limited to static images or offline videos. This disconnects continuous visual input from instant real-world feedback, limiting their ability to provide the real-time, interactive personalized responses essential for future AI assistants.