AI RESEARCH
How Blind and Low-Vision Individuals Prefer Large Vision-Language Model-Generated Scene Descriptions
arXiv CS.AI
arXiv:2502.14883v3 Announce Type: replace-cross
For individuals with blindness or low vision (BLV), navigating complex environments can pose serious risks. Large Vision-Language Models (LVLMs) show promise for generating scene descriptions, but their effectiveness for BLV users remains underexplored. To address this gap, we conducted a user study with eight BLV participants to systematically evaluate their preferences for six types of LVLM descriptions. While the descriptions helped reduce fear and improve actionability, user ratings varied widely in sufficiency and conciseness.