ForgeryGPT: A Multimodal LLM for Interpretable Image Forgery Detection and Localization

ArXi:2410.10238v3 Announce Type: replace-cross Multimodal Large Language Models (MLLMs), such as GPT4o, have shown strong capabilities in visual reasoning and explanation generation. However, despite these strengths, they face significant challenges in the increasingly critical task of Image Forgery Detection and Localization (IFDL). Moreover, existing IFDL methods are typically limited to the learning of low-level semantic-agnostic clues and merely provide a single outcome judgment.