Visual Analysis of Scene-Graph-Based Visual Question Answering

Proceedings of the 16th International Symposium on Visual Information Communication and Interaction, pages 1–8. New York, NY, USA, Association for Computing Machinery (Oct 20, 2023)
DOI: 10.1145/3615522.3615547

Abstract

Scene-graph-based Visual Question Answering (VQA) has emerged as a burgeoning field in Deep Learning research, with a growing demand for robust and interpretable VQA systems. In this paper, we present a novel visual analysis approach that addresses two critical objectives in VQA: identifying and correcting prediction issues and providing insights into model decision-making processes through visualizing internal information. Our approach builds on the GraphVQA framework, which uses graph neural networks to process scene graphs representing images and which was trained on the widely-used GQA dataset. Our analysis tool aims at users familiar with the basics of graph-based VQA. By leveraging query-based scene analysis and visualization of crucial internal states, we are able to detect and pinpoint reasons for inaccurate predictions, facilitating model refinement and dataset curation. Identifying expressive internal states is a challenge. Through rigorous computer-based evaluations and presentation of a use case, we demonstrate the effectiveness of our analysis tool and model state visualization.
