copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Visual Analysis of Scene-Graph-Based Visual Question Answering

N. Schäfer, S. Künzel, T. Munz-Körner, P. Tilli, S. Vidyapu, N. Thang Vu, and D. Weiskopf. Proceedings of the 16th International Symposium on Visual Information Communication and Interaction, page 1–8. New York, NY, USA, Association for Computing Machinery, (Oct 20, 2023)
DOI: 10.1145/3615522.3615547

Abstract

Scene-graph-based Visual Question Answering (VQA) has emerged as a burgeoning field in Deep Learning research, with a growing demand for robust and interpretable VQA systems. In this paper, we present a novel visual analysis approach that addresses two critical objectives in VQA: identifying and correcting prediction issues and providing insights into model decision-making processes through visualizing internal information. Our approach builds on the GraphVQA framework, which uses graph neural networks to process scene graphs representing images and which was trained on the widely-used GQA dataset. Our analysis tool aims at users familiar with the basics of graph-based VQA. By leveraging query-based scene analysis and visualization of crucial internal states, we are able to detect and pinpoint reasons for inaccurate predictions, facilitating model refinement and dataset curation. Identifying expressive internal states is a challenge. Through rigorous computer-based evaluations and presentation of a use case, we demonstrate the effectiveness of our analysis tool and model state visualization.

Links and resources

BibTeX key: Schaefer2023
entry type: inproceedings
address: New York, NY, USA
booktitle: Proceedings of the 16th International Symposium on Visual Information Communication and Interaction
year: 2023
month: 10
day: 20
pages: 1–8
publisher: Association for Computing Machinery
series: VINCI '23
isbn: 9798400707513
location: Guangzhou, China
DOI: 10.1145/3615522.3615547
url: https://doi.org/10.1145/3615522.3615547

@tanjamunz's tags highlighted

Cite this publication

@inproceedings{Schaefer2023, abstract = {Scene-graph-based Visual Question Answering (VQA) has emerged as a burgeoning field in Deep Learning research, with a growing demand for robust and interpretable VQA systems. In this paper, we present a novel visual analysis approach that addresses two critical objectives in VQA: identifying and correcting prediction issues and providing insights into model decision-making processes through visualizing internal information. Our approach builds on the GraphVQA framework, which uses graph neural networks to process scene graphs representing images and which was trained on the widely-used GQA dataset. Our analysis tool aims at users familiar with the basics of graph-based VQA. By leveraging query-based scene analysis and visualization of crucial internal states, we are able to detect and pinpoint reasons for inaccurate predictions, facilitating model refinement and dataset curation. Identifying expressive internal states is a challenge. Through rigorous computer-based evaluations and presentation of a use case, we demonstrate the effectiveness of our analysis tool and model state visualization.}, added-at = {2023-10-23T19:14:40.000+0200}, address = {New York, NY, USA}, author = {Schäfer, Noel and Künzel, Sebastian and Munz-Körner, Tanja and Tilli, Pascal and Vidyapu, Sandeep and Thang Vu, Ngoc and Weiskopf, Daniel}, biburl = {https://puma.ub.uni-stuttgart.de/bibtex/245f80081761d7a4507e7f2b210b79768/tanjamunz}, booktitle = {Proceedings of the 16th International Symposium on Visual Information Communication and Interaction}, day = 20, doi = {10.1145/3615522.3615547}, interhash = {e858fa98d08c5e0e3d2b4baefa5bd60e}, intrahash = {45f80081761d7a4507e7f2b210b79768}, isbn = {9798400707513}, keywords = {exc2075 myown pn6 vis(us) visus visus:kuenzesn visus:munzta visus:weiskopf}, location = {Guangzhou, China}, month = {10}, pages = {1–8}, publisher = {Association for Computing Machinery}, series = {VINCI '23}, timestamp = {2023-10-23T19:14:40.000+0200}, title = {Visual Analysis of Scene-Graph-Based Visual Question Answering}, url = {https://doi.org/10.1145/3615522.3615547}, year = 2023 }

PUMA

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Visual Analysis of Scene-Graph-Based Visual Question Answering

Abstract

Links and resources

Tags

community

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews
(0)

PUMA

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Visual Analysis of Scene-Graph-Based Visual Question Answering

Abstract

Links and resources

Tags

community

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews (0)

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Visual Analysis of Scene-Graph-Based Visual Question Answering

Comments and Reviews
(0)