The semantic annotation of large multimedia corpora
is essential for numerous tasks. Be it for the training of
classification algorithms, efficient content retrieval, or for analytical
reasoning, appropriate labels are often the first necessity
before automatic processing becomes efficient. However, manual
labeling of large datasets is time-consuming and tedious. Hence,
we present a new visual approach for labeling and retrieval of
reports in multimedia news corpora. It combines automatic classifier
training based on caption text from news reports with human
interpretation to ease the annotation process. In our approach,
users can initialize labels with keyword queries and iteratively
annotate examples to train a classifier. The proposed visualization
displays representative results in an overview that allows to
follow different annotation strategies (e.g., active learning) and
assess the quality of the classifier. Based on a usage scenario, we
demonstrate the successful application of our approach. Therein,
users label several topics which interest them and retrieve related
documents with high confidence from three years of news reports.
%0 Conference Paper
%1 han2018visual
%A Han, Qi
%A John, Markus
%A Kurzhals, Kuno
%A Messner, Johannes
%A Ertl, Thomas
%B 2018 Leipzig Symposium on Visualization in Applications
%D 2018
%K 2018 vis(us) vis-gis visus:ertl visus:hanqi visus:johnms visus:kurzhako
%T Visual Interactive Labeling of Large Multimedia News Corpora
%X The semantic annotation of large multimedia corpora
is essential for numerous tasks. Be it for the training of
classification algorithms, efficient content retrieval, or for analytical
reasoning, appropriate labels are often the first necessity
before automatic processing becomes efficient. However, manual
labeling of large datasets is time-consuming and tedious. Hence,
we present a new visual approach for labeling and retrieval of
reports in multimedia news corpora. It combines automatic classifier
training based on caption text from news reports with human
interpretation to ease the annotation process. In our approach,
users can initialize labels with keyword queries and iteratively
annotate examples to train a classifier. The proposed visualization
displays representative results in an overview that allows to
follow different annotation strategies (e.g., active learning) and
assess the quality of the classifier. Based on a usage scenario, we
demonstrate the successful application of our approach. Therein,
users label several topics which interest them and retrieve related
documents with high confidence from three years of news reports.
@inproceedings{han2018visual,
abstract = {The semantic annotation of large multimedia corpora
is essential for numerous tasks. Be it for the training of
classification algorithms, efficient content retrieval, or for analytical
reasoning, appropriate labels are often the first necessity
before automatic processing becomes efficient. However, manual
labeling of large datasets is time-consuming and tedious. Hence,
we present a new visual approach for labeling and retrieval of
reports in multimedia news corpora. It combines automatic classifier
training based on caption text from news reports with human
interpretation to ease the annotation process. In our approach,
users can initialize labels with keyword queries and iteratively
annotate examples to train a classifier. The proposed visualization
displays representative results in an overview that allows to
follow different annotation strategies (e.g., active learning) and
assess the quality of the classifier. Based on a usage scenario, we
demonstrate the successful application of our approach. Therein,
users label several topics which interest them and retrieve related
documents with high confidence from three years of news reports.},
added-at = {2018-12-16T20:33:07.000+0100},
author = {Han, Qi and John, Markus and Kurzhals, Kuno and Messner, Johannes and Ertl, Thomas},
biburl = {https://puma.ub.uni-stuttgart.de/bibtex/2d8744b00cedee1bcc77630dc7675f505/qihan},
booktitle = {2018 Leipzig Symposium on Visualization in Applications},
interhash = {101690e7d39347fd94744ccea7da9f16},
intrahash = {d8744b00cedee1bcc77630dc7675f505},
keywords = {2018 vis(us) vis-gis visus:ertl visus:hanqi visus:johnms visus:kurzhako},
month = oct,
timestamp = {2018-12-17T18:26:10.000+0100},
title = {Visual Interactive Labeling of Large Multimedia News Corpora},
year = 2018
}