Document Type
Article
Publication Date
8-2025
Publisher
Elsevier
Source Publication
Big Data Research
Source ISSN
2214-5796
Abstract
While temporal sentiment labels prove invaluable for video tagging, segmentation, and labeling tasks in multimedia studies, large-scale manual annotation remains cost and time-prohibitive. Emerging Online Time-Sync Comment (TSC) datasets offer promising alternatives for generating sentiment maps. However, limitations in existing TSC scope and a lack of resource-constrained data creation guidelines hinder broader use. This study addresses these challenges by proposing a novel system for automated TSC generation utilizing recent YouTube comments as a readily accessible source of time-synchronized data. The efficacy of our multi-platform data mining system is evaluated through extensive long-term trials, leading to the development and analysis of two large-scale TSC datasets. Benchmarking against original temporal Automatic Speech Recognition (ASR) sentiment annotations validates the accuracy of our generated data. This work establishes a promising method for automatic TSC generation, laying the groundwork for further advancements in multimedia research and paving the way for novel sentiment analysis applications.
Recommended Citation
Ma, Jiachen; Sakib, Nazmus; Anik, Fahim Islam; and Ahamed, Sheikh Iqbal, "Time-Synchronized Sentiment Labeling Via Autonomous Online Comments Data Mining: A Multimodal Information Fusion on LargeScale Multimedia Data" (2025). Computer Science Faculty Research and Publications. 103.
https://epublications.marquette.edu/comp_fac/103
Comments
Accepted version. Big Data Research, Vol. 41 (August 2025): 100552. DOI. © 2025 Elsevier, Inc. Used with permission.