Abstract
In multimodal presentations the perceived audiovisual quality
assessment is significantly influenced by the content of both the
audio and visual tracks. Based on our earlier subjective quality test
for finding the optimal trade-off between audio and video quality,
this paper proposes a novel method for relative multimodal
complexity analysis to derive the fusion parameter in objective
audiovisual quality metrics. Audio and video qualities are first
estimated separately using advanced quality models, and then they
are combined into the overall audiovisual quality using a linear
fusion. Based on carefully designed auditory and visual features,
the relative complexity analysis model across sensory modalities is
proposed for deriving the fusion parameter. Experimental results
have demonstrated that the content adaptive fusion parameter can
improve the prediction accuracy of objective audiovisual quality
metrics, compared to the fusion parameters obtained from the
subjective quality tests using other known optimization methods.
Original language | English |
---|---|
Title of host publication | 2011 18th IEEE International Conference on Image Processing (ICIP) |
Publisher | IEEE |
Publication date | 2011 |
ISBN (Print) | 978-1-4577-1304-0 |
ISBN (Electronic) | 978-1-4577-1302-6 |
DOIs | |
Publication status | Published - 2011 |
Event | 18th IEEE International Conference on Image Processing - Brussels, Belgium Duration: 11 Sep 2011 → 14 Sep 2011 Conference number: 18 http://www.icip2011.org/ |
Conference
Conference | 18th IEEE International Conference on Image Processing |
---|---|
Number | 18 |
Country/Territory | Belgium |
City | Brussels |
Period | 11/09/2011 → 14/09/2011 |
Internet address |
Series | International Conference on Image Processing. Proceedings |
---|---|
ISSN | 1522-4880 |
Keywords
- Content analysis
- Quality fusion
- Audiovisual quality assessment
- Multimodal complexity