Open Access
Measuring the Perceived Importance of Speech Segments for Transmission over IP Networks

Yusuke HIWASAKI, Toru MORINAGA, Jotaro IKEDO, Akitoshi KATAOKA

  • Full Text Views

    202

  • Cite this
  • Free PDF (964.8KB)

Summary :

This paper presents a way of using a linear regression model to produce a single-valued criterion that indicates the perceived importance of each block in a stream of speech blocks. This method is superior to the conventional approach, voice activity detection (VAD), in that it provides a dynamically changing priority value for speech segments with finer granularity. The approach can be used in conjunction with scalable speech coding techniques in the context of IP QoS services to achieve a flexible form of quality control for speech transmission. A simple linear regression model is used to estimate a mean opinion score (MOS) of the various cases of missing speech segments. The estimated MOS is a continuous value that can be mapped to priority levels with arbitrary granularity. Through subjective evaluation, we show the validity of the calculated priority values.

Publication
IEICE TRANSACTIONS on Communications Vol.E89-B No.2 pp.326-333
Publication Date
2006/02/01
Publicized
Online ISSN
1745-1345
DOI
10.1093/ietcom/e89-b.2.326
Type of Manuscript
Special Section PAPER (Special Section on Multimedia QoS Evaluation and Management Technologies)
Category

Authors

Keyword

FlyerIEICE has prepared a flyer regarding multilingual services. Please use the one in your native language.