This paper studies a set of lip synchronization mechanisms for heterogeneous network environments. The set consists of four schemes, types 0 through 3, which are classified into the single-stream approach and the multi-stream approach. Types 0 and 1 belong to the single-stream approach, which interleaves voice and video to form a single transport stream for transmission. On the other hand, types 2 and 3, both of which are the multi-stream approach, set up separate transport streams for the individual media streams. Types 0 and 2 do not exert synchronization control at the destination, while types 1 and 3 do. We first discuss the features of each type in terms of networks intended for use, required synchronization quality of each medium, physical locations of media sources and implementation complexity. Then, a synchronization algorithm, which is referred to as the virtual-time rendering (VTR) algorithm, is specified for stored media; MPEG video and voice are considered in this paper. We implemented the four types on an ATM LAN and on an interconnected ATM-wireless LAN under the TCP protocol. The mean square error of synchronization, total pause time, throughput and total output time were measured in each of the two networks. We compare the measured performance among the four types to find out which one is the most suitable for a given condition of the underlying communication network and traffic.
The copyright of the original papers published on this site belongs to IEICE. Unauthorized use of the original or translated papers is prohibited. See IEICE Provisions on Copyright for details.
Copy
Shuji TASAKA, Yutaka ISHIBASHI, "Media Synchronization in Heterogeneous Networks: Stored Media Case" in IEICE TRANSACTIONS on Communications,
vol. E81-B, no. 8, pp. 1624-1636, August 1998, doi: .
Abstract: This paper studies a set of lip synchronization mechanisms for heterogeneous network environments. The set consists of four schemes, types 0 through 3, which are classified into the single-stream approach and the multi-stream approach. Types 0 and 1 belong to the single-stream approach, which interleaves voice and video to form a single transport stream for transmission. On the other hand, types 2 and 3, both of which are the multi-stream approach, set up separate transport streams for the individual media streams. Types 0 and 2 do not exert synchronization control at the destination, while types 1 and 3 do. We first discuss the features of each type in terms of networks intended for use, required synchronization quality of each medium, physical locations of media sources and implementation complexity. Then, a synchronization algorithm, which is referred to as the virtual-time rendering (VTR) algorithm, is specified for stored media; MPEG video and voice are considered in this paper. We implemented the four types on an ATM LAN and on an interconnected ATM-wireless LAN under the TCP protocol. The mean square error of synchronization, total pause time, throughput and total output time were measured in each of the two networks. We compare the measured performance among the four types to find out which one is the most suitable for a given condition of the underlying communication network and traffic.
URL: https://globals.ieice.org/en_transactions/communications/10.1587/e81-b_8_1624/_p
Copy
@ARTICLE{e81-b_8_1624,
author={Shuji TASAKA, Yutaka ISHIBASHI, },
journal={IEICE TRANSACTIONS on Communications},
title={Media Synchronization in Heterogeneous Networks: Stored Media Case},
year={1998},
volume={E81-B},
number={8},
pages={1624-1636},
abstract={This paper studies a set of lip synchronization mechanisms for heterogeneous network environments. The set consists of four schemes, types 0 through 3, which are classified into the single-stream approach and the multi-stream approach. Types 0 and 1 belong to the single-stream approach, which interleaves voice and video to form a single transport stream for transmission. On the other hand, types 2 and 3, both of which are the multi-stream approach, set up separate transport streams for the individual media streams. Types 0 and 2 do not exert synchronization control at the destination, while types 1 and 3 do. We first discuss the features of each type in terms of networks intended for use, required synchronization quality of each medium, physical locations of media sources and implementation complexity. Then, a synchronization algorithm, which is referred to as the virtual-time rendering (VTR) algorithm, is specified for stored media; MPEG video and voice are considered in this paper. We implemented the four types on an ATM LAN and on an interconnected ATM-wireless LAN under the TCP protocol. The mean square error of synchronization, total pause time, throughput and total output time were measured in each of the two networks. We compare the measured performance among the four types to find out which one is the most suitable for a given condition of the underlying communication network and traffic.},
keywords={},
doi={},
ISSN={},
month={August},}
Copy
TY - JOUR
TI - Media Synchronization in Heterogeneous Networks: Stored Media Case
T2 - IEICE TRANSACTIONS on Communications
SP - 1624
EP - 1636
AU - Shuji TASAKA
AU - Yutaka ISHIBASHI
PY - 1998
DO -
JO - IEICE TRANSACTIONS on Communications
SN -
VL - E81-B
IS - 8
JA - IEICE TRANSACTIONS on Communications
Y1 - August 1998
AB - This paper studies a set of lip synchronization mechanisms for heterogeneous network environments. The set consists of four schemes, types 0 through 3, which are classified into the single-stream approach and the multi-stream approach. Types 0 and 1 belong to the single-stream approach, which interleaves voice and video to form a single transport stream for transmission. On the other hand, types 2 and 3, both of which are the multi-stream approach, set up separate transport streams for the individual media streams. Types 0 and 2 do not exert synchronization control at the destination, while types 1 and 3 do. We first discuss the features of each type in terms of networks intended for use, required synchronization quality of each medium, physical locations of media sources and implementation complexity. Then, a synchronization algorithm, which is referred to as the virtual-time rendering (VTR) algorithm, is specified for stored media; MPEG video and voice are considered in this paper. We implemented the four types on an ATM LAN and on an interconnected ATM-wireless LAN under the TCP protocol. The mean square error of synchronization, total pause time, throughput and total output time were measured in each of the two networks. We compare the measured performance among the four types to find out which one is the most suitable for a given condition of the underlying communication network and traffic.
ER -