한국과학기술원 도서관

서지주요정보
Statistical voice/data multiplexing with integrated flow control = 집적 유통 제어에 의한 음성 및 데이타의 통계적 다중화에 관한 연구
서명 / 저자	Statistical voice/data multiplexing with integrated flow control = 집적 유통 제어에 의한 음성 및 데이타의 통계적 다중화에 관한 연구 / Hyeong-Ho Lee.
발행사항	[서울 : 한국과학기술원, 1983].
Online Access	원문보기 원문인쇄

소장정보

등록번호

4102328

소장위치/청구기호

학술문화관(문화관) 보존서고

DEE 8307

휴대폰 전송

도서상태

이용가능(대출불가)

사유안내

반납예정일

리뷰정보

초록정보

In this dissertation, various aspects of statistical voice/data multiplexing have been studied, and new results have been obtained in three areas that attract recent research interests. First, the on-off speech statistics of conversational speech have been measured using an accurate speech detector and a large data base. Based on the measurement results, probability density functions of silence and talkspurt durations have been modeled approximately by two weighted geometric probability density functions. Then, for any value of hangover or fill-in, important speech parameters such as speech activity, and average silence and talkspurt durations have been calculated based on the model of probability density function of silence durations, and compared with those measured. Directly measured values of speech parameters and those calculated have been shown to be in good agreement for a practical range of hangover and fill-in time. For a large hangover time greater than 200 ms, silences and talkspurts have been fit by an exponential distrubution and a constant-plus-exponential distribution in the continuous time domain, respectively. On the other hand, for a large fill-in time greater than 200 ms, silences and talkspurts have been modeled by a constant-plus-exponential distribution and an exponential distribution, respectively. With both large hangover and fill-in values, the talkspurt model agrees closely with the measured data, but the silence model does not agree as closely as the talkspurt model. Second, the performances of embedded adaptive differential pulse code modulation (ADPCM), embedded adaptive delta modulation (ADM), and the same systems with a delayed-decision scheme have been studied with real speech over a wide dynamic range. The embedded ADPCM and ADM coders have been obtained by modifying the conventional ADPCM and ADM coders. The basic scheme of the embedded ADPCM coder is based on the ADPCM originally proposed by Cummiskey et al. For embedded ADM systems, three representative ADM systems have been chosen. They are continuously variable slope DM (CVSD), constant factor DM (CFDM) and hybrid companding DM (HCDM). Among these embedded coders, the performance of the embedded HCDM is superior to the other coders over a wide range of transmission rate from 16 kbits/s to 64 kbits/s. When the delayed-decision scheme is applied to the embedded ADPCM, the performance is improved significantly at all transmission rates. But, in the embedded ADM systems with 16 KHz sampling rate, the performance improvement resulting from delayed decision is not so much drastic as is in the embedded ADPCM with the same number of delayed samples. Finally, new statistical voice/data multiplexers have been proposed. The structures of these multiplexers are based on a new queueing method, a frame management scheme and an integrated flow control algorithm. For the queueing purpose, two separate voice and data buffers are used to queue voice and data packets independently. Each queue discipline is first-in first-out (FIFO). Four versions of master frame management have been considered. They are fixed boundary (FB), movable boundary for data traffic (MBD), movable boundary for voice traffic (MBV), and movable boundary for both voice and data traffics (MBB). For the integrated flow control algorithm, speech coding techniques such as variable rate coding and embedded coding have been considered. To evaluate the performances of the proposed multiplexers, new analytical methods have been proposed. For the performance analysis, a two-state Markov model has been chosen to reflect the frame-by-frame correlation of talkspurt activity for the same talker. For a data traffic model, three representitive models have been considered. They are Poisson, batch Poisson, and mixed Poisson arrival processes. By using these input traffic models, analytical formulations for the performance analysis have been made from the equivalent queueing models of the proposed multiplexing systems. Then, numerical results on the performance of the multiplexers have been obtained by using the Gauss-Seidel iteration method. These results show that there exist tradeoffs between data and voice traffic performances for a given input traffic load and the output channel rate. To study various aspects of the proposed voice/data multiplexing systems, computer simulation has been done with real speech. Two types of the statistical voice/data multiplexers have been studied; one employs a variable rate ADPCM coder and the other an embedded ADPCM coder with a delayed-decision scheme. The performance results of these multiplexers have been obtained with various system parameter values. Both simulation and analysis results show that the quality of received speech is largely dependent on the average input rate of data traffic in statistical voice/data multiplexing with MBB frame management.

본 논문에서는 음성 및 데이타의 통계적 다중화에 관한 제반 문제들이 연구되었으며, 특히 최근 관심이 집중되고 있는 세가지 분야에서 새로운 결과들을 얻었다. 첫째로 충분히 많은 데이타 베이스와 정확한 음성 검출기를 사용하여 대화시의 음성 신호에 대한 통계적 특성을 실험적으로 측정하였다. 측 정결과에 근거하여 silence 길이와 talkspurt 길이의 확률 밀도 함수들을 두 개의 기하학적인 확률 밀도 함수를 조합하여 근사적으로 표시하였다. 또한 임의의 hangover 나 fill-in 시간에 대하여, 음성의 activity, 평균 silence 및 talkspurt 의 길이 등과 같은 중요한 음성 변수들이 앞서 구한 silence 의 확률 밀도 함수 model 에 근거하여 이론적으로 계산되고 실험적으로도 측정되었다. 음성 변수들의 측정치와 계산치는 실용적인 hangover 와 fill-in 시간 영역에서 잘 부합되는 것을 알 수 있었다. 200 ms 보다 큰 hangover시간을 사용할 경우, silence 와 talkspurt 의 길이는 continuous time domain 에서 각각 exponential 분포와 constant-plus-exponential 분포로 modeling 된다. 반면에 200 ms 보다 큰 fill-in 시간을 사용할 경우, silence 와 talkspurt 의 길이는 각각 constant-plus-exponential 분포와 exponential 분포로 modeling 된다. 200 ms 보다 큰 hangover 나 fill-in 시간의 어떠한 경우에도 talkspurt 분포 model 은 실제 측정 데이타와 잘 부합하였으나, silence 분포 model 은 talkspurt 분포 model 만큼 아주 잘 맞지는 않았다. 둘째로 음성 및 데이타의 통계적 다중화 장비에 사용할 수 있는 새로운 형태의 음성 파형 부호화기들의 성능에 대해 연구하였다. 비교 연구된 음성 부호화기들은 embedded adaptive differential code modulation (ADPCM), embedded adaptive delta modulation (ADM) 방식이며, 각각 의 부호화기에 대하여 delayed decision 방법을 적용한 새로운 embedded 부호화 방식도 연구되었다. Embedded ADPCM 과 ADM 부호화기는 각각 종래의 ADPCM 과 ADM 부호화기를 변형하여 고안되었다. Embedded ADPCM 의 기본 구조는 Cummiskey 등이 제안한 ADPCM 방식에 근거하여 고안되었으며, embedded ADM에 대해서는 세가지 대표적인 ADM즉 continuously variable slope DM (CVSD), constant factor DM (CFDM) 과 hybrid companding DM (HCDM) 을 변형하여 embedded CVSD, embedded CFDM 및 embedded HCDM 부호화기를 고안하였다. 실제의 음성 신호를 사용한 컴퓨터 simulation 의 결과에 의하면 입력 신호 대잡음비 또는 segmented 입력신호 대 잡음비를 성능의 판별 기준으로 삼을 경우, 위 네가지 embedded부호화기 중에서 embedded HCDM 이 넓은 전송속도 범위 (16 - 64 kbits/s) 에 걸쳐서 가장 우수한 성능을 가진 것으로 판명되었다. 한편 delayed decision방법을 embedded ADPCM 에 적용한 경우 그 성능이 모든 전송속도에서 획기적으로 향상되었으나, 16 kHz sampling rate 를 사용한 embedded ADM 부호화기들에 delayed decision 방법을 사용한 경우에는 같은 수의 입력 sample을 지연 시켜 delayed decision 방법에 의해 embedded ADPCM 부호화한 경우보다 성능 향상도가 낮았다. 마지막으로 새로운 음성 및 데이타의 통계적 다중화 장비들을 제안하고 그 성능에 대하여 연구 되었다. 이 다중화 장비들의 구조는 새로운 queueing 방법, frame management 방법 및 집적 유통 제어 algorithm 들에 근거하여 구성된다. 즉 음성과 데이타 packet 을 별도로 queueing 시키기 위하여 두개의 분리된 음성 buffer와 데이타 buffer를 사용한다. 이때 각각의 queue 는 first-in first-out (FIFO) 규칙에 의하여 동작한다. Master frame management를 위하여 fixed boundary (FB), movable boundary for data traffic (MBD), movable boundary for voice traffic (MBV) 및 movable boundary for both voice and data traffics(MBB) 의 네가지 방식이 사용되었다. 또한 집접 유통 제어를 위하여 variable rate coding 및 embedded coding 과 같은 음성 부호화 방식을 사용하였다. 제안된 다중화 장비들의 성능을 연구 하기 위하여 같은 화자(talker) 의 talkspurt activity 가 인접한 frame 사이에 상관 관계가 있음을 반영하여 주는 two-state Markov model 을 음성 신호의 model 로 사용하였다. 데이타 신호의 model 로는 Poisson, batch Poisson 및 mixed Poisson arrival process 를 사용하였다. 이러한 입력 model 들과 제안된 다중화 시스템의 등가 queueing model 을 이용하여, 성능 분석을 위한 이론적인 formulation 을 세우고 Gauss-Seidel iteration 방법에 의하여 수치 해석적인 결과를 얻었다. 이 결과에 의하면 주어진 입력 traffic load 와 출력 channel rate 에서 음성신호에 대한 성능과 데이타 신호에 대한 성능 사이에 tradeoff 가 있음을 알 수 있었다. 즉 다중화 장비의 음성 신호에 대한 성능을 높이면 음o}o} 신호에 대한 성능이 낮아지고, 반대로 데이타 신호에 대한 성능을 높이면 음성 신호에 대한 성능이 낮아진다. 제안된 다중화 장비의 여러가지 측면을 연구하기 위하여 실제 음성 신호를 가지고 두가지 유형의 음성 및 데이타의 통계적 다중화 장비에 대한 컴퓨터 simulation 을 행하였다. 그중 하나는 variable rate ADPCM 음성 부호화기를 사용한 것이고 다른 하나는 delayed decision 방법을 적용한 embedded ADPCM 음성 부호화기를 사용한 것이다. 이 두가지 다중화 장비의 성능에 대하여 여러가지 시스템 변수들의 값을 변화시키면서 실험적으로 결과를 얻었다. 이 simulation결과와 analysis 결과에 의하면 MBB frame management 를 사용한 음성 및 데이타의 통계적 다중화 장비에서 재생된 음성의 질이 입력 데이타 traffic의 평균 rate 에 따라 크게 좌우된다는 것을 알 수 있었다.

서지기타정보

서지기타정보
청구기호	{DEE 8307
형태사항	iii, 267 p. : 삽화 ; 26 cm
언어	영어
일반주기	Includes appendix 저자명의 한글표기 : 이형호 지도교수의 영문표기 : Chong-Kwan Un 지도교수의 한글표기 : 은종관
학위논문	학위논문(박사) - 한국과학기술원 : 전기및전자공학과,
서지주기	Reference : p. 254-266
주제	Delta modulation. Speech processing systems. Data traffic management systems (Computer systems) Automatic machinery --Buffer stocks. 통계적 방법. --과학기술용어시소러스 다중화 장치. --과학기술용어시소러스 음성 다중화. --과학기술용어시소러스 트래픽 처리. --과학기술용어시소러스 버퍼 방식. --과학기술용어시소러스 Multiplexing.

QR CODE

책소개

전체보기

나의 도서관정보

메뉴

소장정보

리뷰정보

초록정보

서지기타정보

책소개

목차

이 주제의 인기대출도서