
Main Characteristics of the Codec

Published: 2013-2-4

2.2 Codec
The main characteristics of Speex can be summarized as follows:
• Free software/open-source, patent and royalty-free
• Integration of narrowband and wideband using an embedded bit-stream
• Wide range of bit-rates available (from 2.15 kbps to 44 kbps)
• Dynamic bit-rate switching (AMR) and Variable Bit-Rate (VBR) operation
• Voice Activity Detection (VAD, integrated with VBR) and discontinuous transmission (DTX)
• Variable complexity
• Embedded wideband structure (scalable sampling rate)
• Ultra-wideband sampling rate at 32 kHz
• Intensity stereo encoding option
• Fixed-point implementation
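To put the bit-rate range above in perspective, the following sketch converts a bit-rate into the number of bytes needed per encoded frame. The 20 ms frame duration is an assumption brought in from outside this text, and the function name is purely illustrative.

```python
import math

FRAME_MS = 20  # assumed frame duration; not stated in the text above


def payload_bytes_per_frame(bitrate_bps, frame_ms=FRAME_MS):
    """Return the whole number of bytes needed to carry one frame."""
    bits = bitrate_bps * frame_ms / 1000.0
    return math.ceil(bits / 8)


if __name__ == "__main__":
    # The two ends of the range quoted in the feature list.
    for kbps in (2.15, 44.0):
        size = payload_bytes_per_frame(kbps * 1000)
        print(kbps, "kbps ->", size, "bytes per frame")
```

Under that assumption, the lowest rate needs only a handful of bytes per frame, which is why per-packet header overhead matters so much at low bit-rates.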
2.3 Preprocessor
This part refers to the preprocessor module introduced in the 1.1.x branch. The preprocessor is designed to be used on the
audio before running the encoder. The preprocessor provides three main functionalities:
• noise suppression
• automatic gain control (AGC)
• voice activity detection (VAD)
Figure 2.1: Acoustic echo model
The denoiser can be used to reduce the amount of background noise present in the input signal. This provides higher-quality
speech whether or not the denoised signal is encoded with Speex (or at all). However, when using the denoised signal with the
codec, there is an additional benefit. Speech codecs in general (Speex included) tend to perform poorly on noisy input and
often amplify the noise. The denoiser greatly reduces this effect.
Automatic gain control (AGC) is a feature that deals with the fact that the recording volume may vary by a large amount
between different setups. The AGC provides a way to adjust a signal to a reference volume. This is useful for voice over
IP because it removes the need for manual adjustment of the microphone gain. A secondary advantage is that by setting the
microphone gain to a conservative (low) level, it is easier to avoid clipping.
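The idea of adjusting a signal to a reference volume can be sketched as follows. This is a minimal block-wise illustration, not the algorithm used by the Speex preprocessor: the function name, the RMS-based level measure, the target level and the gain cap are all assumptions made for the example. A practical AGC would also smooth the gain over time.

```python
import math


def apply_agc(samples, target_rms=0.1, max_gain=20.0):
    """Scale a block of float samples (range -1.0..1.0) toward target_rms."""
    if not samples:
        return []
    rms = math.sqrt(sum(s * s for s in samples) / len(samples))
    if rms == 0.0:
        return list(samples)  # pure silence: nothing to scale
    # Cap the gain so that very quiet blocks (mostly noise) are not blown up.
    gain = min(target_rms / rms, max_gain)
    # Hard-limit the result to the valid range as a last resort against clipping.
    return [max(-1.0, min(1.0, s * gain)) for s in samples]


if __name__ == "__main__":
    quiet = [0.01, -0.01] * 4   # recorded with a conservative (low) mic gain
    print(apply_agc(quiet))     # brought up to the reference level
```

Because the gain is applied after capture, the microphone itself can stay at a low setting, which is the clipping advantage described above.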
The voice activity detector (VAD) provided by the preprocessor is more advanced than the one directly provided in the
codec.
2.4 Adaptive Jitter Buffer
When transmitting voice (or any content for that matter) over UDP or RTP, packets may be lost, arrive with different delays,
or even arrive out of order. The purpose of a jitter buffer is to reorder packets and buffer them long enough (but no longer than
necessary) so they can be sent to be decoded.
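The reordering role of a jitter buffer can be illustrated with a small sketch. It is deliberately simplified: the buffer depth is fixed rather than adaptive, duplicates and timing are ignored, and the class and method names are hypothetical rather than taken from the Speex API.

```python
import heapq


class ReorderBuffer:
    """Toy fixed-depth jitter buffer keyed by packet sequence number."""

    def __init__(self, depth=2):
        self.depth = depth          # packets held back before any release
        self._heap = []             # min-heap of (sequence, payload)
        self._last_released = -1    # highest sequence number already released

    def push(self, seq, payload):
        """Store a packet and return the packets that are now ready, in order."""
        if seq <= self._last_released:
            return []               # too late: its slot has already been played
        heapq.heappush(self._heap, (seq, payload))
        released = []
        while len(self._heap) > self.depth:
            released.append(self._pop())
        return released

    def flush(self):
        """Release everything still buffered, in sequence order."""
        released = []
        while self._heap:
            released.append(self._pop())
        return released

    def _pop(self):
        seq, payload = heapq.heappop(self._heap)
        self._last_released = seq
        return seq, payload


if __name__ == "__main__":
    buf = ReorderBuffer(depth=2)
    for seq in (0, 2, 1, 4, 3, 5):  # packets arriving out of order
        for packet in buf.push(seq, "frame-%d" % seq):
            print(packet)
    for packet in buf.flush():
        print(packet)
```

A larger depth tolerates more reordering at the cost of more delay, which is the trade-off an adaptive buffer tunes at run time.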
2.5 Acoustic Echo Canceller
In any hands-free communication system (Fig. 2.1), speech from the remote end is played on the local loudspeaker, propagates
in the room and is captured by the microphone. If the audio captured from the microphone is sent directly to the remote end,
then the remote user hears an echo of their own voice. An acoustic echo canceller is designed to remove the acoustic echo before it
is sent to the remote end. It is important to understand that the echo canceller is meant to improve the quality on the remote
end.
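The principle can be sketched with a normalized LMS (NLMS) adaptive filter, which estimates the loudspeaker-to-microphone path and subtracts the predicted echo from the microphone signal. NLMS is used here only because it is short; it is an illustrative stand-in and is not claimed to be the algorithm Speex implements. The function name, tap count and step size are assumptions, and the sketch ignores near-end speech (double-talk), which a real canceller must handle.

```python
import random


def nlms_echo_cancel(far_end, mic, taps=8, mu=0.5, eps=1e-8):
    """Return mic with an adaptively estimated echo of far_end removed."""
    weights = [0.0] * taps    # current estimate of the echo path
    history = [0.0] * taps    # most recent far-end samples, newest first
    out = []
    for x, d in zip(far_end, mic):
        history = [x] + history[:-1]
        echo_estimate = sum(w * h for w, h in zip(weights, history))
        err = d - echo_estimate          # signal that goes to the remote end
        norm = sum(h * h for h in history) + eps
        step = mu * err / norm           # step normalized by input energy
        weights = [w + step * h for w, h in zip(weights, history)]
        out.append(err)
    return out


if __name__ == "__main__":
    random.seed(0)
    far = [random.uniform(-1.0, 1.0) for _ in range(3000)]
    room = [0.6, 0.3, -0.2, 0.1]         # made-up echo path for the demo
    mic = [sum(room[k] * far[n - k] for k in range(len(room)) if n - k >= 0)
           for n in range(len(far))]
    cleaned = nlms_echo_cancel(far, mic)
    print("echo power:    ", sum(v * v for v in mic[-500:]))
    print("residual power:", sum(v * v for v in cleaned[-500:]))
```

Note that the far-end signal is the reference input and the cleaned microphone signal is what gets transmitted, so the benefit is heard at the remote end.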
2.6 Resampler
In some cases, it may be useful to convert audio from one sampling rate to another. There are many reasons for that. It can
be for mixing streams that have different sampling rates, for supporting sampling rates that the soundcard doesn’t support, for
transcoding, etc. That’s why there is now a resampler that is part of the Speex project. This resampler can be used to convert
between any two arbitrary rates (the only requirement is that their ratio be a rational number) and there is control over the
quality/complexity tradeoff.
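The rational-ratio requirement can be made concrete with a short sketch. It reduces the two rates to a ratio in lowest terms and then resamples by linear interpolation. Linear interpolation is a deliberately low-quality stand-in chosen for brevity, not the filter used by the Speex resampler, and both function names are hypothetical. When reducing the rate, a real resampler would also low-pass filter the signal to avoid aliasing.

```python
from math import gcd


def rate_ratio(in_rate, out_rate):
    """Reduce out_rate / in_rate to lowest terms as (numerator, denominator)."""
    g = gcd(in_rate, out_rate)
    return out_rate // g, in_rate // g


def resample_linear(samples, in_rate, out_rate):
    """Resample by linear interpolation between neighbouring input samples."""
    if not samples:
        return []
    num, den = rate_ratio(in_rate, out_rate)
    out_len = len(samples) * num // den
    out = []
    for n in range(out_len):
        # Output sample n sits at input position n * den / num, kept exact
        # by splitting it into a whole index and a remainder.
        whole, rem = divmod(n * den, num)
        frac = rem / num
        a = samples[whole]
        b = samples[whole + 1] if whole + 1 < len(samples) else a
        out.append(a + (b - a) * frac)
    return out


if __name__ == "__main__":
    print(rate_ratio(44100, 48000))
    print(resample_linear([0.0, 1.0, 2.0, 3.0], 8000, 16000))
```

Keeping the position as an integer quotient and remainder avoids accumulating floating-point drift over long streams, which is one practical reason the ratio is handled as a rational number.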

Wuhan Translation Company

2013.2.5
