Professional Documents
Culture Documents
Voice Quality Solutions For Wireless Networks
Voice Quality Solutions For Wireless Networks
Voice Quality Solutions For Wireless Networks
| March 2012 |
Contents
Background . ......................................................................................... Voice Q uality - The P roblem ....................................................................... Wireless Networks .................................................................................... Increasing Network Capacity with AMR-HR ................................................. Issues Impacting Voice Quality in Wireless Networks .................................... Voice Q uality Metrics ................................................................................ D itech Networks Overview .......................................................................... Voice Q uality Assurance . ........................................................................... Experience Intelligence ............................................................................... Conclusion . ......................................................................................... 3 4 6 8 11 12 15 16 19 19
2/20
Background
In the increasingly competitive wireless market, wireless operators must explore new ways to increase profits while reducing costs. Alongside capital and operating expense concerns, there are issues of customer retention and pressures to improve network capacity and service quality as ways to increase revenue from existing users, as well as attract new ones. Investing in better voice quality for wireless calls is a cost-effective way of improving customer satisfaction, increasing network capacity with minimal capital investment, and gaining competitive advantage by presenting subscribers with a powerful service differentiator.
3/20
Impairment
Background Noise Far-end Noise Acoustic Echo
Cause
Noise from equipment (trains, airplanes) or people in the background Background noise in the called party location Sound reflections from the caller location are heard by called party after slight delay. Networks or handsets are not engineered to proper levels; Callers speaking unusually loudly or softly.
Effect
Caller has difficulty hearing the called party Caller hears the background noise from the called party, making conversation difficult Called party hears the caller twice, with a slight delay; often interpreted as noise; can be very annoying. Caller or called party sound too soft or loud.
Level Mismatch
All of the above occur on both ends of the call, doubling the number of potential problems to eight.
Voice problems are complex, and come from a variety of sources. Some of these issues are within the service providers control, and some are not.
Impairment
Near-end noise Far-end noise Acoustic echo Level mismatch
Called Party
Protected No protection No protection No protection
4/20
Impairment
Near-end noise Far-end noise Acoustic echo Level mismatch
Called Party
Protected Protected Protected Protected
Network-based voice enhancements are a preferred solution for most impairments. Cost effective networkbased echo cancellers are in use in both wireline and wireless networks. For advanced speech processing, network-based solutions provide four important benefits:
1. Advanced processing:
Network-based solutions are not constrained by handset battery life or device size; therefore, can implement advanced algorithms for voice processing.
2. Rapid, lower cost deployment:
Network-based solutions can be deployed rapidly, without the need to provide costly incentives to a large subscriber population to upgrade their handsets.
3. Easy upgrades for lower total cost of ownership:
Network-based solutions provide modularity to incorporate advancements in speech processing by simply downloading new software into the network-based devices.
4. Complete call path coverage regardless of handset types or destinations:
Network-based solutions ensure that all calls are covered for voice quality enhancement. Ditech Networks Voice Quality Assurance (VQA) solutions improve the quality of communications networks for wireless service providers. Our full line of voice quality enhancement and echo cancellation solutions leverages our technical expertise with industry-leading service and support to enable operators to improve voice quality, reduce costs, and realize greater profits through increased customer satisfaction and improved spectrum utilization. VQA provides wireless carriers the capability to optimize and measure voice quality in real-time on all networks, including CDMA, GSM, and 3G/4G.
5/20
Wireless Networks
CDMA Networks
Code Division Multiple Access (CDMA) has opened the door to new a generation of wireless communications products and services. Using digital encoding spread spectrum Radio Frequency (RF) techniques, CDMA breaks up speech into small, digitized segments and then encodes them to identify individual calls. CDMA allows wireless service providers to pack more digital signals into a particular slice of the radio network, enabling a large number of users to share the same band of spectrum, which greatly increases system capacity. At the same time, CDMA networks face a number of impairments that require voice quality improvement. For example, connectivity to the Public Switched Telephone Network (PSTN) and the use of inexpensive handsets makes hybrid and acoustic echoes prevalent, causing degraded service quality. In addition, CDMA networks are subject to the ambient noise present in wireless environments. Amplification from a users handset microphone can intensify background noise, exacerbating the noise problem on a call and further degrading the voice quality for the talker and the listener. Another voice quality problem faced by CDMA and other wireless networks is the transfer that occurs when signals travel between different networks, making voice levels too high or too low for optimal listening comfort. Ditechs VQA technology solves CDMA networks unique challenges and improves voice quality by cancelling echo, reducing high background noise, and adjusting level controls.
GSM Networks
Global System for Mobile communication (GSM) is a major digital mobile telephone system covering nearly every nation in the world. As GSM networks evolve, many carriers are faced with the challenge of providing additional network capacity to accommodate subscriber growth and demands for new services. In addition, capacity issues worsen during peak hours, especially in densely populated metropolitan centers. One method of cost-effectively increasing capacity on GSM networks is the utilization of Half-Rate (HR). HR vocoders provide an economical channel capacity solution but they have not been widely deployed because they are associated with degraded voice quality. HR creates additional voice channels but is more susceptible to noise impairments, degrading the quality of voice services. Signal strength, ambient noise, and the effects of tandem speech transcoding also reduce the sound quality of voice calls.
6/20
3G/4G Networks
The evolution of 3G/4G technology introduced a new era in wireless communications, signaling a shift in focus from voice-only communications services to the delivery of a wide range of multimedia, content-rich information applications. 3G/4G wireless capabilities increased efficiency and improved the performance of wireless networks by enabling service subscribers to make a phone call, shop on the Internet, or watch a video clip, all at the same time, from one portable multimedia device. Similar to any wireless network, 3G/4G network users encounter echo, noise, and voice intelligibility impairments that affect the perceived quality of the call. 3G/4G networks are plagued with acoustic or non-linear echo paths. Because the echo return paths are non-stationary, it is more difficult to locate and fix the problem. The high levels of background noise common in all wireless calls are also a problem in 3G/4G networks, as they significantly reduce the sound quality of calls for the end-users.
7/20
8/20
Figure 1 : : AMR employs 8 speech codec rates (light blue) in AMR-FR mode and 6 speech codec rates in AMR-HR mode. Remaining bandwidth is used by error-correcting channel coding (dark blue)
AMR-FR uses a full rate (22.8 kbps) channel and offers the best possible AMR voice quality when using the 12.2 kbps rate in good link conditions. AMR-FR is also the most robust channel mode since it has the highest amount of channel coding available when the lowest codec rates, such as the 4.75 kbps rate, are employed. AMR-HR uses a half rate (11.4 kbps) channel, which theoretically allows carriers to support two calls for every AMR-FR channel, thus doubling the available capacity on the GERAN. Use of AMR-HR can be particularly useful to address capacity demand peaks, allowing the operator to realize additional traffic channels and avoid congestion. AMR adjusts speech codec rate (light blue) and error-correcting channel coding (dark blue) according to radio conditions for optimized voice quality. In good C/I conditions, where little error-correcting coding is needed, AMR selects a higher speech codec rate for optimal speech quality (up to 12.2 kbps). In poor C/I conditions, more error-correcting coding is needed and AMR therefore selects a lower speech rate (down to 4.75 kbps) to free up bandwidth for the error-correcting coding. Poor C/I conditions often occur when the network is heavily loaded (a lot of interference from simultaneous users) or when the signal strength is low due to limited coverage, e.g., indoors or in rural areas The AMR-HR mode saves BTS hardware and improves equipment utilization by allowing more traffic channels per TRX. However, the AMR-HR mode can only be used in relatively good C/I conditions since only half the bandwidth is available for speech and error-correcting coding.
9/20
7.4kbps
FER/DCR
With Aggressive C/I thresholds in place, low bit rate codecs are only used under low C/I conditions; this allows more bandwidth to be allocated to speech encoding in higher C/I conditions, for improved voice quality performance. However, this comes at a price of increased FER/DCR. Optimum speech quality MOS can be impacted by changes to the configured C/I thresholds, as expressed by the resultant FER. The optimum MOS is obtained for a set of C/I thresholds where the best performing codec is always used. The Operational network DCR and Optimum MOS, determines the desired operating region. If a lower DCR is desired, this will come at the cost of sub-optimal MOS. When deploying an increased use of HR, maintaining the optimum MOS and Operational Network DCR is the ultimate goal. Carriers can increase AMR-HR usage and network capacity while maintaining network quality, FER, and DCR within desired levels by changing link adaptation thresholds to use more channel encoding in both AMR-HR and AMR-FR modes and by deploying VQA to compensate for resulting degradation arising from more aggressive use of low bit rate codecs.
12.2kbps
4.75kbps
5.9kbps
10/20
This intelligent design allows carriers to increase capacity without further investment in radio resources.
11/20
The impairments that cause these and other problems include acoustic echo, low signal-to-noise ratios, a lack of comfort noise, signal levels that are not normalized for consistent volume, use of low bitrate codecs or flawed transcoding between codec types, and poor intelligibility resulting from inevitable packet loss. Individually, these impairments erode customer satisfaction. Collectively, they cause customer churn and make it difficult to attract new, loyal customers.
12/20
Objective speech quality is measured by analyzing the characteristics of the speech signal or media stream and applying a perceptual model to estimate a subjective quality score. Subjective testing is expensive and cannot be used for continuously monitoring service quality. Objective testing approaches, such as PESQ (P.862), are applicable for active testing or lab testing of equipment; however, for in-service monitoring of carrier networks, it is essential to use an efficient non-intrusive approach such as the ITU-T G.107 E-Model.
13/20
The Absolute Category Rating (ACR) method is a subjective listening method where users rate an experience from 1 to 5 (1 is Bad, 2 is Poor, 3 is Fair, 4 is Good, and 5 is Excellent). The average of a large sample of such ratings is called the Mean Opinion Score (MOS), and is, perhaps, the best-known audio quality score. When tests are carried out, it is important to qualify if it was a listening (MOS-LQ) or conversational (MOS-CQ) test. The E-Model, developed within the ETSI and ITU, derives a value called the R-Factor, which includes variables addressing conversational and transmission quality. There are several mappings between R-Factors and MOS scores these mappings are approximations.
User Opinion
Maximum Obtainable For G.711 Very Satisfied Satisfied Some Users Satisfied Many Users Dissatisfied Nearly All Users Dissatisfied Not Recommended
R-Factor
93 90 100 80 90 70 80 60 70 50 60 0 50
Generally, an R-Factor of 80 or above represents a good objective. Since R-Factors are conversational metrics, the statement that R-Factors should be 80 or more implies both a good listening quality and low delay. A MOS of 4.0 is typically regarded as toll quality, and a good target level is typically 3.7 or above. MOS values are typically quoted as listening quality scores; i.e.: MOS-LQ. The typical manufacturer-quoted MOS values for G.711 and G.729A are 4.1 and 3.9, respectively, implying that G.729A could not meet the ITU scaled MOS for Satisfied. This problem is due to the scaling of MOS and not the CODECs. Typical ACR scores for CODECs should be compared to an ACR scaled range; i.e., Satisfied would be range 3.7 to 4.1.
14/20
15/20
ANC reduces the background noise coming from the wireless talkers side and makes his/her conversation more intelligible resulting in the subscriber feeling satisfied with the network quality.
Automatic Level Control (ALC) ALC technology dynamically detects level imbalances and automatically adds or attenuates by up to 15dB, as needed to bring both sides of the call to a specified volume level.
Benefits to Wireless Network Operators/ Subscribers
When wireless calls are connected, the volume levels are often unbalanced with one side of the call too high or low for comfortable listening. Subscribers will communicate more comfortably.
Adaptive Listener Enhancement (ALE) ALE automatically raises the voice volume in one direction in response to noise measured in the other direction. This feature is useful when calls are placed in noisy environments.
Benefits to Wireless Network Operators/ Subscribers
ALE automatically increases the intelligibility of the phone call. The wireless talker is not required to turn up the volume on his handset because of high background noise.
16/20
Acoustic Echo Control (AEC) AEC adaptively converges on non-linear echoes created by handsets and hands-free phones, making calls sound clear and natural. AEC attenuates echo up to 640ms and 0dB WAEPL
Benefits to Wireless Network Operators/ Subscribers
Acoustic echo becomes more problematic with VoIP induced packet delay, making the echo more noticeable. By effectively eliminating the acoustic echo from the wireless subscriber side, the user will experience a significantly improved voice quality experience.
Enhanced Voice Intelligibility (EVI) EVI rebalances the spectral characteristics of speech components to enable the listener to more easily distinguish and understand voice in loud environments.
Benefits to Wireless Network Operators/ Subscribers
EVI automatically increases the intelligibility of the wireless call, without distorting or amplifying the signal.
Low-Bitrate Codec Voice Improvement Ditechs voice quality solutions improve the quality of service delivered using low-bitrate vocoders.
Benefits to Wireless Network Operators/ Subscribers
This allows wireless carriers to increase network capacity with minimal capital investment and without compromising voice quality.
Intelligent Packet Restoration (IPR) IPR uses a predictive speech model to reconstruct missing packets in a VoIP packet stream. It provides better subjective and objective performance than the traditional packet-loss compensation methods.
Benefits to Wireless Network Operators/ Subscribers
IPR restores high-quality voice in congested IP networks that have substantial packet loss.
Compressed Speech Processing In legacy networks as well as in some of the current networks, speech path within the core network has always been in uncompressed linear domain (G.711). Because of its uncompressed linear nature, it is relatively easy to process speech signal. In Wireless Networks where speech is transmitted in a compressed format like AMR, any voice quality treatment, such as echo suppression or noise filtration, becomes a challenge when speech remains encoded in a compressed format. Most vendors first convert the compressed speech in a linear format (PCM) for voice quality treatment and then encode it again in compressed format (AMR). This process is easy to implement but the voice quality is severely affected and delay is introduced.
17/20
Ditech Networks maintains leadership in media processing technology by enabling voice quality enhancement within compressed speech format without conversion to a linear format, therefore avoiding tandem encoding and associated delay/degradation.
Leverages existing, proven voice improvement algorithms Equivalent performance to PCM-based solutions No transcoding artifacts Minimal processing delay
18/20
Experience Intelligence
In addition to the voice quality improvement, Ditechs Experience Intelligence (EXi) solution provides operators unprecedented insight into the voice quality that subscribers are experiencing. By continuously and non-intrusively monitoring all calls in both directions and reporting a broad set of voice quality statistics, voice quality can be closely monitored. Noise and Speech Levels EXi measures per-call speech level, noise level, and signal-to-noise ratio (SNR). Echo and Delay EXi measures acoustic echo and provides statistics about echo delay and return loss. Transmission Rating Factors and Mean Opinion Scores Per-call listening and conversational R-Factor and MOS values are computed continuously and nonintrusively based on the sum of measured voice quality impairments (speech, noise, and echo) as well as codec type, frame loss, and jitter. EXis objective scores follow the ITU-T G.107 E-Model standard and use a unique DSP-based approach to analyze live voice signal. Packet Measurements EXi non-intrusively collects packet statistics on an ongoing, per-call basis, and quickly identifies the source of the IP issues that are affecting voice quality
Conclusion
By thorough analysis and characterization of live networks, Ditech has developed a high-performance, easyto-deploy, and cost-effective voice quality solutions for wireless providers. VQA dynamically adapts to noise, echo, and delay characteristics on both ends of every call to ensure a consistent and clear communications experience for both the calling and called party, for all call types (wireless VoIP, wireline VoIP/PSTN, etc.). Ditechs VQA technology brings the wireless marketplace one step closer to providing a toll quality service. As Wireless Service Providers continue to introduce new services, they must also focus on service quality assurance to differentiate themselves from the competition. The strategy of using price to attract new subscribers is only part of the solution, as markets mature and customers have higher expectations of service quality. This is why service quality should matter to Wireless Service Providers. Ditechs VQA will enable the Wireless Service Providers to be more competitive in the wireless market by helping to retain existing subscribers through user satisfaction and service usage. Consumer satisfaction will result in revenue increase for network operators.
19/20
About Ditech Networks Ditech Networks (NASDAQ: DITC) is revolutionizing modern communications with advanced voice processing solutions and voice applications that offer network and speech quality enhancement, voice-to-text services, and more. Ditech believes in the power and simplicity of human speech. Leveraging over 20 years of deployments with communications providers around the world, Ditech offers unique, effective products and services that put the subscribers experience first. Ditechs customers include Verizon, AT&T, Sprint/Nextel, Orascom Telecom, China Unicom, Global Crossing, and West Corporation.
Ditech Networks 3099 North First Street San Jose, CA 95134 USA
800 234 0884 toll free 800 770 0117 support 408 883 3611 direct 408 894 9448 fax
sales@ditechnetworks.com www.ditechnetworks.com
Copyright 2012 Ditech Networks. All rights reserved. PhoneTag, Read Your Voicemail!, Packet Voice Processor, VQA, and Experience Intelligence are trademarks of Ditech Networks. All other brands are the property of their respective owners. Specifications may change without notice. This document was last revised March, 2012.
20/20