Download as pdf or txt
Download as pdf or txt
You are on page 1of 52

StreamingCodecStudyReport

ForNPRDigitalMedia

JohnKean
EllynG.Sheffield,PhD

NPRLabs
September20,2013

2013NationalPublicRadio,Inc.

CONTENTS
IntroductionandExecutiveSummary............................................................................................................4
CodecChoiceandOptimalBitRates.........................................................................................................................4
LoudnessManagementWithinandBetweenStreams.............................................................................................6
EncodingComputer...................................................................................................................................................8
AudioQualityTests.........................................................................................................................................9
SelectionofCodecsbyWellInformedListeners.......................................................................................................9
AudioCodecsforTest............................................................................................................................................9
Participantsandmethodology.............................................................................................................................11
SoundSampleMaterialandPreparation.............................................................................................................12
PlaybackConditions.............................................................................................................................................12
Results.................................................................................................................................................................12
AudioQualityofCodecsConsumerTest1............................................................................................................15
AudioCodecsUnderTest....................................................................................................................................15
ParticipantsandMethodology............................................................................................................................16
Results.................................................................................................................................................................16
LoudnessConsumerTest2....................................................................................................................................21
Methodology.......................................................................................................................................................21
SamplePreparation.............................................................................................................................................21
WithinStreamResults.........................................................................................................................................22
BetweenStreamResults.....................................................................................................................................24
TechnicalRecommendations........................................................................................................................29
CodecSelectionandBitRate...................................................................................................................................29
CodecCompatibilityandLicensing..........................................................................................................................30
StreamingRequirements.........................................................................................................................................32
WirelessNetworks...............................................................................................................................................32
Codecandbitrates..................................................................................................................................................35

LoudnessConsistency...................................................................................................................................36
CurrentExperience..................................................................................................................................................36
LoudnessControlandManagement.......................................................................................................................38
ProgramLevelMeasurement..............................................................................................................................38
LoudnessValuesShouldbetheTarget,NotPeakLevels.....................................................................................39
LoudnessMonitoringandNormalizationofStreams..........................................................................................41
BatchNormalizationofAudioLoudnessforFilesandPodcasts..........................................................................45
LoudnessLevelsRelativeToOtherInternetStreams..........................................................................................45
StreamServer...............................................................................................................................................47
PresentAudioCard..................................................................................................................................................47
PatchmixDSPConfiguration....................................................................................................................................49
TritonDigitalAdInsertionSoftware........................................................................................................................50
AudioCardReplacements.......................................................................................................................................51
CautionsfortheSoftwareEncoder.........................................................................................................................51
Acknowledgements......................................................................................................................................52

STREAMINGCODECSTUDYREPORTFORNPRDIGITALMEDIA
INTRODUCTIONANDEXECUTIVESUMMARY
ThisreportpresentstheresultsofresearchandtestinginsupportofanewInternetaudiostreamingsystemtobe
launchedbyNPRDigitalMedia,withafocusondeliveryovermobilewirelessnetworkstosmartphoneandtablet
devicesusingacustomplayerapplication.Theobjectiveofthisstudywastoidentifythebestdigitalaudiocodec
andstreamingrate.Additionally,weexploredtheeffectsofaudiolevelshiftsonconsumerspreferencesand
behavioranddevelopedasolutiontoloudnessmanagementforplaybackinavarietyoflisteningconditions.
Thissectionprovidesanexecutivesummaryofthekeyfindingsandtechnicalrecommendations.Theappendices
discussimplementation,aswellasexplanationsoftheresearchandmethodology.

CODECCHOICEANDOPTIMALBITRATES
Digitalaudiocodecsaretheheartofapracticalstreamingsystem.TheymakeInternetstreamingcommercially
feasiblebyvastlyreducingthebitraterequiredoftheaudiodatawithoutnoticeablyreducingtheaudioquality.
Withoutthem,streamingwouldrequiretoomuchdatabandwidthtobeaffordable.
Codecsvaryintheirabilitytomaintainsoundqualityatlowerbitrates.Theolderones,suchasthefamiliarMP3
format,arethepoorestathidingaudibleartifactsthatresultasthebitratesarereduced.Throughcontinual
researchanddevelopment,digitalcompressiontechniqueshavecontinuedtoevolve,leadingtonewcodecsthat
providehighsoundfidelityateverlowerbitrates.
Besidessavingontransmissionexpensesrelatedtothedatabandwidth,minimizingthebitratehasdirectbenefits
forthelisteners,becauselowerstreambitrates:

Startfasterwhenastreamisselected,muchlikearadioplaysassoonastunedintoastation;
Experiencemarkedlyhigherreliability(fewerdropouts)inmobilewirelessnetworks,and
Restartaudiofasterafteradropoutoccurs
Savepotentialdatachargesforthelistener.

Thisstudyincludedrealtimefieldmeasurementsofwirelessmobilenetworks(seeStreaming
Requirements/WirelessNetworksintheAppendix),whichexplainsthatasthebitraterises,ariseinthefrequency
anddurationofdatainterruptions(dropouts)occur.Infact,datashowtheamountofinterruptionrisessteadily
atratesaboveapproximately32kbps,whereas,atlesserrates,outagestendtobecausedbyweakcellularsignals.
Notsurprisingly,testingintheWashington,DCareaalsoshowedthatstreamreliabilitywasworstduringmorning
andeveningdrivetimethepeaklisteningtimesformostpublicradioprogramming.
Therearenumerouscodecsavailableforusebyportabledevicesandcomputers.Wetestedsixoftheleading
codecswithexpertlistenersoverawiderangeofbitrates(seetheappendixforSelectionofCodecsbyWell
InformedListenersfordetails).

5
MeanOpinionScore

AfteridentifyingtheHighEfficiencyAAC(HEAAC)
codecasthetopperformingcodec,consumerswere
testedtosubstantiatetheexpertlisteners
recommendations(seetheappendixAudioQuality
ofCodecsConsumerTestfordetails).Figure1
showshowconsumersratedtheHEAACcodec,
showninred,onaverticalscaleofMeanOpinion
Scorerangingfrom1(worst)to5(best)forthe
musicgenre.Noticethatthereisrelativelylittle
differencebetweentheoverall(encoded/decoded)
qualityandtheoriginalreferenceaudio,shownto
thefarrightinlightblue.Overarangeofbitrates
suitableformobilestreaming,ourtestdataand
industryresearchshowgoodagreementbetween
thelistenerscoreswithmusicandspeechprograms.

4
MP3

HEACC
xHEAAC

AACLC
Reference

1
16 24 32 48 64 96 128
TestedRates(kbps)

Figure1 ConsumersMeanOpinionScoresforHEAACandxHEAAC
codecswithmusicatbitratesfrom16to128kbps

Recommendation1:Ofthecodecscurrentlyavailable,thestandout,overall,isHighEfficiencyAdvancedAudio
Coding(HEAAC),alsoknownasaacPlusandAAC+,forstreaming.Thiscodec,hereinreferredtoas
HEAAC,ispreferredforthefollowingreasons:

ProvidesthesoundqualityofMP3codingatlessthanhalfthebitrate;
CommonlyavailableinAppleiOS(allversions)andrecentAndroidoperatingsystems;
Licensingfeespaidbyendproductmanufacturers,andnofeetoprogramoriginators;
Supportformetadatatransportforloudnessnormalizationandancillarydatafunctions.

Asdiscussedfullyintheappendix,theoptimalbitratemustconsidervariousaspectsofthelistenersexperience,
whichmustbalancesoundqualityathigherbitratesandstreamingreliabilityatlowerbitrates.Inourview,this
balanceisbestmetwiththefollowingconsiderations:

ForbothmusicandspeechstreamedwiththeHEAACcodec,theoptimalcodecandbitrateis48kbps;
Fornonrealtimeplayback,suchaspodcasts,theoptimalbitrateformusicandspeechwiththe
HEAACcodecis64kbps;1
Higherbitratesareunnecessaryaslistenersheardminimal,ifany,improvement.
Updatetheencoderasnewversionsarereleased(updatesusuallyontheencodingside,wherethe
heavierdataprocessingoccurs).Streamingsoftwareshouldalsobedesignedtosupportthefuture
replacementwithasuperiorcodec.

Thereasonthatahigherbitratemaybeusedforpodcastsisthatthesearefiletransferstotheplayer,which
bridgeinterruptionsintheflowofdataovertheInternet.Filetransfersmaybeburstyandinterrupted,butthese
effectsareunnoticeableduringfileplayback.Livestreamsuserelativelysmalldatabufferstopermitfaster
startupandrecoveryfromdropouts,buttheyrequireamoreconstantflowofdata.
5

TheHEAACcodecisuniversallyavailableinportabledevicesoperatingiOSandmostdevicesrunningtheAndroid
OS(sinceversion3.1).ThereareearlierAndroiddevicesthatmaynothaveHEAAConboard,however,asthese
devicesaretypicallymorethantwoyearsoldandthismarketsharecontinuestodeclineasconsumersreplace
theirdeviceswithnewermodelsthatsupportHEAACaudio.

LOUDNESSMANAGEMENTWITHINANDBETWEENSTREAMS
Uniformityofperceptualloudnessisanimportantconsiderationforthelisteningexperience.Asdiscussedindetail
inLoudnessConsistency/CurrentExperience,theuniformityofpublicradiostreamsisquitepoorintermsof
loudness,audiobandwidthandlevelsofdigitalcompressionartifacts.Thechartbelowshowsaseriesofaudio
samplesfrom49publicradiostations,allcarryingthesamenewsprogramfromNPR.Theverticalpositionofthe
dotmeasuresthesubjectiveloudnessonadecibel(dB)scaleadesirablerangeis3dB.Inthistest,however,
streamsarevaryingovermorethana15dBrangeinaudiotransmissionlevel.

Loudness(dBr)

25
20
15
10
5
0
0

10

15

20

25

30

35

40

45

Figure2Audioloudness(relativedBscale)of49publicradiostationstreamscarryingWeekendEditionSaturday

Thisprojectincludedasystematicstudywithconsumerstodeterminetheirreactiontochangesinloudness,as
reportedinLoudnessConsumerTest2.Forcasessuchasstreamtostreamswitching,orhearingone
announcementatonelevelfollowedbyanotherannouncementatadifferentlevel,wemeasuredtheamountof
changeinaudiolevelthatisannoyingtolisteners.Findingsindicatethattheirregularityofaudiolevelsbetween
individualpublicradiostreamstodayoftenexceedthetoleranceoflisteners.Toensureapleasantlistener
experience,thetransmissionlevelofeachstreamshouldbenormalized,basedonitslongtermloudness,as
measuredbytheITUstandardloudnessmeter.
Asidefromstreamtostreamvariation,theeffectofloudnessvariationwithinastreamisimportantforlistener
satisfaction,dependingonexternalenvironmentalfactors.Forexample,listenersonanoisysubwayneedtighter
controlofspeechormusicdynamicrangetohearprogrammingclearly,butinbetterlisteningconditions,the
naturaldynamicsofhumanspeechoramusicalperformancecanaddtotheexperienceandenjoymentof
listeners.2

Asanexperiment,westreamedanews/informationandaclassicalmusicstationforseveralweekseachwitha
directconnectiontotheconsoleoutput,thatis,noaudioprocessing.Listenerreportswereuniversallyfavorable.
6

However,loudnessvariationshouldnotbesolvedbymorecompressionofdynamicrange.Weevaluatedthe
dynamicrangeofmanypublicradiostreamsandfoundthatmanyuseaggressivecompressionandlimiting,even
withfineartsprogramming.Thismaybesuitableforbroadcasttransmissionswhicharesubjecttosignalnoise
andinterference,andwhichmustconformtotightmodulationlimits.However,audioprocessingisnotrequired
toprotectInternetaudiostreams:itisadigitalaudiotransmissionsystemcapableofmorethan90dBofdynamic
range,whichdoesnotrequiredynamicrangecompressionperse.Reductioninthedynamicrangeisusefulfor
noisierlisteningenvironments,orsimplytocomplywiththelistenerstaste.However,itispreferabletoputthis
choiceunderthecontrolofthelistener,ratherthanarestrictionthatisuniversallyappliedtoalllisteners.
Werecommendabreakfromthetraditionofdynamic
compressionatthetransmissionend,whichforcesa
onesizefitsalltreatmenttoaudio.Figure3illustrates
howactiveloudnessmanagementcanbeappliedatthe
player.Thethumbnailpictureontheleftshowssamples
from11publicradiostreams(20to30secondseach)as
theywerereceivedonaniPhone4S.Considerable
variationinlevelisevidentintheheightsofthestreams

asmuchas15dBbetweenadjacentstreams.Thepicture
totherightshowsthesamestreamsafteractivating
Figure3Elevenpublicradiostreamsasreceived(left)and
afterprocessinginaniPhoneloudnesscontrolapp(right)
activeloudnessmanagementsoftwarethatwasaddedto
theiPhone.Thesoftwarewaseffectiveatraisingthelow
levelstreamsandbringingdowntheleveloftheloudstreams.Theaudibleeffectwasamorepleasantevennessin
soundacrossthegroup,whileeachsampleremainedfairlytruetoitsoriginalform.3
Takingacuefromindustryworkonloudnessmanagementforhighdefinitiontelevision,werecommendthe
following:

Adoptloudnessmeteringforproduction,setupandstreammonitoring.Measuretherealtime
loudnessanddynamicrangeofprogramswithITURBS.17702metering(seeAppendixfordetails)to
ensurelevelssoundmoreconsistentacrossstreamsandfromprogramtoprogram.
Peakmeteringshouldbereliedupononlytoensurethatoverloadisnotoccurring,notforevaluating
loudnessconsistencyofproductionandaudiostreams.AsdemonstratedintheAppendix,peak
measurementdoesnotcorrelatewellwithsubjectiveloudness.
Discontinuepeaknormalizationofaudiofilesinproduction.Thiseffortshouldbereplacedbyloudness
normalization,eithermanuallyorautomatically,plusactiveloudnessmanagementintheplayer.
Discontinueaggressivecompressionofaudiodynamicsbeforestreamtransmission.Withloudness
measurementoflevelsinproduction,loudnessmanagementcanbeaccomplishedwithlessdynamic
compression(or,eventually,attheplayer,whichalsohelpsensurethatstreamswillsoundbetterand
moreuniformtothelisteners).

Thiswasaratherextremeexample,aswewouldnotexpecttousethismuchcontrolforstreamsthatare
transmittedneartheproperlevel.
7

Shiftactiveloudnessmanagementtothestreamplayer,usingcustomsoftware.Thiscanresolvethe
problemofaudiocontrol,bothwithintheprogramandfromstreamtostream.SmartphonesandPCsare
capableofcontrollingtheaudioloudnessrange,andsomemusicplayerappsforsmartphonesalready
dothis.Streamplayerloudnessmanagementwouldputthecontrolinthehandsofthelistener,
accordingtheirneedsandtastes.(Thiswouldrequiretransmissionofaudiowithoutaddition
compressionatthestreamencoding,onlyloudnessnormalizationprogrammaterialtoensure
consistency.Theissuewiththisisuseofrandom,unassociatedplayerslackingtheloudness
management:theywouldplaythestreams,justwithouttheloudnessmanagementfeatureavailablein
plannedplayers.Thiswouldnotaffectmanymobiledevices,however,whichuseaplayersuppliedby
publicradiosources.)

Measurementandcontrolofaudiotransmissionlevelsforpublicradiostreamingisarguablyabiggerchallenge
thanselectingthecodecandbitrate.Fortunately,wenowhavethetechnologytomanageloudnesseffectively,
andcomfortably,foreverylistener.

ENCODINGCOMPUTER
AsdiscussedintheAppendix,NPRDigitalMediaissupplyingaservercomputertostationsforstreamencoding,
alongwithanaudiointerfaceandsoftware.ACreativemodelEMU0404audiocardwasselectedforthe
computer,aconsumerproductwithhighquality,butinputsthatarenotcompatiblewithmoststationsanalogor
digitalaudioplants.Tominimizeinstallationandoperationalissues,NPRLabsdevelopedacustomaudiointerface
boxthatensuresfullcompatibilityandsimplesetup.Theinterfacesupportsbothanaloganddigitalaudioandhas
beentestedinalivestreamforseveralweeksattwomemberstations.
TheEMU0404audiocardisdiscontinuedandthereisnodirectreplacementfromCreative.Werecommenda
studytoidentifyasuitablereplacement.ThisshouldconsiderUSBconnectedaudiointerfaces,whichare
becomingcommoninaudiosystems.

Appendices
AUDIOQUALITYTESTS
Thisappendixdetailsthreetestsconductedoverthespringandsummerof2012.Thefirst,SelectionofCodecsby
WellInformedListenerswasconductedatNPRandatthePublicRadioEngineeringConferenceinLasVegas.The
secondandthird:AudioQualityofCodecsConsumerTest1,andLoudnessConsumerTest2,wereconductedat
TowsonUniversity,nearBaltimore,withindividualsfromTowsonandthenearbycommunity.

SELECTIONOFCODECSBYWELLINFORMEDLISTENERS
Thepurposeofthispretestwasto(a)criticallyevaluateseveralcodecsthatarecurrentlybeingusedorpotential
contendersfortransmissionovertheInternet,and(b)identifytwocodecsthatwouldbepresentedtoconsumers
inafinalroundofaudioqualitytesting.

AUDIOCODECSFORTEST
Thecodecsforthistestwerechosentorepresentsomeofthemostpopularavailable,aswellasthenewest(and
arguably,best).Thechoicesarelistedinthetablebelow,alongwithinformationontherelevantstandards,if
applicable,theirhistoryandlicensingrequirements.

Codec
LAME

SourceInformation
lame.sourceforge.net

Description
AfreesoftwarecodeccompatiblewithMP3playback.
Althoughawidelyavailable,itincludestechnologycoveredby
otherpatentholders.LAMEsdevelopersconsiderit
educationalsoftwarethat,assuch,doesnotinfringepatents.
Commercialusemayrequirelicensefees.

MPEG2Layer
III(MP3)

AACLow
Complexity
(AACLC)

ISO/IEC111723(1993)
ISO/IEC138183(1995)

http://www.iis.fraunhofer.de/en/b
f/amm/produkte/audiocodec/audi
ocodecs/mp3.html

Awidelyusedcodecusedforconsumeraudiostorageand
transmissionencodingandplayback.Itisincludedinthe
firmwareofmostdigitalaudioplayers.

ISO/IEC138187(1997)
ISO/IEC144963(2001)

AdvancedAudioCodingwasdesignedasthesuccessortothe
MP3format,achievingbettersoundqualityatsimilarbit
rates.Itwasoriginallydevelopedinthemid1990sand
becameanMPEG2standardin1994.TheLowComplexity
(LC)profileisintendedforconsumerapplications:because
ofApplesuseiniTunes,itisinallAppleoperatingsystemsfor
iPhone,iPodandiPaddevice.Itbecamearegularpartofthe
Androidoperatingsystembeginningwithv3.1
(http://developer.android.com/guide/appendix/media

http://www.iis.fraunhofer.de/en/b
f/amm/produkte/audiocodec/audi
ocodecs/aaclc.html

Withsomeexceptions,thesoftwareissubjecttoroyalty
paymentsforuseinbroadcast,streamingandphysical
storage(seemp3licensing.com).

formats.html).BasedonEBUtesting(EBUTech3324,Sept.
2007),thiscodecprovidesgoodtoexcellentqualityat
128kbps.ThroughitsstandardizationinMPEG4theAAC
familyofcodecoftenappearswiththatidentification.
ThelicensingmodelfortheAACfamilyofcodecsputsafeeon
manufacturersordevelopersofenduserproducts,rather
thanusetobroadcastorstream(www.vialicensing.com).
High
EfficiencyAAC
(HEAAC)

ISO/IEC144963:2009
http://www.iis.fraunhofer.de/en/b
f/amm/produkte/audiocodec/audi
ocodecs/heaac.html

TheadditionofSpectralBandReplication(SBR)toAACLC
resultsintheHighEfficiencyAACprofile(HEAACor
aacPlus).WithHEAAC,thelowerpartoftheaudio
spectrumincodedwithAACLC,whileSBRencodesaguided
recreationtheupperfrequencyspectrum.HEAACoffersan
efficiencyimprovementofupto30%comparedtoAACLC.
Atbitratesof24to32kbps,HEAACautomaticallyadds
ParametricStereoprocessing,whichextractspositional
informationfromthestereosignalalongwithamonodown
mix,toachieveareconstructionofthedecodedsignalwith
thestereoimage.Thisachievesamoreefficienttransmission
thantwochannelHEAACwithsimilaroverallscoresfor
quality.WhenSBRaloneisemployed,itisreferredtoas
HEAACv1;whenParametricStereoisadded,itisreferred
toasHEAACv2oraacPlusv2.TheversionofHEAACin
ourtestingcanbedeterminedbythebitrateemployedfor
eachtest.

G.722.2
(AMRWB+)

ITUTG.722.2

Extended
HEAAC
(xHEAAC)

ISO/IECDIS230033
MPEGDPart3
ISO/IEC230033:2012

http://www.itu.int/rec/TREC
G.722.2/recommendation.asp?lan
g=en&parent=TRECG.722.2
200307I

http://www.iis.fraunhofer.de/cont
ent/dam/iis/en/dokumente/pr/20
08/voiceage_iis_engl.pdf

AdaptiveMultiRateWideband(AMRWB)iscodifiedin
G.722.2asanITUTstandardspeechcodec,orvocoder.A
standardformobilephones,AMRWBisintendedtoprovide
acceptablespeechqualityatverylowbitratesand
multimediasupportatmoderatebitrates.VoiceAge
Corporation,adeveloperandlicensorofdigitalspeechand
audiocompressiontechnologyforcommunications
applications,developedahifispeechandaudiocodec
versionofthecodecforbothspeechandmusic,AMRWB+,
makingitacandidatefortesting.
UnifiedSpeechandAudioCodingbeganascollaboration
betweenFraunhoferIISandVoiceAgeandcombinesthe
technologyofbothcompaniestofurtherperformanceand
efficiencyofthelossycompression,usingtimedomain
vocodingtoolsforspeechsegmentsaswellastransform
codingmethodsforgeneralaudio.Itisabletoblendbetween
thetechniquesdynamicallyinresponsetotheprogramsignal.
Aparametricstereooptionmayfurtherincreasecoding
efficiency.ThefinalstandardwasdevelopedbytheMPEG
audiosubgroupasanMPEG4audioobjecttype.xHEAACis
theMPEG"profile"whichencompassesHEAACv2aswellas
additionaltechnologywiththename"UnifiedSpeechand
AudioCoding"

10

Othercodecswereconsideredfortheexpertlistenertesting.However,therequirementthattheybeavailablein
theAndroidandiOSoperatingsystems,andavarietyconsumerproducts,limitedthefieldofcandidates.This
requirementwasoverlookedforAMRWB+andxHEAACcodecsbecauseoftheirstandardization,whichcanhelp
futureadoptioninconsumerdevices.4xHEAACcombinestheadvantagesofspeechandaudiocodecs,which
makesitpossibletooutperformeithercodingapproachalone.Thiswouldbeanothercodectoconsiderforfuture
implementation.Indeed,theperformanceofxHEAAC,describedelsewhereinthissection,suggestsittobea
futureupgradeforNPR,onceitbecomescommonlyavailableinproductsorifnolicensingfeeisinvolvedto
bundleitinaplayer.

PARTICIPANTSANDMETHODOLOGY
Elevenlisteners(4femalesand7males),allNPRstafforassociatedwithNPRaffiliates,listenedtoaudiosamples
trialbytrialtorankordertheaudiooutputofthecodecs.Listenerswerenottakenfromthegeneralpublicfor
thiseffort.Theywereallwellacquaintedwithcodecsandtheartifactsthatcodecscreate.Withinonetrial(e.g.,a
femalespeechsample)participantslistenedtothesamesampleproducedbysixcodecsatthesamebitrate,side
byside,allowingthemtocomparesmalldifferencesandrankorderthecodecsona0100scale.Figure4depicts
thescreentoenterdataforonetrial.Participantsweredirectedtoslidethenoteonthebarupanddownuntil
theyweresatisfiedthattheyhadrankorderedthesixcodecs.Participantswereencouragedtogiveeachcodeca
uniquenumber,eventhoughtheymayhavefelttwocodecswereequivalent.This,forexample,resultedin
participantsoccasionallygivingonecodeca23andanothercodeca24.

Figure4SnapshotofRankOrderScreen

Foreachnewtrial,listenersheardrandomlyorderedsamples,suchthatdifferentcodecswereplacedondifferent
sliderbars.Selectionsincludedfemaleandmalespeech,orchestra,singleinstrument,multiplevoicesand
voiceover(i.e.,commercials).ThecodecsdescribedaboveweretestedattheMPEGstandardbitratesof16,32,

USACandxHEAACcodectechnologiesweredevelopedjointlybyalargenumberofcompaniesandinstitutes,
includingVoiceAge,Sony,Panasonic,Samsung,Philips,Dolby,NTTDocomo.AlthoughFraunhoferandVoiceAge
mayclaimtohavecontributedthelargestpartofthetechnology,othershavecontributedsignificantly.
11

48,64and96kbps.Whiletestscouldbeconductedatotherbitrates(toprovideamoreuniformdistributionof
rates)somecodecsareoptimizedfortheMPEGrates.Theselectedratesavoidedthepossibilityofuneventest
results.

SOUNDSAMPLEMATERIALANDPREPARATION
Audiomaterialwasselectedfromavarietyofsourcesrepresentingmaleandfemalespeech,speechoveramusic
bed(voiceover),singing,individualinstrumentsandorchestralmusic.Allmaterialwasproducedfrom
uncompressedPCMfileswithasamplerateof44.1kHzin16bitintegerformat.(Someaudiowascollectedas
48kHzrecordings,whichwereconvertedto44.1kHzusingAudacity2.0rateconverterinthehighqualitymode.)
AllofthetestmaterialheardwasplayedbackasaWAVfile.Thisrequiredeachaudiosampletobeencodedand
decoded(transcoded)backtoaWAVfileforeachcodectype.TheLAMEsampleswereencodedanddecodedin
Reaperv4.26usingtheversion3.98codec.TheAMRWB+transcodingwasaccomplishedwiththeVoiceAge
EvaluationExecutable(http://www.voiceage.com/amrwbplus_eval.php),aWindowssoftwaretoolthatproduces
theoutputWAVfileinonestep.TheMP3,AAC(includingHEAAC),andxHEAACfileswerefurnishedtothe
FraunhoferInstitutefortranscoding.
Toensurethatthelistenersscoreswerenotbiasedbydifferencesinloudness,theaudiolevelofthesampleswere
measuredusingtheOrbanLoudnessMetersoftware,v2.0.3(downloadedfromhttp://orban.com/meter/).Before
transcoding,theaudiogainofeachWAVfilewastrimmedinAudacity2.0toapproximately23dBLUFS,according
totheIntegratedindicatorofthesoftwaretoolsITURBS.17702loudnessmeter.(TheEBUR128termLUFS,for
LoudnessUnits,relativetodigitalFullScale,isusedherein,andisequivalenttotheITUtermLKFS.)

PLAYBACKCONDITIONS
TheexpertlistenertestwasconductedwithSennheiserHD
600headphones,connectedtoaTascamUS144MKIIaudio
interface,providingtheaudioconnectionfromthetest
computer.Approximatelyhalfthetestsubjectslistenedin
NPRStudio5A,whichprovidedaveryquietenvironment.
Theremainderofthelistenersvolunteeredduringthe2012
PublicRadioEngineeringConferenceinLasVegas,andtests
wereconductedinaquiethotelroomusingthesame
headphonesandheadphonedriver.Aphotoofthehotel
listeningroomsetupisshowninFigure5.Listeners
operatedtheircodectestthroughEPrime,aPCprogram
toautomatetheplaybackofcodecaudiotestsamplesand
collectthelistenersresponses.

Figure5 Participantbeingsetupforexpertlistenertest

RESULTS
AscanbeseenfromFigure6,whenallgenreswerecombined,listenersfeltthatthreecodecscodecsperformed
betterthanothersat32,48and64kbps:HEAACV2,xHEAAC,andAMRWB+.AlthoughLAME,FraunhoferMP3

12

andAACLCweresatisfactoryat96kbps,theyweresignificantlyloweratallotherbitrates,renderingthem
unacceptableascandidatechoicesfortheconsumertest.
Figure6throughFigure11showresultsbygenre.Noticethatinspeech,AMRWB+fallsoffprecipitouslyat
64kbps,highlightingitsintendeduseforlowerbitrates.Withregardtomusic(i.e.,orchestral,choralandsingle
instruments)xHEAACandHEAACareconsistentlyhigher,withlistenersrankingthemassatisfactorystartingat
32kbps.(ThedataforHEAACexcludes64and96kbpsassomeofthataudiomaterialwaslaterfoundtohave
beeninadvertentlyimpairedinpreparationfortesting.However,testsconductedbyotherssuggestthatthe
resultsforHEAACwouldhavebeensimilartoAACLCat96kbps.)
Interestingly,resultswerequiteflatstartingat32kbps,suggestingthatlistenersdidnothearmarked
improvementsathigherbitrates.Giventheseresults,theaudioqualityconsumertestincludedxHEAACand
HEAACasthecodecsofinterest.
100
90
80

AACLC

70
60

AMRWB+

50

HEAAC

40

LAMEMP3

30

MP3

20

xHEAAC

10
0
16kbps

24kbps

32kbps

48kbps

64kbps

96kbps

Figure6ExpertlistenerscoresforALLGENREScombined

100
90
80

AACLC

70
60

AMRWB+

50

HEAAC

40

LAMEMP3

30

MP3

20

xHEAAC

10
0
16kbps

24kbps

32kbps

48kbps

64kbps

96kbps

Figure7ExpertlistenerscoresforSPEECHgenre

13

100
90
80

AACLC

70
60

AMRWB+

50

HEAAC

40

LAMEMP3

30

MP3

20

xHEAAC

10
0
16kbps

24kbps

32kbps

48kbps

64kbps

96kbps

Figure8ExpertlistenerscoresforORCHESTRALgenre

100
90
80

AACLC

70
60

AMRWB+

50

HEAAC

40

LAMEMP3

30

MP3

20

xHEAAC

10
0
16kbps

24kbps

32kbps

48kbps

64kbps

96kbps

Figure9ExpertlistenerscoresforSINGLEINSTRUMENTgenre

100
90
80

AACLC

70
60

AMRWB+

50

HEAAC

40

LAMEMP3

30

MP3

20

xHEAAC

10
0
16kbps

24kbps

32kbps

48kbps

64kbps

96kbps

Figure10ExpertlistenerscoresforVOICEOVERgenre

14

100
90
80

AACLC

70
60

AMRWB+

50

HEAAC

40

LAMEMP3

30

MP3

20

xHEAAC

10
0
16kbps

24kbps

32kbps

48kbps

64kbps

96kbps

Figure11ExpertlistenerscoresforCHORALMULTIPLEVOICESgenre

AUDIOQUALITYOFCODECSCONSUMERTEST1
Thepurposeofthistestwastodeterminehowconsumerswouldrespondtoaudiotransmittedatdifferentbit
ratesusingourselectedcodecsHEAACandxHEAACandwhethertheywouldcontinuetolistentothe
transmissionasthebitratedecreased.ConsumersheardHEAACandxHEAACsamplescodedat16,24,32,48,
64and96kbps,andheardsamplescodedwithMP3at16,24kbps(lowanchors)and128kbps(frequentlyusedin
streaming),plusAACLCat128kbpsandanunprocessedsample,knownasthereference.Theconsumertest
alsopresentedthematerialunderconditionsclosertothatofaveragelisteners:thelistenersusedAppleearbuds,
ratherthanhighendheadphones,andaPolkiSonictableradio.Also,thelisteningroomwasasmallinterior
officewithamoderatelevelofairhandlingnoise.Thesetestsweredesignedtodeterminethebitrateparameters
foranInternetaudiosystem,lessonethatisindistinguishablefromthesourceunderthemostdiscerningof
conditionsthanonethatiscapableofdeliveringaqualityexperienceforalllisteners.Webelievethetwotests
informthatdetermination.

AUDIOCODECSUNDERTEST
ThehighscoresoftheHEAACandxHEAACcodecswereprimaryconsiderationsforthetest.TheAMRWB+codec
performedwell,too,butitisnotcommontotheoperatingsystemsincurrentsmartphonesandpersonal
computers.ThesamecouldbesaidforthexHEAAC,butwefeelthiscodecisonthebesttrackforfutureadoption
bysmartphonemanufacturers.(ItalsohappenstobebasedonajointdevelopmentofFraunhoferInstituteand
VoiceAgeCorporation,thecompaniesthatwereactiveinthedevelopmentoftheHEAACandAMRWB+codecs,
respectively.)TheMP3codecwasincludedto(1)provideacomparisontoacodecandbitratefrequentlyusedin
Internetstreaming(MP3at128kbps),and(2)byoperatingatalowbitprovidealowreferenceforthetests.The
audiomaterialforallthreecodecswaspreparedinthesamemannerdescribedfortheexpertlistenertestsand
wastranscodedatFraunhoferInstitute.

15

PARTICIPANTSANDMETHODOLOGY
FortyparticipantswererecruitedfromTowsonUniversitystaffandadministrationviatheuniversitysDaily
Digestannouncementforum.Seebelowfordemographicdetails.
AgeGroup
1830
3040
4050
5065

Female
5
5
5
5

Male
5
5
5
5

Participantslistenedtoatotalof250speechandmusicaudiosamples,onebyone.Aftereachsample,participants
wereaskedtoanswer3questions:(a)onascaleof15(bad,poor,fair,good,excellent),ratetheaudioqualityfor
thesample;(b)onayes/nobasis,wereyousatisfiedwiththeaudioquality;and(c)wouldyoukeeplisteningtothe
radioorturnitoff,giventheaudioqualityofthesample.Intwosessions,participantslistenedoverloudspeakers
andearbuds.Sessionswerecounterbalancedsuchthat20peoplewereassignedtolisteningwithspeakersfirst
and20peoplewereassignedtolisteningwithearbudsfirst.Participantswereencouragedtoadjustthevolumeto
acomfortablesettingbeforebeginningthesession,afterwhichthevolumeremainedconstant.

RESULTS
DATANORMALIZATION:
Withineachgenre,uniqueaudioselectionswerepresentedforlisteningatdifferentbitrates.Thismethodology
wasspecificallyconstructedinordertominimizetheeffectoflistenersbecomingoverlyfatiguedbyhearingthe
sameselectiontoomanytimes.However,foreachaudiosample,anonprocessedreferencewasalsoincludedfor
listenerstorate.Inordertoensurethattheaudiocontentforeachselectionwasnotundulyinfluencingpeoples
ratings(MeanOpinionScoreandOn/Offdecision),scoreswerenormalizedtotheircorrespondingreferences.For
example,iftheMOSforaparticularreferencesamplewas4.5,scoresforcodecswerecalculatedinrelationtothe
4.5;ifthenextreferencesamplewas4.2,thosescoreswerecalculatedinrelationto4.2.
ANALYSISOFGENDERANDAGE
A2(Gender:male,female)x4(Agegroup:1829;3039;4049;5065)AnalysisofVariance(ANOVA)was
conductedtoexplorewhethermenandwomendifferentlyansweredwhethertheywouldleavetheradioon,
giventhequalityoftheaudio.Therewasamaineffectofage,F(3,10920)=8.14,p<.01,showingthat1829year
oldlistenersingeneralweremoremotivatedtoleavetheradioon,particularlyathigherbitrates.Therewasno
substantialdifferencebetweenmaleandfemalelisteners.
ANALYSISOFEARBUDANDLOUDSPEAKERLISTENING
Inordertodeterminewhetherconsumersratedaudiodifferentlywhenlisteningoverearbudsandloudspeakers,a
multivariateanalysisofvariance(MANOVA)wasconductedwithtypeoflistening,bitrateandgenreas
independentvariables,andMOS,turnoffscore,andsatisfactionscoreasdependentvariables.Thisanalysis
revealedsignificantdifferencesforgenreandbitrateonallmeasuresbutdidnotrevealdifferencesforloud
speakerorearbudlistening.Thusforthemainanalysis,allresponses,whethercollectedonearbudsor
loudspeakers,werecollapsedintoonemeasure.
16

ANALYSESOFMOS,COMBINEDANDSEPARATEDBYGENRE
Figure12showslistenersmeanopinionscoreswhenallaudioselectionswerecombined.Asterisk(*)indicatethat
thecodecaudioqualityisstatisticallysignificantdifferentfromtheReference.NotethatMP3wassignificantly
worsethanHEAACandxHEAACatthelowestbitrates,andthatat32kbpsandabove,HEAACandxHEAAC
performedextremelywell,withlistenersratingthemasgood.Listenersdidnotdistinguishbetweentheaudio
qualityofHEAACandxHEAACat64kbpsandtheuncodedreferencesample.
5.0
4.5
4.0
3.5
3.0
2.5
2.0
1.5
1.0

16

24

MP3

1.8 *

2.0 *

HEACCv1

3.2 *
3.8 *

3.7 *
4.0

xHEAAC

32

48

64

96

128

WAV

4.2
3.9

4.1

4.0

3.9

3.9

4.1

4.1

AAC

4.2

Reference

4.1
Figure12MOSallaudiocombined

5.0
4.5
4.0

MP3

3.5

HEACC

3.0

xHEAAC

2.5

AACLC

2.0

Reference

1.5
1.0
16

24

32

48

64

96

128

Figure13MOSmusic

17

5.0
4.5
4.0

MP3

3.5

HEACC

3.0

xHEAAC

2.5

AACLC

2.0

Reference

1.5
1.0
16

24

32

48

64

96

128

Figure14MOSspeech

5.0
4.5
4.0

MP3

3.5

HEACC

3.0

xHEAAC

2.5

AACLC

2.0

Reference

1.5
1.0
16

24

32

48

64

96

128

Figure15MOSvoiceover

Figure16showstheOnOffscoreswithcombinedaudioselections.Whenansweringthisquestion,99%of
listenersfeltthattheywouldleavetheradioonat32kbpsandabove.Evenat24kbps,approximately94%of
listenersclaimedtheywouldleaveHEAACon.Figures1719showonoffscoresbygenre.

18

1.0
0.9
0.8
0.7
0.6
0.5
0.4
0.3
0.2
0.1
0.0

16

24

32

48

64

MP3

0.21

0.38

HEACC

0.72

0.94

0.99

1.00

0.99

xHEAAC

0.98

0.99

0.99

0.99

0.99

96

128
1.00

1.01

AACLC

1.01

Reference

1.00

Figure16Leaveonallaudiocombined

1
0.9
0.8
0.7

MP3

0.6

HEACC

0.5

xHEAAC

0.4

AACLC

0.3

Reference

0.2
0.1
0
16

24

32

48

64

96

128

Figure17Leaveonmusic

19

1
0.9
0.8
0.7

MP3

0.6

HEACC

0.5

xHEAAC

0.4

AACLC

0.3

Reference

0.2
0.1
0
16

24

32

48

64

96

128

Figure18Leaveonspeech

1
0.9
0.8
0.7

MP3

0.6

HEACC

0.5

xHEAAC

0.4

AACLC

0.3

Reference

0.2
0.1
0
16

24

32

48

64

96

128

Figure19Leaveonvoiceover

20

LOUDNESSCONSUMERTEST2
METHODOLOGY
FortyparticipantswererecruitedfortheloudnesstestfromTowsonUniversitystaffandadministrationviathe
universitysDailyDigestannouncementforum.Seebelowfordemographicdetails.
AgeGroup
1830
3040
4050
5065

Female
5
5
5
5

Male
5
5
5
5

Participantslistenedto128speechandmusicaudiosamples,dividedintoatotaloffoursessions.Inthefirsttwo
sessions,participantslistenedtosamplesviaspeakers,andthenrepeatedthetwosessionslisteningviaearbuds.
Speakersandearbudswerecounterbalancedacrossparticipantssothat20peopleusedspeakersfirstand20
peopleusedearbudsfirst.Participantswereencouragedtoadjustthevolumetoacomfortablesettingbefore
beginning,afterwhichthevolumeremainedconstant.
Inthefirstsegmentofsession1,participantslistenedtoasingleaudiosampleapproximately1525secondsin
length.Participantsweretoldthatsomesampleswouldcontainaloudnesschangeandsomesampleswouldnot.
Theywereinstructedtopressthespacebarassoonastheydetectedachangeinloudness,ifatall.
Inthesecondsegmentofsession1,participantslistenedtotwosamples,backtoback,andwereinstructedto
listenforaloudnesschangebetweensamples.Listenerswerepresentedwithdifferentcombinationsofaudio,
includingspeechtospeech,speechtomusic,musictospeechandmusictomusic.Sampleswereapproximately
1518secondsinlength,sothatlisteningtoasamplepairtookapproximately3035seconds.Theparticipantsdid
nothavetoidentifywhenthechangewasdetectedinsegmenttwo,asthechangeoccurredwhenthesecond
samplewasplayed.Aftereachsampleorsamplepairparticipantsanswered4questions:
1) Couldyoudetectaloudnesschangeonascaleof110(1didnotdetect,10easilydetectable);
2) Onascaleof110howannoyingwastheloudnesschange(1notannoying,10veryannoying);
3) Ifthischangeoccurredwhilelisteningtotheradio,wouldyoudonothing,adjustthevolumeorturnoff
theradio.
4) Ifthischangeoccurredonaregularbasis,wouldyoudonothing,adjustthevolumeorturnofftheradio.
Sessionthreewasareplicationofsessionone,andsessionfourwasareplicationofsessiontwo,withthe
participantlisteningviaspeakersorearbuds,whichevertheydidnotuseinthefirsttwosessions.

SAMPLEPREPARATION
Audioselectionsincludedmusicandspeech.Theselectionswerechosenbasedontheevennessoftheir
amplitude,usingtheOrbanLoudnessMeterwithitscurrentITURBS.17702algorithm.Eachcandidateselection
wasmeasuredata1samplepersecondintervalandsampleswereselectedthateachhadamaximumtominimum
loudnessvariationoflessthan4dB.Thisensuredthatlistenersscoreswouldnotbesignificantlybiasedbyone
partoftheselection,suchasthelastsoundstheyhearduringthesample.UsingtheaudiogaineditorinCoolEdit
21

Pro,theselectedsampleswereadjustedingainsothattheaverageloudnesslevelofthesampleswaswithin2dB
ofeachother.
ANALYSISOFAGEANDGENDER
Fortheloudnesstest,A2(Gender:male,female)x4(Agegroup:1829;3039;4049;5065)AnalysisofVariance
(ANOVA)wasagainconductedtoexplorewhethermenandwomenweredifferentiallyannoyedbychangein
amplitude.Therewasnostatisticaldifferenceamongagegroups,andnostatisticaldifferencebetweenmaleand
femalelisteners.

WITHINSTREAMRESULTS
Bypressingabutton,peoplewereaskedtoidentifywhereachangeinamplitudeoccurredwithinasingleaudio
stream.Correctresponseswerecountedonlyiflistenerspressedtheirbuttonswithin6secondsoftheamplitude
change.Figure20showsthepercentageofcorrectandincorrectresponsesasthedBchangeincreasedfrom1dB
to9dB.Noticethatat6dBamajorityoflistenerswereabletocorrectlyidentifyachangeinamplitude,but
beneaththatlevel,80%oflistenerscouldnotcorrectlyidentifyachange.Peoplealsodemonstratedthefalse
positiveresponse,sayingthattherewerechangesinsamplesthatwereheldconstant.
1
0.9
0.8
0.7
0.6

IncorrectResponse

0.5

IncorrectNoResponse

0.4

Incorrectbeforeortoolate

0.3

Correct

0.2
0.1
0
1dB

2dB

3dB

4dB

6dB

9dB

Constant

Figure20Percentageofpeopleidentifyingchangeinamplitude

Figure21displaysparticipantsratingsofhowannoyingthechangewas,ona1(notannoying)to10(extremely
annoying)scale.Noticethatparticipantsdidnotthinkanyoftheamplitudechangeswereveryannoyingor
extremelyannoying,butthatat9dBparticipantsclaimedthatthechangewasbecomingannoying.

22

10
9
8
7
6
5
4
3
2
1
0
1dB

2dB

3dB

4dB

6dB

9dB

NoChange

Figure21Howannoyingthechangewas

Figure22andFigure23showhowlistenerswouldrespondtotheradioafterachangeoccurredwithinastream.
Figure22showsthatiflistenersbelievedthechangewereatypical,fewwouldturnthevolumeoff,andinsteadopt
toadjustthevolumeknob.However,asisseeninFigure23,iftheybelievedthechangewouldoccurfrequently,
morereportedtheywouldturntheradiooff.

Figure22Listeners'behaviorafterachangeoccurredIFthechangewasrare

23


Figure23Listeners'behaviorafterachangeoccurredIFthechangewasfrequent

BETWEENSTREAMRESULTS
Listenerswereaskedseveralquestionsafterlisteningtotwoaudiostreams.First,theywereaskedhowprominent
thechangewas.Figure24showslistenersratingswhenlisteningonearbudsorloudspeakers.Noticethatwhen
thechangesweresmall,participantsclaimedthattheywereeasiertohearonearbuds.However,whenthe
changewaslarge,bothearbudlisteningandloudspeakerlisteningwasequivalent.Similarly,asseeninFigure25,
listenerswereslightlymoreannoyedatlowerloudnesschangeswhentheylistenedoverearbuds,butthis
annoyanceevenedoutasthechangebecamelarger.
Figure24Howprominentchangewaswhenlisteningonearbudsorloudspeakers

24

Figure25Howannoyingchangewaswhenlisteningonearbudsorloudspeakers

WithregardtothedirectionofthechangeloudtosoftorsofttoloudFigure26andFigure27showthatasthe
changeincreasedmorelistenersreportedhearingthechangeandbeingannoyedbythechangeiftheaudiowent
fromsoftertolouder.
Figure26Howprominentchangewas

25

Figure27Howannoyingchangewas

Withregardtoparticipantsbehavior,aswiththefirstsegmentwherechangeoccurredwithintheaudioselection,
whenchangeoccurredbetweentwoaudioselectionsparticipantsweremorelikelytoturnofftheradiowhenthey
feltthischangewouldoccurfrequently.AsisevidentinFigure29,at6dBchangepeoplewerebeginningtotake
action,byadjustingthevolumeorturningtheradiooff.
Figure28Behaviorifradioloudnesschangeoccurredonce

26

Figure29Behaviorifradioloudnesschangeoccurredregularly

1
0.9
0.8
0.7
0.6
0.5
0.4
0.3
0.2
0.1
0

I'dadjustthevolumewith
thevolumeknob.
I'ddonothing.
I'dturnofftheradio.

3dB

6db

9dB

12db

Finally,whenweparsedthedataintospecifickindsofaudio,participantsweresignificantlymoreannoyedwhen
changeoccurredbetweenspeechsamples,ascanbeseeninFigure30.Itislikelythatpeoplearesignificantly
moresensitivetochangesinamplitudebetweenspeechsamplesbecausetheyexpectspeechtobemoreeven,
andaremorefamiliarwithchangeinamplitudewithmusic,asdemonstratedinFigure31andFigure32.
Figure30Annoyancebetweenaudiostreamsspeechtospeech

10
9
8
7
6
5
4
3
2
1
0

loudtosoft
softtoloud

3dB

6db

9dB

12db

27

Figure31Annoyancebetweenaudiostreamsmusictomusic

10
9
8
7
6
5
4
3
2
1
0

loudtosoft
softtoloud

3dB

6db

9dB

12db

Figure32Annoyancebetweenaudiostreamsspeech/musicandmusic/speech

10
9
8
7
6
5
4
3
2
1
0

loudtosoft
softtoloud

3dB

6db

9dB

12db

earbudearbudearbudearbudearbudearbud

28

TECHNICALRECOMMENDATIONS
Thesectionaddresseskeyfactorsthataffectthequalityexperiencedbylisteners,summarizedbelow:5

Absenceofdigitalcompressionartifactsintheaudio(bychoosingtherightbitrateinassociationwitha
givencodec);
Optimumdynamicrangetoheartheprogrammaterial,asdeterminedbythelistenersneeds(controlled,
forclarityinanoisyenvironment,oropenwhenthelistenerdesiresitandtheambientnoiselevelis
lower);and
Reliabilityofthestreamaudio(oflessimportanceforpersonalcomputersconnectedtobroadband
networks,butasignificantconcernforstreamingovermobilenetworks).

CODECSELECTIONANDBITRATE
Thetwotestresultsdiscussedearlierinthisreportprovidesdirectguidanceontheselectionofacodecandthebit
rate.Thefirststudy,byexperiencedlisteners,examinedavarietyofcommonlyusedcodecs.Intermsofefficiency
(thelowestbitratesrequiredtorenderhighmeanopinionscores)forarangeofaudiomaterial,thisstudy
identifiedthexHEAACandHEAACcodecsforfirstandsecondplace,respectively.xHEAACandHEAACprovided
similarresultsformusic,butxHEAACexcelledatvoicequalityatthelowestbitrates.Asdiscussedinthesection
AudioCodecsUnderTest,however,xHEAACisnotcurrentlyavailableacrossallsmartphoneandPCplatforms.
TheHEAACcodecissufficientlyavailableinconsumerdevices,inouropinion,toreceivethenomination.

Figure33TheAdvancedAudioCodec"family"

Thequalityoftheaudioitself,thatis,tobeaudiblyfreefromdistortion,noise,etc.,isunderthecontrolof
programoriginatorsandisbeyondthescopeofthisstudy
29

Figure33illustratesthecodecsundertheunifiedAACpatentlicense.Thecodecsarearrangedinorderof
decreasingage,aswellastheirincreasingbitrateefficiency,fromlefttoright.6TheAACcodecsrecommendedfor
NPRsstreamingservicesarecontainedintheMPEG4box.TheAACLowComplexity(AACLC)codecis
appropriateforratesofatleast96kbps.ItspawnedtheHighEfficiency(HEAAC)versionbyaddingSpectral
BandReplication.Atbitratesbelowapproximately96kbps,AACswitchestotheHEAACmode.Thistransitionis
automatic,butthespecificbitratemaybemodifiedbysendingconfigurationparameterstotheencoder.Atrates
of32kbpsandbelow,ParametricStereoprocessingisadded,tofurtherimprovecodingefficiencyandextend
usablestereooperationtoaslowas20kbps.Allthreecodecsaresupportedwithinonesoftwareencoderand
decoder,andtheirlicensingtermsaresharedwithotherformsofAAC,suchasAACLowDelay(AACLD).

CODECCOMPATIBILITYANDLICENSING
TheHEAACcodecisalmostuniversallyavailableinportabledevicesoperatedbyiOSandAndroid(atleastsince
version3.1Honeycomb,accordingtothedeveloper).7Androiddevicesbeforev3.1maynothaveHEAACon
board,however,asthesedevicesarenowmorethantwoyearsold,thispercentageisdecliningasconsumers
replacetheirdeviceswithnewermodelsandlaterversionsofAndroidOS.
OthersystemsthatsupportHEAACincludeBlackberry,Windowsphone,SymbianOSandthe3rdGeneration
PartnershipProject(3GPP),whichisacollaborationoftelecommunicationsentities.ServicessuchasPandora,
Netflix,huluandtheBBCiPlayeruseHEAAC,aswellasglobalbroadcastsystemssuchasDigitalVideo
Broadcasting(DVB),WorldDigitalMultimediaBroadcasting(DMB),DigitalRadioModiale(DRM).8
Inadditiontothetechnicalperformanceandavailability,NPRshouldconsideranycostsassociatedwiththecodec
choice.HEAAChasnofee,eitherforNPRtostreamorforlistenerstouse,providedthatthecodecwassupplied
bythephonesmanufacturer.ForthosemobiledevicesthatdonotincludetheHEAACcodec,theAACcodec
couldbeincludedinaplayersoftwaredownload,butthedeveloper(NPR)wouldbesubjecttomandatorylicense
fees:alicensefeeof$15,000forfiveyearsplus$0.98perdownloadforthefirst500,000units.9

Thesedesignationsaresuppliedinafigurebythelicensingagent,ViaLicensingCorporation,whichrepresentsa
numberofpatentholdersinvolvedinthedevelopmentofAAC.Thedesignationsmayvary.
7

Thepageathttp://developer.android.com/about/versions/android2.3highlights.htmlstates(on10/2012)that
the[Android2.3]platformalsoaddssupportforAACencodingandAMRwidebandencoding(insoftware).
Decodingwouldalsobeneededtoplaybackonesownfiles,althoughthisisnotstated.Thesupportedfilesand
containerformatsatthepagehttp://developer.android.com/guide/appendix/mediaformats.htmllistADTSraw
AAC(.aac,decodeinAndroid3.1+,encodeinAndroid4.0+).Thisstatementseemsinconsistentwiththefirst
reference.
8

HEAACTheUniversalSolution,FraunhoferIISpresentationtotheAudioEngineeringSociety,October2010.

LicensingonbehalfofthepatentholdersisadministeredbyViaLicensingCorporation;see
http://www.vialicensing.com/licensing/aacfaq.aspxandhttp://www.vialicensing.com/licensing/aacfees.aspx.
30

TheMP3format,whileuniversallyavailable,appearstobeanalternative.However,thedrawbackstothatcodec
include:

MP3streamingrequiresatleast2timesthebitrateofHEAACforsimilarquality,whichincreases
streamingcostsandmayaccumulatehigherdatafeesforlisteners;10
ThehigherstreamingrateneededbyMP3willlowerstreamreliability,comparedtoHEAAC.

AcostbenefitanalysisisindicatedforthecaseofHEAACcodecdownloadversusthecostofanalternativeMP3
stream.Thefollowingexampleshowsthat,dependingonthenumberoflistenerswhodownloadtheNPRapp
withHEAAC(asweunderstandthatthecurrentversionhasbeendownloadedatleast100,000times),thecostto
NPRofthecodeclicensecouldbemadeupinamatteroftimethroughstreamingcostsavingsbyusingHEAAC
insteadofMP3.TheexamplebelowillustratesthenumberofmonthstobreakevenfortheAAClicenseifthe
addedstreamingcostwere5peruserpermonth.
overalllicensefee

$15,000

perdownload

$0.98

numberofdownloads:

10,000

50,000

100,000

500,000

costperdownload

$2.48

$1.28

$1.13

$1.01

differenceperuserpermonth,128kbps

$0.05

timeitwouldtaketogettothedownloadcostfortheAACdecoder
(numberofmonths)

49.6

25.6

22.6

20.2

Anotherconsiderationforimplementinganyadvancedcodeciscomputersrunningolderoperatingsystems,such
asWindowsXP,whichlackedAAC.CodecupgradesthatincludeacompatibleAACdecoderarewidelyavailablefor
downloadontheInternet,anditmaybepossibletouseseparatepublicradiostreamsoftwaretoavoidthe
licensingfeeforconsumerPCsoftware,whichis$0.48perunitplus$32,000peryear.9CompatibilityofHEAAC
witholderApplecomputerproductsisnotlikelytobeanissueasApplesiTunesusestheAACcodecexclusively.

10

TheStreamGuyshostingservice(www.streamguys.com)leviesa$0.05extracostperuserpermonth(at1000
usercapacity)forcustomerswhostreamat128kbps,comparedtothosewhostreamat48kbps.
31

STREAMINGREQUIREMENTS
Whileitisundeniablethatthathigherbitratespermitpotentiallygreateraudioquality,thereareotherconditions
besidesaudioqualitytoconsider,suchasthereliabilityofthestream,whichrunscountertothebitrate.The
followingsectionexplainshowwirelessmobilenetworksaffectstreamreliabilityandhowthisimpactsthechoice
ofstreamingbitrate.

WIRELESSNETWORKS
Wirelessmobilenetworks,suchasVerizonandAT&T,areessentialtoamobileaudioservice.Thesecarriersare
licensedbytheFCCtoprovidetwowaybroadbanddataservicetolargegeographicareasoftheU.S.Accordingto
ConsumerWatchdog(www.consumerwatchdog.com),however,theypromisefaster4Gspeedswithoutactually
makingimprovementstoexistingproductsandservicesorwithoutdisclosingthemeaningoffaster.Prompted
bycomplaintsfromconsumersaboutirregularbroadbandspeedsandunevengeographicservice,theFCC
announcedthatitwilltestmobileoperatorssystemsinthenearfuture(seeFigure33).Someindustryanalysts
doubtthatmuchcanbedonetoimproveperformance,ascustomergrowthanduserdemandforgreateramounts
ofdatawillcontinuetosoakupthenetworks4GandLTEspeedimprovements.

Figure34Examplesofnewsreportsaboutthemisleadingspeedclaimsofmobilewirelessoperators;FCCintentiontoinvestigate

Thebandwidthofwirelessnetworksisnotanabsolute:itcanbedescribedbyaverages,withstatisticalvariations
aroundamean.ThemeasurementsinFigure34andFigure35furtherillustratethatbitrates,whengroupedinto

32

ranges,varyintheiroccurrence(frequency).11ThemeasuredratescompareUDP,anInternettransportprotocol
withnodataretransmission,toTCP,atransportprotocolthatcanrequestretransmissionofdatapackets.UDPis
commonlyusedforstreamswhileTCPisacoreprotocolusedontheInternetforreliablefiletransfer.

Figure35StatisticalavailabilityofbitrateforTCPandUDPprotocols

Reliableprotocols,suchasTCP,guaranteecorrectdeliveryofeachbitinthemediastream.However,they
accomplishthiswithasystemoftimeoutsandretries,whichmakesthemmorecomplextoimplementthanUDP.
UDPisitpreferredforstreamingbecauseithaslesslatencyandprocessingoverheadthanTCP,butitisless
reliablebecausepacketcheckingandretransmissionrequestsarenotpossible.Witheitherprotocol,whenthereis
datalossonthenetwork,amediastreamstallswhiletheprotocolhandlersdetectthelossandretransmitthe
missingdata.Clientscanminimizethiseffectbybufferingdata,butattheexpenseoflongerstartuptimesand
latency.
ItisapparentthatUDPoffershighermoreoccasionsforhigherdownstreambitratesthanTCP.TheFigure35
chartsshowthatbitratesinthecityhavea
widervariabilitythanthehighway
measurements.Itisimportanttonotethat
thesemeasurementsareaveragesovertime,
andthatthedata,especiallywithUDP,could
includeshorterintervalswhennodatais
received.
Figure36showsasimplificationofhowaudio
datapacketscanflowoveranInternetnetwork
(wirelessorwired)toaUDPdevice,asthesmall
blocksalongthebaseline.Theaccumulated
dataovertimeisshownwiththeblueline.If
theaudioplayoutrate,shownbytheredline,

Figure36Dataflowforstreamingandfiletransfer

11

TheEffectsofMobilityonStreamingMediainCDMA20001xEVDONetwork,PhillipaSessini,et.al.,Dept.of
ComputerScience,UniversityofCalgary,Proc.SPIE,Jan.2007.
33

exceedsthereceiveddata,dropoutsintheaudiowilloccur.Ifaloweraudioplayoutrate,showningreen,isless
thantherateatwhichdataaccumulatesinthebuffer,uninterruptedaudioresults.Thesescenariosdonotapply
tofiledownloadsystems,suchasPandora,whichareeffectivelyfiletransfersystemswithalargedatabuffer:the
datatransfercanevenstop,briefly,aslongasthedataburstskeepaheadofthefileplayoutrate.(Itisbelieved
thatPandoraandsimilarfileplayoutsystemsmayalsoshifttoalowerplaybackbitrateinanticipationofadata
shortfall,whichhelpsavoidaudiodropouts.Becausepodcastsarefiletransfers,theirplaybackafterdownloadis
complete;hence,highburstybitratesarepossiblewithouteffecttotheplayback.
Priortothisprojectweconstructedaninvehiclesystemtomeasurethereliabilityofdifferentstreams.Todothis
weconnectedtheaudiooutputfromasmartphonetoanaudiosensorandcomputerthatcouldrecordtimeand
locationalongwithaudioavailability.Thephonewasplacedintheclear,inaholderatthetopofthedash,anda
GPSantennawasplacedontheroof.Ourprimaryinterestwastodetermineifstreamratewouldaffectreliability.

Figure37Typicalroute,betweenNPRandhomeforoneoftheauthors.

ThemapinFigure37showsatriptoNPRheadquarters,intheupperright,fromtheVirginiasuburbsasaseriesof
dots.Thegreendotsindicatestreamaudioandthewhitedotsshowdropouts,usinganAppleiPhone3GSonthe
AT&Tnetwork.12Theinsetchartshowstheaudioavailabilityovertimeforthesametrip.Itisapparentthatthis
stream(at128kbpsfromNRKKlassik,inNorway)hadseveraldropouts,withtwolastingabouttwominuteseach.
Ourtestingfoundthatlowerbitratestreamswereconsistentlymorereliablethanhigherstreams.

12

TheiPhone3GSintroducedsupportfor7.2MbpsHSDPA(HighSpeedDownlinkPacketAccess)intheUniversal
MobileTelecommunicationsSystem(UMTS),athirdgenerationmobilecellularsystemfornetworksbasedonthe
GSMstandard.Thiscapabilitywouldappeartofarexceedthe0.128Mbps(128kbps)streamrateintheexample.
34

Thestreamloggingsystemwasexpandedlatertosupporttwosmartphones.Inthatwaywecouldby
simultaneouslymeasuretwodifferentstreamsinonevehicleduringthesametrip,cancellingoutthevariations
thatmayoccurfromtriptotrip.Figure38showstwomemberstationswithstreamratesinthistestof32kbps
(upper,inred)and128kbps(lower,inblue),bothreceivedoniPhone4sontheAT&Tnetwork.Thehigherbit
ratestreamexperiencedmoredropouts,includingone17seconddropatapproximately573secondsfromthe
start.
Dropoutbehaviorwithhigherbitratestreamswasfrequentlylessreliable,butnotalwaysconsistent.AsFigure38
shows,thedropoutsarenotcorrelatedintime,suggestingthatinpopulousareaswherethesignalisconsistently
high,thedropoutsarecausedbynetworkcapacitylimitations.

Figure38Comparisonofdropoutsonsimultaneousstreamsat128and32kbps

Ourtestsfoundsignificantlyhigherdropoutduringpeakmorningandeveningtraveltimethanduringoffpeak
times,whichreinforcesourtheorythatnetworkloading,notsignal,wasreducingstreamreliability.Thismaybe
duetotravelfromonewirelesscellintoanothercellthatisattemporarilyatcapacity,wherenoothercellis
availableforhandoff.Wealsofoundcaseswherehandoffappearedtobedelayed,suchasthesectionofI66
leavingWashingtonandenteringArlington,wherewetheorizethathandoffbetweenWashingtonandVirginia
networksisoccurringthatcausesregularandpredictabledropout.Thedropoutsareusuallymoreprolongedwith
highbitratestreamsinthissectionofroadway.

CODECANDBITRATES
Fromthetestingdiscussedabove,weconcludethatlowerbitratesoffernoticeablyhigherresilienceagainst
momentarynetworkcapacityissuesandpoorcellhandoff.Theeffectofdropoutsonlistenersatisfactionwasnot
measuredinthisstudy,butwepresumethatlistenersannoyancewithdropoutsrisesquicklywithfrequencyor
duration,andcouldleadtotuneouts.Basedonourresearch,werecommendkeepingthebitrateofstreams
aslowaspossible,consistentwithhighaudioquality.FortheHEAACcodec,webelievethisbalanceisachieved
ataconstantstreamrateof48kbps.(bitrateisnotsuitedtowirelessstreamingastheinstantaneousbitrate
mayexceedthedatachannelcapacity,resultinginincreaseddropouts.)

35

LOUDNESSCONSISTENCY
Nexttodigitalaudioquality,theuniformityofperceptualloudness,bothfromstreamtostreamandfrom
programtoprogramwithinstreams,isoneofthemostimportantfactorstolistenerenjoyment.Intelevision,for
example,thematchingofcommercialannouncementstoprogrammingandcontrolofdynamicrangehave
requiredlargeinvestmentsintechnologyresearchanddevelopmenttotacklethisproblemtoconsumers
satisfaction.Streamingservicesforpublicradiodeservesspecialconsiderationbecauseofthewidevarietyof
environmentsinwhichlistenersmayhearstreamsandbecausetheycaneasilyswitchbetweenstreamsofvarying
programtypes.
Asexplainedbelow,thecurrentconsistencyofloudnessbetweenpublicradiostreamsispoor.Thisappearstobe
duetomultiplefactors,includingalackofstandardsfordigitalaudiotransmissionlevel,inattentiontoaudiolevels
thatinadvertentlygetoutofadjustment,andgreatvarietyinaudioprocessingdevicesandthewaytheyare
operated.Atthesametime,audiostreamsontheInternetunrelatedtopublicradiooperateacrossanenormous
rangeofaudiolevels,whichpresentsissuesforlistenerswhomayswitchtoandfrompublicradiostreams.
Listenersmayhearpublicradiostreamsinavarietyofenvironments,fromanoisysubwaycartoaquietliving
room.Thoseinnoisierlocationsneedtightcontrolofdynamicrangetoheareverywordofatalkshowor
newscast,andeverynoteofasymphony.However,thedegreeofloudnessmanagementnecessaryfornoisy
listeningconditionsisentirelydifferentthaninaquietroom.Consequently,wepresentasolutionthatputsthe
activeloudnessmanagementintheplayer,wherethelistenercanchoosethedynamicrangethatssuitablefor
theirenvironmentandtaste.

CURRENTEXPERIENCE

Figure39Audiosamplesof49publicradiostationscarryingNPR'sWeekendEdition,Saturday;variationinlevelexceeds21dB

Figure39isascreenshotoftheprogramlevelsfrom49publicradiostationstreams,allcarryingthesame
program:WeekendEdition,Saturday,onFebruary21,2012.Eachsampleismorethan20seconds,whichaffords
timetodetermineeachstreamslevel,whichrangesbymorethan21dB.Listenerswhoswitchbetweensomeof
thesestreamswouldhearchangesinloudnesslargeenough,accordingtoourresearch,tobeannoyingoreven
causelistenerturnoff.ThisisclearlyachallengeforpublicradiossuccesswithInternetaudiostreams.

36


Figure40Audiospectrogramof49publicradiostreams

Figure40showsafrequencyspectrogramofthesame49publicradiostreamscarryingWeekendEditionSaturday.
Theheightsoftheverticalbarsshowthehighestfrequencytransmitted,from43Hzatthebottom,to20kHznear
thetop,with9991Hznearthecenter.Colorsindicatetheamplitude,fromred(highestamplitude)todarkgreen
(noenergy).Severalstreamscontainsubstantialenergyupto15kHz,whileseveralothersrolloffjustabove5kHz.
Thesevariationsareduetodifferentbitratesforthestreams,aswellasdifferentcodecs.Listenersswitchingfrom
streamtostreamwillhearlargevariationsinthequalityandnaturalnessofspeechandmusic.
OriginalAudio

PeakNormalized

30
25
20
15
10
5
0
1

9 11 13 15 17 19 21 23 25 27 29 31 33 35 37 39 41 43 45 47 49

Figure41Loudnessofsameaudiosamplesasabove(CBSLoudnessMeter)inblue;orangebarsshowlevelvariationsaftereachsamplewas
normalizedaccordingtoitselectricalpeakvalues.

Figure41isachartofloudness(measuredbytheCBSLoudnessMeter)ofthe49publicradiostreamsbeforeand
afterpeaknormalization.Thistechniqueadjuststheaudiogainofeachstreamsampletoproducethesame
electricalpeaklevelacommonpractice,especiallyintheeditingofdigitalaudio,astheentirefilecanbe
examined.However,thistechniqueisalsosimilartomostaudioprocessors(compressorlimiters),whichadjust
theirgaindynamicallytoproduceaconstantelectricalpeaklevel.
37

Thedifferenceinheightofthepeaknormalized(orange)barstotheoriginal(blue)barsshowsthatdespitepeak
normalization,theloudnessvariesbyupto8dBacrossthestreamingsamples.Thisdemonstratesthatevenwhen
audioisleveledaccordingtopeaks,therecanstillbevariationsinloudnessthatcouldbeunnatural,oreven
annoyingtolisteners.WerecommendthataudioproductionimplementanEBU128/ITURBS.17702loudness
measure,whichimitatesthewayhumansperceiveloudness,ratherthanrelyingonelectricalpeaklevelalone.13
Thisisdiscussedfurtherinthenextsection.
Evenifthealltheaudioprocessorswerethesameandadjustedalike,theyonlyprovideuniformpeaksnot
necessarilyuniformimpressionofloudness.Thetelevisionindustryhasdevelopedautomaticloudnesscontrollers
forprogramaudio,basedaroundtheInternationalTelecommunicationsUnionstandardITURBS.17702,
Algorithmstomeasureaudioprogrammeloudnessandtruepeakaudiolevel.Thisstandarddefinesaloudness
measurementprocedurethatgenerallyfollowshumanhearing.Italsohasbeenusedtobuildasystemtoregulate
loudnessforbroadcastandstreamingapplications.Followingourreportontestsofthetoleranceoflistenersto
loudnesschanges,wepresentrecommendationsthatbuildonourresultsandcurrentloudnessmeasurement.

LOUDNESSCONTROLANDMANAGEMENT
PROGRAMLEVELMEASUREMENT
Toensureliveproductionstreamsareencodedwithuniformprogramlevels,thefollowingproceduresare
recommended,basedonthescalesinFigure42:14
Alignment
Level(+4dBm)

Measurement
Level(30dBfs)

Digital
FullScale
(0dBFS)

Permitted
MaximumLevel

9dBFS

IECType1PPM

20

16

12

18dBFS
20

15

(12)

10

3 2 1 0 1 2 3

IECSVI(VUmeter)

Figure42AlignmentandmaximumprogramlevelsforcommonmetersbasedonITUandEBUrecommendations

13

Loudnessnormalizationandpermittedmaximumlevelofaudiosignals,EBURecommendationR128,Geneva
2011,andPracticalguidelinesfordistributionsystemsinaccordancewithEBUR128,EBUTech3344,Geneva,
October2011.
14

ThisgenerallyfollowsITURBS.1726,Signallevelofdigitalaudioaccompanyingtelevisionininternational
programmeexchange,andITURBS.6452,Testsignalsandmeteringtobeusedoninternationalsound
programmeconnections.
38

ForsystemsusinganIECType1peakprogrammeterforproduction,thePPMsPermittedMaximumLevel
(typically0dBonitsscale)withatesttoneisset9dBbelowtheencodersDigitalFullScale(0dBFS,
shownastheredverticallinetotheright);
o Maximumprogrampeakswilloccasionallyreachthe0dBscalelevel,with9dBforunexpected
increasesinprogramlevel;
o Lineuptoneat18dBbelowDigitalFullScalewouldtypicallyindicateat9dBonthemeterscale;
o Measurementtonesat30dBFS,forexampleforfrequencyresponsetesting,wouldindicate
21dBbelowMaximumPermittedLevel;
ForsystemsutilizingaANSIVUMeter(IECVolumeIndicator),atonea0VUindicationisset18dBbelow
DigitalFullScale;
o Maximumprogrampeakswilloccasionallyreachthe0VUscalelevel,with18dBofheadroom
forpeakovershootandunexpectedincreasesinprogramlevel;
o Measurementtonesat30dBFS,forexampleforfrequencyresponsetesting,wouldindicate
21dBbelowMaximumPermittedLevel.
TomonitorthelevelofstreamaudiowerecommendmeasurementwithBOTHanIEC6026810Type1
meterforquasipeakelectricallevel,andanITURBS.17702loudnessmeter.Thefirstmeterensuresthat
thetransmissionwaveformisnotclippedandthesecondmeterdeterminestheinstantaneousloudness.

LOUDNESSVALUESSHOULDBETHETARGET,NOTPEAKLEVELS
Figure43showsacomparisonbetweenloudnessnormalizedaudio(normalizedwithinaseriesof20to30
secondsubsections)andthesimultaneouselectricalpeaklevel.Theloudnessrangesaroundthe
nominal23LUFSvalue,withavarianceofatwoorthreedB.Atonepointofmaximumloudness(around400
secondsfromthestart),theelectricallevel(shownbythedoublearrowheads)is9dBabovetheloudness

Figure43 Graphofloudnessnormalizedaudioandpeaklevel

39

value.Atabout340secondsfromthestart,themaximumelectricalpeakismorethan20dBabovethe
loudnessatthesamemoment.Ifwehadnormalizedaccordingtotheelectricalpeaks,itisclearthatloudness
couldbethrownoffbymanydB.Peakrespondingcompressorsandlimitersdothatbydesign.Asdiscussed
earlier,thestreamingsystemhasheadroomsimilartotheequipmentusedtoproduceprogramming:peak
controllingdevicesthataffectthenaturaldynamicsinprogrammaterialarenotrequiredifthetransmission
levelplanofFigure42isobserved.
Forbothprogramproductionandstreamtransmission,loudnessoftheprogramshouldbetheprimary
measurementpeaklevelsshouldonlybeobservedtoavoidclipping.TheleftsideofFigure44illustrates
programmingthatisproducedwithpeakmetering,
postprocessedwithpeaklimingandcompression,
orpeaknormalized;theredlineshowsuniform
peaklevelfromprogramtoprogram.However,
theloudness,shownbythegreenline,variesup
anddown.Intherightpanel,theloudnessofeach
programisevaluatedanditsgainisincreasedor
decreasedtoproduceanearlyuniformresult.The
electricalpeakshavebeenshiftedbythegain
changes,whichproducesnonuniformlevels.If
sufficientheadroomisbuiltintothesystem,the
Figure44 Normalizationbypeaklevelversusloudnesslevel
upwarddisplacementinpeakswillnotbeaproblem.

Figure45OrbanLoudnessMeterv2.0.8,showingthemeasurementbarsITUBS.17702ShortTerm&Integrated(settoEBU+18Relative
scales)andReconstructedPeak

Forengineeringwork,anexcellentfreesoftwaremeterthatprovidesthespecifiedandothermeteringstandardsis
theOrbanLoudnessMeterforWindowscomputers(http://orban.com/meter/).Thissoftwareprovidesrealtime
40

displayaswellastheabilitytorecordthemeasurementstoaCSVfileforlateranalysis.Itincludesa
ReconstructedPeakmeterthatindicatesthepeakvaluefollowingD/Aconversion,whichcancauseclippingas
theymaybehigherthanthedigitalsamplesbeforeconversion.Cautionisadvisedinusingthismeterwiththe
WindowsXPoperatingsystem,asitscalibrationisaffectedbyacombinationofcontrolsinWindowsaudio
mixer.15TheOrbanmeterisexcellentfortechnicalmonitoring,butitswindowislargeandcannotberesizedor
simplified.Werecommendtheuseofmetersdesignedforaudioworkstationsproductioneditorsthatprovidea
simplerloudnessindicator,suchastheexamplesinFigure46andFigure47.

Figure46LevelViewSprovidessimpleEBU/ITUloudnessmeteringinliveandintegrated(longterm)forms.

Figure47TheNUGENAudioVisLMHprovidesbothrealtimeandhistoricalchartingofaudiolevels.

LOUDNESSMONITORINGANDNORMALIZATIONOFSTREAMS

15

ForcomputersrunningtheWindowsXPoperatingsystem,itisveryimportanttocalibrateyourmeterdevice
throughtheWindowsWaveI/OorStereoMix,toensureaccuracyandtopreventthesounddevicedriverfrom
clipping.ForWindowsVistaand7/8,themeterusesWindowsAudioSessionAPI(WASAPI)Loopback,whichallows
themetertomonitorplayoutfromanyWindowsAudioplayerapplicationandeliminatestheWaveI/OorStereo
Mixrequirements.LevelcalibrationisunnecessarybecausetheWASAPILoopbackisinternallyunitygain.
41

Streamvalidation,theprocessofmonitoringaudiostreamsforquality,caneasilyincorporateaudiolevel
monitoringtoensurethataudioistransmittedwithacceptableranges.Thereareseveralfirmsthatdothat
commercially,especiallyforaudiothataccompaniesvideo.Fortunately,theneedtomanageloudcommercials
whileofferingpleasingamountsofdynamicrangehadledtoconsiderabledevelopmentinvideoproductionand
transmission.16Therearetoolsavailableforrealtimemonitoringandreportingofaudiostreamlevels,suchasthe
productillustratedinFigure51.Thistypeofsoftwarecouldbeusedtospotaudiolevelproblems,whilethe
streamvalidationschecksdigitalquality,metadata,andotheraspectsofeachstream.Werecommendthat
automatedloudnessmonitoringbeincludedinthestreamvalidationsystem.Asdiscussedbelow,the
monitoredlevelsprovidesinformationtoadjustandnormalizetheaudiotransmissionlevelofanystreamsthat
Toensureapleasantexperienceforlisteners,theaudiolevelofallstreamsshouldachievealongtermloudness
targetof23LUFS.
20

10
Jazzmusicsamplewithlittledynamiccompression,moderate
dynamicrangeandclosetonominallongtermloudness

dB

10

20

30

40
BS.17702
Loudness

Max.LRPeak

LRA
(DynamicRange)

LongtermAvg.
Loudness

23LKFS

Figure48Severalminutesofjazzmusic;BS.17702loudness,peakandlongtermloudnessarerelativeto0dBFS

Figure48showsseveralminutesofjazzmusic(runninglefttoright)fromanactualInternetstream,where0dBFS
isdigitalfullscale.ThechartprovidesITURBS.17702measurementscollectedbytheOrbanmeterapplication:

16

IntheU.S.,inresponsetotheCALM(CommercialAdvertisementLoudnessMitigation)Act,theAdvanced
TelevisionSystemsCommitteeestablishedastudygrouptodeveloparecommendedpracticeA/85thatincludes
measurement,metadatainsertionintoaudiostreamsandloudnessmanagementattheconsumerstelevision.
TheEuropeansformedtheEBUGroupP/LOUDandalsodevelopedextensivesystemstomonitorandcontrol
audioprogramloudness.Theirsystemisevenmoreapplicabletoaudioonlytransmissionsystems,anditis
alreadyinuseinNorwaysdigitalaudiobroadcastingsystem.
42

Peaklevelisshowninred;
ShorttermBS.17702loudnessisdarkblue;
Loudnesstarget23LUFSisshowninyellow;
Averageloudnessislightblue;
Loudnessrange(LRA)isgreen;likeaverageloudness,thismeasurementdevelopsovertime,butskips
silentintervals;thissamplestartsat0dB,risesandsettlesaround17dBontheright.

Thisjazzsampleshowsloudnessbrieflyexceeding23LUFS,butthelongtermaverageatfarrightis
about25LUFS,whichis2dBbelowthetarget.However,peaksexceedthePermittedMaximumLevelof9dBFS
onseveraloccasions,whichleavestoolittlemargintoincreaseprogramlevel.Thetransmissionlevelforthis
programsegmentisconsideredsatisfactoryasis.
20

10
Approx. 2minutesofhighlydynamicclassicalmusicwithminimalprocessing;longterm
averageloudnessisslightlybelowtargetandisacompromisewithpeaksthatoccasionally

dB

10

20

30

40
BS.17702
Loudness

Reconstructed
peak

LRA
(DynamicRange)

LongtermAvg.
Loudness

23LKFS

Figure49A2minutesectionofrelativelydynamicclassicalmusic

ApassageofclassicalmusicisshowninFigure49accordingtotheBS.17702measurementsavailablewiththe
Orbanmeter.Itdisplayslargervariationthanthejazzsample,butthelongtermaverageloudnessalsosettles
about2dBlessthanthetargetloudness.PeaklevelfrequentlyexceedsthePermittedMaximumLevel,which
arguesagainstanincreaseinprogramlevel.Classicalprogrammingmayhaveverywidedynamicrange,thus,a
longtermaverageloudnessobjectiveof23LUFSshouldbeadjusteddownwardifelectricalpeaksfrequently
exceedthePermittedMaximumLevelof9dBFS.

43

20

10

dB

10

20

9
+3

30

40

Streamwithheavydynamic
compressionandhighloudness
BS.17702
Loudness

Reconstructed
peak

Streamwithlessdynamic
compressionandlowerloudness
LRA
(DynamicRange)

LongtermAvg.
Loudness

23LKFS

Figure50Recommendedwaystohandlestreamsthataretooloudortoosoft

Figure50illustratestwoactualaudiostreamsandhowtheirtransmissionlevelsshouldbeadjustedtoprovidethe
bestexperienceforlisteners.Thestreamonthelefthalfistalkwithheavydynamiccompression,asshownbyits
peaklevelinred.Thecompressionandlimitingpreventthepeaksfromreachingmaximumlevel(0dBFS),butit
alsohasanaverageloudnessofapproximately14LUFS,whichis9dBabovethetargetloudnessof23LUFS.
(Notethatthedynamicrange,indicatedbyLRA,ingreen,isabout8dB.)
Theaudiostreamontherightistalkwithlessdynamiccompression,withpeaksreaching10dBFS,andanaverage
loudnessofapproximately27LUFS.Thisstreamcouldbeincreasedintransmissionlevelby3dBwithoutdriving
peaklevelstooclosetothemaximum.Fortheshortintervalusedforthisdemonstration,thisstreamhasa
dynamicrangeofapproximately12dB.Whilethisstreamwouldmomentarilyvaryinloudnessbecauseofitsmore
opendynamicsound,itslongtermloudnesswouldbecomparabletothedenseraudiosampleontheleft,when
bothareadjustedtowardthe23LUFStarget,asshownbytheyellowarrows.Whiletheloudnessofthefirst,
highlycompressedstreamisreduced,thisapproachistheonlywaytoensurethatlistenersarenotannoyedby
jumpsinloudnessfromstreamtostream.17

17

Therearemanyofarticlesdiscussingthegrowingconcernofthesocalledloudnesswarinpubliclydistributed
audio.Examplesofthoughtfulpapersare:TheLoudnessWar:Background,SpeculationandRecommendations,
EarlVickers,AES129thConvention,4November2010;Loudnessnormalisationandpermittedmaximumlevelof
audiosignals,EBUTechnicalRecommendationR128,EuropeanBroadcastingUnion,http://tech.ebu.ch/loudness,
2010;.TheLoudnessWars:WhyMusicSoundsWorse,NPRMusic,31December2009;NorwaysDABnetwork
embracesloudnessnormalization,BjrnAarseth,EBUTechnicalReview,13February2012.
44

BATCHNORMALIZATIONOFAUDIOLOUDNESSFORFILESANDPODCASTS
Thejobofloudnessmanagement,filebyfile,canbeautomatedwithexistingsoftware.Oneexampleis
LevelOne,developedbyGrimmAudio(http://www.grimmaudio.com/pro_software_levelone.htm).This
programcanserveaspartofaningestsystem,which,accordingtoGrimm,analyzesfilesupto100timesfaster
thanrealtime,normalizingprogramsorprogramsegmentsbeforetransmissionorassemblyintolargerprograms.
Figure51showstheLevelOnewindowinusebynumberofNRKprogramfilescarriedbyNorwaysDigitalAudio
Broadcastingsystem.

Figure51Exampleofacommercialbatchfileprocessor,madebyLevelOne,forfileloudnessnormalization

AnotherloudnessmanagementsystemistheAudioToolsServerforfilebasedworkflows
(http://minnetonkaaudio.com/).Thedeveloperstatesthatthisproductincludesadialoganchoringtechnique
thatproducesmoreconsistentspeechlevels,evensurroundedbymusicorsoundeffects.(Thesearetechniques
developedforvideoproduction,althoughtheymayhaveapplicationinpublicradioprogramswithmixedspeech
andothersoundcontent.)
AnopensourcesolutionisFreeLCS(LoudnessCorrectionServer)http://freelcs.sourceforge.net.Thisfreesoftware
issetuponanetworkserverwhereuserscandropaudiofilestobeloudnesscorrected.Thesoftwarecreates
graphicfilesthatdisplaytheinternalloudnessvariationsinafile.Loudnesscorrectedversionsofeachfileare
movedtoaHotFolderforlatertransmission.Theprogramsupportsencodedfileformatsandusesaprotective
limitertopreventpeaklevelsfromclippingwherethevolumemustbeincreased.Thesoftwareiswrittenin
Python3whichallowsmodificationsandintegrationwithotherprogramsandrunsonUbuntuLinux.

LOUDNESSLEVELSRELATIVETOOTHERINTERNETSTREAMS
TheInternetisopenterritoryasfarasstreamloudnessisconcerned.Anincreasingnumberofstreamoperators
arecompressingtheirprogramcontentandoperatingpeaksclosetomaximumlevel.Evenintwoofthreestreams
oftheBBC,dynamiccompressionisevident:Figure52showssampleclipsfromRadio3(classical),Radio2(adult
contemporary)andRadio1(Top40).TheverticalscaleisdBrelativetodigitalfullscale(fortheredandblue
traces)andtheITUBS.17702loudnessscale,dBLK,forthegreentrace.
45


Figure52SamplesfromBBCInternetaudiostreams:Radio3,Radio2andRadio1(1minutetotal)

Inthiscasestudy,Radio2and3wereengagedintalkduringthe15to20secondsamples.Theclassicalmusicat
thetimewasalightorchestrarendition,withpeaksjustbelow15dBFSandloudnessofabout35to25dBLK.
Thisapproximatestheloudnessofapublicradiostream,transmittedatthelevelstandardsdescribedearlier
(PermittedMaximumLevelof9dBFSandtargetloudnessof23dBLK).
Bycomparison,theRadio2and3segmentsarecompressed,raisingelectricalpeakswithin2dBoffullscale,and
loudnessinatightrangebetween15and11dBLK.Ofcourse,theysoundsubstantiallylouderthantheRadio3
program,whichraisesanissueastowhetherpublicradioshouldcompetewithloudstreams.
Activeloudnessmanagementinthestreamplayersisrecommended,byputtingalimitonprogramaudiothat
exceedsanominallevel(therecommendedloudnessinEBUR128is23dBLK).Inthiscasestudy,classicalmusic
wouldbeunaffectedbyathresholdat23dBLK,buttheRadio2andRadio3programswouldbeturneddownby
theloudnessmanagementsoftwareintheplayer.Thesoftwaremayalsobeprogrammedtoreducetherangeof
programmaterialwithwidevariationswhilealsopreventingblastingofloudprogrammaterial(streamsthatare
notcompliantwithEBUR128).
Iftransmissionlevelmustberaised,andlackingloudnessmanagementontheconsumerside,aprocessthathas
theleastimpactonprogramdynamicsisto:(1)producetotargetlevel(23dBLK)asusual;(2)limitpeaks
at9dBFS;and(3)applyagainincreaseofndB(n<<9dB).

46

STREAMSERVER
ThecomputersuppliedtostationsforstreamencodingisaSuperChassis512L260B,madebySuperMicro
Computer,Inc.(http://www.supermicro.com/products/chassis/1u/512/sc512l260.cfm).Itisa1RUrackmount
unitrunninganIntelprocessorandC602chipsetwith4GBofmemoryandtwo1TBharddrives.The
motherboardexpansionslotsupportsonePCIE3.0x16fullheighthalflengthcard,whichlayshorizontally
becauseofthelowformfactorofthecabinet.

PRESENTAUDIOCARD
Foraudioinput,NPRDigitalMediaselectedanEMU0404
PCIaudiocardmadebyCreativeTechnologyLtd.
(http://us.store.creative.com/EMU0404
PCI/M/B00064YZVK.htm).Thecardisoneofthehighest
performanceproductsdevelopedfortheconsumeraudio
market,using24bit192kHzA/DconvertersandDSP
hardwareaccelerationeffectstominimizeCPUload.Asa
consumerproduct,thecard,showninFigure53,issupplied
withdongleadaptersthatinterconnectthecardframeto
standardconsumerconnectors.Theanalog(DB9)and
digital(DB15)audioconnectorsaredetailedinFigure54.
Figure53EMU0404cardwithdonglesattached
Theaudiolinesextendtounbalancedinputandoutput
connectorsoperatingat10dBu.Digitalinputandoutputisextendedtoopticalandunbalancedcoaxialoperating
atS/PDIFsignallevel(0.5Vpp).TheproductisfurnishedwithPatchMixDSPsoftwareforWindowscomputers
thatprovidesacomprehensivearrayofaudiomixingandprocessingeffects.

Theconsumeranaloganddigitallevelscanpresentaproblemforstations,which
typicallyoperatebalancedanalogaudiolinesatsubstantiallyhigherlevels.
Professionalusersmustalsoconvertfrombalancedtounbalancedaudioatthecard
toavoidhumandnoise.Theseconversionsalsoincreasedifficultywithstandardizing
transmissionlevelsandprovidingproperheadroomatthecardsA/Dconverter.
Also,stationsuseprofessionalconnectors,suchasXLRtointerfaceanalogordigital
audioequipment.StationsalsomayuseAESEBUdigitallinestointerconnectaudio
equipment.WhiletheprotocolforS/PDIFandAESdigitalaudioarecompatible,AES
operatesatahighervoltageandisusuallyabalancedconnection.Thesedifferences
couldleadtoreliabilityissueswiththeaudiointerface.
NPRLabsinvestigatedavarietyofprofessionalaudiointerfacecardstoreplacethe
EMU0404,butthecostwouldbefarbeyondwhatwasbudgeted,andNPRhad
alreadypurchasedalargenumberoftheconsumercards.Asasolutionforstations
andNPR,NPRLabsrecommendedaninterfaceboxbesuppliedtostations.Nooff
theshelfboxwasavailable,sowedesignedacustomboxbuiltlocallybyExcalibur
Figure54Audioconnectorsoncard
bracket

47

Electronics.ThisboxprovidesapairoffemaleXLRconnectorsforthe
audioinputandanRJ45connectorfortheoutput,whichisusedwitha
cableandDB9connectorfordirectconnectiontotheEMUcards
analoginput.
AsshowninFigure56,theboxcontainstwoqualityTriadMagnetics
transformersandaresistivepadusingmetalfilmresistors.Theanalog
pathisdesignedtoproduceadigitizedlevelof18dBFSwitha50ohm
sourceimpedanceproducing+4dBm.(Asdiscussedinthesectionon
transmissionlevels,18dBFSisthenominaltransmissionlevelfor
streaming,andthisalsoprovides18dBofheadroomfortheA/D
convertersfullscalepoint.)Thepadimpedanceisdesignedtopresent
abridgingloadtothebalancedsourcewhichreducesthelevelbyless
than2dBifsourcedfrom600ohms.Internaljumpersareprovidedto
Figure55Customaudiointerface
optionallyconnecttheinputandoutputconnectorsgroundterminals
tothemetalbox.Asshipped,thejumperontheXLRinputsisliftedfromthechassisground;thejumperforthe
groundconnectiontotheEMUcardisconnected.Toavoidgroundloopsitisrecommendedthatonlyonejumper
beinplace.

Figure56Schematicforaudiointerfacebox

ArearpanelswitchisprovidedsothattheleftXLRconnectorcanserveastheAESEBUdigitalinput.Thedigital
signalisconnectedtheRJ45outputconnector,whichcanbeconnectedthroughaseparatecabletotheEMU
cardsDB15S/PDIFinputconnector.TestsweremadetoverifythatthecardworksreliablywithAESEBUsignals
(minimum1.2vPP,normalAES3operatesat23vPP).
48

Thelowestsamplerateforthepreselectedaudiointerfacecard(CreativeEMU0404)is44.1kHz,whichisthe
recommendedrateformusic.Ifitwereavailableinthepresentconfiguration,wewouldrecommendarateof
32kHzforstreamswithprimarilynewsandinformationprogramming.Thiswouldavoidquantizationoftheaudio
spectrumthatisunnecessaryforthisgenre,therebyprovidingasmallimprovementincodecefficiency.However,
ifafuturecodecwithavocoderisemployed,thissampleratechangemaybeunneeded.

PATCHMIXDSPCONFIGURATION
TheCreativeEMUcardsrequireproprietarysoftware,calledPatchMixDSPtooperate.Wheninstalled,the
softwarebypassesthestandardWindowsaudiomixerandprovidesadigitalPCMaudioconnectiondirectlstream
encodingsoftwareasLineIn(LegacyMode).TheoperationoftheWindowsaudiomixer,theRealtekALC662
audiochiponthemotherboard,anditsinterfacesoftwareareunaffected;theymaybeusedsimultaneouslyasa
secondaudioinputorasanaudiobridgingmonitoroftheEMUPCMoutput,suppliedtothe3.5mmjackonthe
rearpanel.

Figure57PatchMixDSPwindow(left)andWinampSHOUTcastwindow(right)

ThePatchMixDSPsoftwareloadsautomaticallyatstartup,alongwiththeWinampplayeranditsassociated
SHOUTcastDSPplugin,whichprovidesthestreamencoding.(ThisappliesonlytostreamsnotusingtheTriton
DigitalAdInsertionstreamingsoftware.)

49

Figure57showstherecommendedconfigurationforPatchMixDSP,withtwoinputmodules:oneforS/PDIF
digitalinputandoneforanalogL/RinputontheEMU0404card.Themodulesprovidesixinsertseachforspecial
features.TheS/PDIFmodulestopinsertisasmallbargraphpeaklevelindicator,followedbyaDSPbased
COMPRESSor/limiter.Theanalogisalsoconfiguredwithabargraphindicator,followedbyaTIRM_POTgain
control,andthenaSENDinsert,whichroutesthemodulesoutputtothecomputersPCMaudiofeedingthe
encodingcard.OnlyoneSENDcanbeinserted,whichservesasatoggleswitchforeitherdigitalaudioinputor
analoginput.TheTRIM_POTinsertwasclicked,whichputayellowboxarounditanddisplaysinformationabout
theinsert:inthiscasealargerbargraphindicatorinthevideoscreen.
TheTRIM_POTscreenincludesagaincontrol,whichissetto0.0dB.Inthiscase,a+4dBuinputonanalogleft
channelproduces18dBFSonthevideoscreensbargraphindicator.TheSHOUTcastencodingsoftwareissetto
receivethePCMaudiofromPatchMix(LineIn),whichproducesa18dBindicationontheencodersbargraph
indicator.Notethatmaximumpeakswouldoccasionallyreach9dBontheencoder,whichprovides9dBof
headroom.Thisisthenominallevelsetupfortransmission.

TRITONDIGITALADINSERTIONSOFTWARE
Tosupportanaudiencemeasurementandadmanagementsolutionforpublicradiostreams,NPRplanstowork
withTritonDigital(formerlyAndoMedia)onadigitalsponsorshipsystem.Thesystemwouldusethestream
serverhardware,describedabove,alongwithTritonmediaautomationsoftwaretoinsertsponsorship
announcements.
TheTritondocumentationspecifiesthattwosoundcardsarerequired:onefortheexternalaudioinputanda
secondtoprovidetheinternalannouncementfiles.18However,theSuperServercomputerselectedbyNPR
physicallysupportsonlyasinglePCcard,suchastheEMU0404audiosoundcard.19TheRealtekALC662audio
chiponthecomputermotherboardcanprovidetheannouncementfileplayback.20Astheannouncementfilesare
playeddigitally,theaudioispasseddigitallythroughtheRealtekchipandnodegradationinqualityoccurs.
However,thereisnoprovisionintheTritonsoftwaretoautomaticallyadjusttheloudnessofrecorded
announcements.Thisisapotentialproblemforuniformtransmissionloudness,astheannouncementsare
producedelsewhereunderdifferinglevelstandards,possiblyincludingdynamiccompression.Ideally,Triton
shouldupgradeitssoftwaretoadjustannouncementplaybacklevelstomatchtheexternal(station)audio,in
accordancewithEBUR128.Lackingthat,implementationofactiveloudnessmanagementintheplayback

18

AndoMediaInstallationClientRequirementsandPreparation,AndoMediaGroup,March2,2011.

19

WeunderstandfromTritonengineersthattheiradinsertionsoftwarewaswrittenwiththeintentionofusing
theOrbanOptimodPC1101audiocard(nowtheOptimodPC1211e),whichsupportsexternalandadinsertion
audio.Thisisarelativelylargeincreaseincostforthestreamserverand,ifoncardaudioprocessingisusedto
controlannouncementlevels,wouldalsodynamicallycompressthestationprogramaudio,whichwediscourage.
20

WehavetestedtheonboardchipforoperationwiththeEMUcard,however,wedidnothaveaccesstothe
announcementfilestoverifyoperationtheTritonadinsertionsoftware.WeareadvisedbyTritionstechnical
supportthatthemotherboardaudiochipshouldbecompatiblewiththeirsoftware.
50

software,asdiscussedearlier,caneffectivelymoderateundesiredchangesinloudnessasannouncementsare
inserted.Currently,theTritonsoftwarealsocansupportonlyonestreamperserver,whichpreventsdual
streamingatdifferentbitratesorwithdifferentcodecs.

AUDIOCARDREPLACEMENTS
TheEMU0404productisdiscontinued,andCreativehasnodirectreplacementforthemodel.NPRDigitalMedia
followedouradviceandpurchasedsomeremainingcardsatdealers.However,thereisaneedtoidentifyalong
termreplacementforthiscardasadditionalcomputersareimplementsoroldcardsfail.TheOptimodPC1211e
cardisacandidate,whichoffersthecapabilitytomixtheexternalaudioinputandtheinternalaudioforTritonad
insertionsoftware.(Tritonofficialsnotedthattheirsoftwarewaswrittenwiththeintentionofusingthe
OptimodPCcard,whichalsoincludesaprofessionalaudiocompressor/limiter.)However,theOptimodPCcard
costsseveraltimesmorethantheEMUcard,evenwiththeaudiointerface.
ThesearchforasolutiontothepresentcardsconsumerinputgaveusaperspectiveonthedirectionofPCaudio
interfaces.Inourview,audioengineersaremovingawayfromPCcardimplementations,anditwasapparentthat
thechoicesofthesecardshasquicklydiminishedoverthepastcoupleofyears,infavorofexternalaudio
interfaceswithUSBinterconnectiontothePC.USBaudiointerfacesarequiteplentiful,haveveryhighaudio
qualityandcouldpermitmorethanoneaudioinputtothecomputer.
ThereissomeconcernaboutwhetherUSBdevicesarereliableenough,especiallywhenthecomputerisrebooted:
doesitautomaticallyrestarttheaudiointerfaceeverytime?Also,questionshavebeenraisedaboutthepossibility
ofinterruptionstotheaudiodatathroughtheUSBportasotherdevicesusetheportorcomputershardwaredata
bus.Thesearequestionsonwhichwerecommendfurtherstudyforfuturegrowth,andbeforethesupplyofEMU
PCcardsrunsout.

CAUTIONSFORTHESOFTWAREENCODER
ThestreamencodingsoftwareselectedbyNPRDigitalServicesistheNullsoftsWinampStandardmediaplayer
withtheSHOUTcastDSPpluginforstreamencoding.Thissoftware,currentlyinversion5.63fortheplayer,is
availableasafreedownloadatwww.winamp.com.Therearesomeuncertaintieswithusingfreewareforalarge
scalestreamingprojectsuchasNPRs:

Correctingerrorsinthesoftware,oransweringdifficultquestionsmaybedifficultwithouttechnical
supportthatisundercontractualobligationtoNPR;
Thesoftwareisdesignedforusebyindividualuserswhoundertaketheirownlocaldownloads,whichmay
makeupdatestothesoftwareacrosshundredsofremotecomputersmorecumbersome;
Modernaudiocodecsareundercontinualimprovement,andnewversionsareissuedfromtimetotime
thatshouldbeused;theupdatesmaynotbeapparenttoNPRandmoredifficulttopushouttolarge
numbersofremotecomputers.

Fortunately,Winamphasalargeuserbaseandonlinecommunity.(ThereisaProversionofthesoftwarethat
promisestechnicalsupportforcustomers,butaquestionemailedtothemhasremainedunansweredafter10
days.)Itispossiblethatotherswillidentifysoftwareissuesquickly,whichtechnicalsupportmaycorrectonbehalf
ofthecommunity,andsometechnicalquestionsmaybeansweredbyexpertsinthecommunity.
51

ACKNOWLEDGEMENTS
Theauthorsappreciatetheassistanceandcontributionsofthefollowingindividualsandcompanies:
JanNordmann,MaxNeuendorf,andNikolausRettelbach,FraunhoferInstitute
ClaudeGravel,VoiceAgeCorporation
TowsonUniversity,TowsonMD
BobOrbanandGregOgonowski,Orban
MichaelByrnesandWilliamHarrison,ExcaliburElectronics
StaffoftheNPRAudioEngineeringDepartment
ThemembersoftheCodecEvaluationProjectAdvisoryGroup

52

You might also like