SlideShare ist ein Scribd-Unternehmen logo
1 von 33
Downloaden Sie, um offline zu lesen
Multimedia Services:
Audio
Sep-2015
Dani Gutiérrez Porset
Associate Professor
Communications Engineering
Eman ta zabal zazu
2
2Multimedia Services:
Audio
Thanks, Licences and Tools
● Thanks to people and organizations who took or take part
in free software and free knowledge projects, specially
Wikimedia Foundation and KDE
●
This presentation is licensed as CC BY-SA 3.0 ES
http://creativecommons.org/licenses/by-sa/3.0/es/
● This presentation has been made with KDE, LibreOffice,
Inkscape, Gimp, Chromium, Firefox
3
3Multimedia Services:
Audio
Sources and References
● Images from Wikimedia Foundation, if not referenced other
source. Logos and trademarks belong to respective organizations
● Texts:
– Wikipedia pages and referenced articles and material
– “Guide to Voice and Video over IP” - Sun, Mkwawa, Jammeh,
Ifeachor
– “Video over IP” - Wes Simpson
– “Computer Networking, a top-down approach” - Kurose, Ross
4
4Multimedia Services:
Audio
Index
● Introduction
● Codecs
● Speech
● Files, Containers and Formats
● Audio wires and connectors
5
5Multimedia Services:
Audio
Human ear
● Time domain:
– Not able to hear short
time signals (< 1 msec)
– Loud signals mask
quieter signals near in
time
Introduction
● Frequency domain:
– Range of audible
frequencies
– Loud signals at one
pitch mask quieter
signals at a near pitch
6
6Multimedia Services:
Audio
Audio Applications
● Speech, VoIP
● CD, DVD sound
● Digital Audio/Video Broadcasting (DAB, DVB)
● Internet streaming
● Studio/transmitter link
● Theatrical movie presentation
● MIDI (similar to vectorial images): Technical standard for musical
instruments that describes pitch, durations, velocity,... of notes. An
output hardware device or a software synthesizes real audio
Introduction
7
7Multimedia Services:
Audio
Audio analog signals
Waveform:
for time-domain
Spectrum:
for freq-domain
Audacity screenshots. Dani Gutiérrez
Introduction
8
8Multimedia Services:
Audio
Modulation families
Analog baseband signal Digital signal
Analog bandpass channel Analog modulation
e.g. AM, FM
Digital modulation
e.g. PSK, FSK, ASK, QAM
Analog baseband channel Pulse modulation,
analog over analog
e.g. PAM, PWM
Digital baseband modulation
e.g. Unipolar, NRZ, Manchester
Digital channel Pulse modulation,
analog over digital
e.g. PCM
Introduction
9
9Multimedia Services:
Audio
Analog-over-digital modulations
= Digitization
● Pulse-code modulation (PCM)
– Differential PCM (DPCM)
– Adaptive DPCM (ADPCM)
● Delta modulation (DM or Δ-modulation)
● Adaptive-delta modulation (ADM) or Continuously variable
slope delta modulation (CVSDM)
● Delta-sigma modulation ( Δ)∑
● Pulse-density modulation (PDM), e.g. used in Super Audio
CD (“DSD” trademark from Sony and Philips)
Introduction
10
10Multimedia Services:
Audio
Pulse Code Modulation
1.Sampling (>= 2 x bandwidth of the analog signal)
Errors depend on Frecuency cut and clock accuracy
2.Quantization: uniform (LPCM=Linear PCM) or non-uniform (PCMA=A-
law, PCMU= -law)μ
Errors: Granularity
3.Coding: number of bits per sample
Mode Bandwidth (Hz) Sampling (kHz)
Narrowband (NB) 300–3400 8
Wideband (WB) 50–7000 16
Super-wideband (SWB) 50–14000 32
Fullband (FB) 20–20000 48
Introduction
11
11Multimedia Services:
Audio
Audio Codecs
●
Aim of a Codec: to convert and to compress, for storage
and transmission over distinct media, e.g.
– AMR-NB: lossy, for speech
– Dolby Digital: lossy, for cinema and HDTV broadcast
– Dolby TrueHD: lossless, for home entertainment
● Conversion types:
– Analog to Digital (+ Digital to Analog)
– Digital to Digital
● Bitrate (kbits/s) at the codec output
http://en.wikipedia.org/wiki/Analog-to-digital_converter
http://en.wikipedia.org/wiki/Audio_coding_format
Codecs
12
12Multimedia Services:
Audio
Classifications of Audio Codecs
Codecs
● Nature of source: speech, music, cellular (2G GSM, 3G ARM)...
● Source signal bandwidth (NB, WB, SWB, FB) and Sampling rate
● Resulting bitrate (Most in 4,8 to 16 kbps)
● One or more bitrates (adaptive)
● Lossless or lossy
● Latency or delay (inherent to each algorithm)
● Quality
● Creator (ITU-T, IETF, ETSI, Skype,...)
● Licenses
● Costs for encoder and player
● Compression techniques and algorithms
(depend on nature and bandwidth of source signal):
Frame based or sample based, Delay, CBR or VBR, No. of channels,...
● Complexity (computation time)
Source
Processing
Result
Legals & Costs
http://en.wikipedia.org/wiki/Comparison_of_audio_coding_formats
13
13Multimedia Services:
Audio
Audio compression
● Based on psychoacoustics:
– Threshold of hearing (frequencies)
– Simultaneous masking
● Lossy used algorithm families:
– Time domain: Linear predictive coding (LPC), mainly for speech: CELP,
ACELP, VSELP, LPC, RPE-LTP,...
– Freq domain:
● Modified discrete cosine transform (MDCT), e.g. CELT
● Applied to full band or to sub-bands (SBC): break signal into freq bands, and
encode each one independently, e.g. MP3
– Some codecs combine both, e.g. G.718 uses CELP and MDTC
Codecs
14
14Multimedia Services:
Audio
Compression ratio
Codec
Digital Input
Stream
Output
Stream
● f=Sampling freq (kHz)
● bs=Bits/sample
● b=Bitrate (kbps)
f x bs
b
Compression ratio =
(related to input)
64
b
Compression ratio =
(related to 64 kbps)
Codecs
15
15Multimedia Services:
Audio
Framed based vs Sample based
● Sample-based: one sample each time
e.g. PCM and ADPCM
● Frame-based: more than one sample is taken, to
study correlation between near samples. Frame
length can be fixed or variable
e.g. G.723.1 and G.729
Codecs
16
16Multimedia Services:
Audio
Audio Codecs and Delays
● Delays more or less appropriate for some types of transmission:
– Low latency: less compression, higher bitrate
e.g. for real time in VoIP or satellite communications
– High latency: higher compression, lower bitrate
e.g. for stored media, broadcasting or recording
● Origin of latencies:
– Processing, depends on hardware
– Inherent to each algorithm or codec (buffering is needed):
● Frame size (msec): related to number of samples inside the frame
● Look-ahead time: when needed to study correlation between actual and next frame
● Delay calculations:
– In sender: Algorithm delay = Frame length + Look-ahead time
– In both: Codec delay = 2 x Frame length + Look-ahead time
Codecs
17
17Multimedia Services:
Audio
Audio Codecs: CBR vs VBR
● CBR: Constant bitrate. Older
● VBR: Variable bitrate:
– Frames of a file with distinct bitrates depending on variability of
information, higher during more complex periods
– Better quality vs size, but more complex to encode
– Typical in lossless compression (e.g. FLAC, Apple Lossless) and in some
lossy compressions (e.g. MP3, Opus, Vorbis, AAC)
– Encoding in single-pass (“on the fly”) or multipass (not for real time or
live streaming)
– Input parameter: fixed quality, max/min bitrate, average bitrate, file
size
Codecs
18
18Multimedia Services:
Audio
Audio Codecs and Channels
● Mix two (stereo) or more channels of similar information reducing size
but at high quality, instead of store and send independent channels
● Techiques (used in e.g. MP3, AAC, Vorbis) that may be combined for a
signal:
– Simple Stereo (SS): independent channels. No compression
– Mid-side Stereo (MS):
● Middle = (L+R)/2, Side = (L-R)/2.
● Can benefit if signal is more “mono-like”, compressing new “Middle channel”
– Intensity Stereo:
● Based on phychoacoustics, replaces both channels with a single signal plus directional
information
● Better at low bitrates, worse at high bitrates
Codecs
19
19Multimedia Services:
Audio
Audio Codecs comparisons
Source: http://www.opus-codec.org/comparison/
Codecs
20
20Multimedia Services:
Audio
Example of Audio Codec: MP3
● Versions: MPEG-1, MPEG-2, MPEG-2.5 Audio Layer III
● Specification defines decoder better than encoder.
Distinct implementations for encoder, e.g. LAME
● Distinct bitrates and sampling rates depending on version
● Channels: 2 in MPEG-1 mode and up to 5.1 in MPEG-2
● Algorithms: MDCT Hybrid Subband
● Supports CBR and VBR
● Licensing and patent war
Codecs
21
21Multimedia Services:
Audio
Other examples
of typical Audio Codecs
● AAC (Advanced Audio Coding), from ISO and IEC. Part
of MPEG-2 and MPEG-4. Designed to replace MP3.
Patent for coding, not for streaming or distributing
contents
● Vorbis, from Xiph.Org foundation: typically inside Ogg
or WebM containers. Based on MDCT. Open, Royalty-
free
● Opus, from IETF. Suitable for interactive real-time.
Based on CELT and SILK. Open, Royalty-free
http://en.wikipedia.org/wiki/Category:Audio_codecs
Codecs
22
22Multimedia Services:
Audio
Speech case
● Distinct to music
● Interactive
● Voiced speech: harmonics (at freq depending if
male/female)
● Unvoiced signal: like white noise
Speech
23
23Multimedia Services:
Audio
Speech Codecs
● Aim: intelligibility and speaker identification
● Specialized codecs, e.g.:
– Better for music: Vorbis
– Better for speech: GSM, Speex,...
● Distinct times:
– Speech frame: time to encode a frame of speech
– RTP Packet voice duration: time to packetize and send to the network
e.g. for PCM: 20 msec
● Sometimes a VoIP tool provides several codecs to be selected manually
or automatically, and can be changed during conversation
Speech
24
24Multimedia Services:
Audio
Speech Codecs:
Techniques and Codec Comparison
● Compression: remove short-term correlation (~ 1 msec) and
long-term correlation (~ 5 to 10 msec).
● Techniques: Waveform, Parametric (Vocoders for speech),
Hybrid
Source: http://www-mobile.ecs.soton.ac.uk/speech_codecs/common_classes.html
Speech
25
25Multimedia Services:
Audio
G.711 Codec
● Reference codec for comparison
● G.711 = “PCM of voice frequencies”
8k samples/sec x 8 bits/sample = 64 kbps
● Voice quantisation: non-uniform logarithmic quantisation
because of its nature of voice: lower level speech signal has
higher PDF (Probability Density Function) than higher speech
● Variations:
– µ-law (North America, Japan): 14 bits to 8 bits
– A-law (Europe): 13 bits to 8 bits
Speech
26
26Multimedia Services:
Audio
Speech compression:
Waveform based technique
● Method: Remove rendundancy in waveform and
reconstruct.
● Complexity: low
● Results: 16 kpbs to 64 kbps
● Examples of Codecs:
– PCM
– ADPCM (Adaptative Differential PCM), for NB and WB
Speech
27
27Multimedia Services:
Audio
Speech compression:
Parametric based technique
● Method:
– Take segments of short periods (~20 msec) and classifies them in
voiced or unvoiced
– The voice parameters of each segment are obtained via speech
analysis, encoded and sent
● Complexity: high
● Results: better compression ratios, bad quality
● Examples of Codecs:
– LPC (Linear Prediction Coding); 1,2 to 4,8 kbps. Used for secure
wireless communications
Speech
28
28Multimedia Services:
Audio
Speech compression:
Hybrid based technique
● “Analysis-by-Synthesis coding”
● Method: Combines waveform and parametric
● Examples of Codecs:
– CELP (Codebook Excitation Linear Prediction): 4,8 to 16
kbps. Mobile/wireless/satellite communications
achieving toll quality (MOS over 4.0)
– Other modern codecs: G.729, G.723.1, AMR, iLBC, SILK
Speech
29
29Multimedia Services:
Audio
Speech codec examples
Source: Cisco. Voice Over IP - Per Call Bandwidth Consumption
Other important speech codecs:
● SILK, from Skype. Based on LPC. Not royalty-free
● iSAC (internet Speech Audio Codec): wideband and super wideband, open, royalty-free
● ILBC (internet Low Bitrate Codec): narrowband, open, royalty-free for WebRTC
● AMR (Adaptive Multi-Rate) or AMR-NB (Narrow Band)
● AMR-WB (Wideband)
● Speex http://en.wikipedia.org/wiki/Category:Speech_codecs
Speech
30
30Multimedia Services:
Audio
Audio files
● No. of Channels: one (“mono”), two (“stereo”) or
Multichannel
● Compression and codecs:
– No compression: raw PCM,...
– Lossless: FLAC, Apple Lossless .m4a, WMA lossless,...
– Lossy: MP3, Vorbis, AAC,…
Files, Containers and Formats
http://en.wikipedia.org/wiki/Audio_file_format
31
31Multimedia Services:
Audio
Examples of Audio containers
● WAV:
– Instance of [the more general] RIFF
– Chunks:
● One or more chunks, e.g. 2 channels for stereo
● Can contain compressed audio data and non-audio data
– Metadata for each chunk:
● Encoding (typically LPCM uncompressed), No. of channels,
bits/channel, sample rate
● Labels: artist, comments,...
● From video: Ogg, MPEG-4 Part 14 or MP4
Files, Containers and Formats
32
32Multimedia Services:
Audio
Audio physical formats
● CD:
– Reference: “Red book”
– Digital audio encoding:
● 2-channel
● Signed 16-bit
● Linear PCM
● 44,100 Hz
– Similar but distinct to WAV: no headers, tracks that match the CD's sector sizes
● Other supports and associated modulations and lossless codecs:
Super Audio CD (SACD) Pulse density modulation + Direct Stream Transfer
DVD-Audio, Blu-ray, (HD DVD) Meridian Lossless Packing
Files, Containers and Formats
33
33Multimedia Services:
Audio
Audio hardware:
wires and connectors
● Only for audio:
– Analog: PC (colors scheme)
– Digital: S/PDIF. Supports uncompressed PCM audio
and compressed 5.1/7.1 surround sound
● Audio with video:
– Analog: SCART
– Digital: HDMI
Wires and Connectors

Weitere ähnliche Inhalte

Was ist angesagt?

02.m3 cms sys-req4mediastreaming
02.m3 cms sys-req4mediastreaming02.m3 cms sys-req4mediastreaming
02.m3 cms sys-req4mediastreamingtarensi
 
Basics of Mpeg 4 3D Graphics Compression
Basics of Mpeg 4 3D Graphics CompressionBasics of Mpeg 4 3D Graphics Compression
Basics of Mpeg 4 3D Graphics CompressionMarius Preda PhD
 
Video Compression Standards - History & Introduction
Video Compression Standards - History & IntroductionVideo Compression Standards - History & Introduction
Video Compression Standards - History & IntroductionChamp Yen
 
Streaming Media Protocols
Streaming Media ProtocolsStreaming Media Protocols
Streaming Media Protocolssanjoysanyal
 
h.264 video compression standard.
h.264 video compression standard.h.264 video compression standard.
h.264 video compression standard.Videoguy
 
Video Streaming - 4.ppt
Video Streaming - 4.pptVideo Streaming - 4.ppt
Video Streaming - 4.pptVideoguy
 
MPEG Compression Standards
MPEG Compression StandardsMPEG Compression Standards
MPEG Compression StandardsAjay
 
Video Coding Standard
Video Coding StandardVideo Coding Standard
Video Coding StandardVideoguy
 
Introduction To Video Compression
Introduction To Video CompressionIntroduction To Video Compression
Introduction To Video Compressionguestdd7ccca
 
Iain Richardson: An Introduction to Video Compression
Iain Richardson: An Introduction to Video CompressionIain Richardson: An Introduction to Video Compression
Iain Richardson: An Introduction to Video CompressionIain Richardson
 
Mpeg 2 transport streams
Mpeg 2 transport streamsMpeg 2 transport streams
Mpeg 2 transport streamschikien276
 
Protocol For Streaming Media
Protocol For Streaming MediaProtocol For Streaming Media
Protocol For Streaming MediaKaniska Mandal
 
LDI 2012 System Integration
LDI 2012 System IntegrationLDI 2012 System Integration
LDI 2012 System IntegrationLauraFrank
 

Was ist angesagt? (20)

02.m3 cms sys-req4mediastreaming
02.m3 cms sys-req4mediastreaming02.m3 cms sys-req4mediastreaming
02.m3 cms sys-req4mediastreaming
 
Video standard
Video standardVideo standard
Video standard
 
MPEG4 vs H.264
MPEG4 vs H.264MPEG4 vs H.264
MPEG4 vs H.264
 
Basics of Mpeg 4 3D Graphics Compression
Basics of Mpeg 4 3D Graphics CompressionBasics of Mpeg 4 3D Graphics Compression
Basics of Mpeg 4 3D Graphics Compression
 
Video Compression Standards - History & Introduction
Video Compression Standards - History & IntroductionVideo Compression Standards - History & Introduction
Video Compression Standards - History & Introduction
 
Streaming Media Protocols
Streaming Media ProtocolsStreaming Media Protocols
Streaming Media Protocols
 
h.264 video compression standard.
h.264 video compression standard.h.264 video compression standard.
h.264 video compression standard.
 
Audio Compression
Audio CompressionAudio Compression
Audio Compression
 
Video Streaming - 4.ppt
Video Streaming - 4.pptVideo Streaming - 4.ppt
Video Streaming - 4.ppt
 
MPEG Compression Standards
MPEG Compression StandardsMPEG Compression Standards
MPEG Compression Standards
 
H261
H261H261
H261
 
Video Coding Standard
Video Coding StandardVideo Coding Standard
Video Coding Standard
 
Digital video
Digital videoDigital video
Digital video
 
Introduction To Video Compression
Introduction To Video CompressionIntroduction To Video Compression
Introduction To Video Compression
 
Video coding standards ppt
Video coding standards pptVideo coding standards ppt
Video coding standards ppt
 
Iain Richardson: An Introduction to Video Compression
Iain Richardson: An Introduction to Video CompressionIain Richardson: An Introduction to Video Compression
Iain Richardson: An Introduction to Video Compression
 
H263.ppt
H263.pptH263.ppt
H263.ppt
 
Mpeg 2 transport streams
Mpeg 2 transport streamsMpeg 2 transport streams
Mpeg 2 transport streams
 
Protocol For Streaming Media
Protocol For Streaming MediaProtocol For Streaming Media
Protocol For Streaming Media
 
LDI 2012 System Integration
LDI 2012 System IntegrationLDI 2012 System Integration
LDI 2012 System Integration
 

Andere mochten auch

Incorportating Multimedia Technology In The Preschool Classroom
Incorportating Multimedia Technology In The Preschool ClassroomIncorportating Multimedia Technology In The Preschool Classroom
Incorportating Multimedia Technology In The Preschool ClassroomLady Dolandolan
 
Flexible Transport of 3D Videos over Networks
Flexible Transport of 3D Videos over NetworksFlexible Transport of 3D Videos over Networks
Flexible Transport of 3D Videos over NetworksAhmed Hamza
 
Future technologies for business simulations
Future technologies for business simulationsFuture technologies for business simulations
Future technologies for business simulationsMichal Augustini
 
Eγκατάσταση (Clean install) Λ.Σ. Windows
Eγκατάσταση (Clean install) Λ.Σ. WindowsEγκατάσταση (Clean install) Λ.Σ. Windows
Eγκατάσταση (Clean install) Λ.Σ. Windowsgogosf
 
multimedia technologies Introduction
multimedia technologies Introductionmultimedia technologies Introduction
multimedia technologies IntroductionMohammed Fareed
 
Medical Multimedia Information Systems (ACMMM17 Tutorial)
Medical Multimedia Information Systems (ACMMM17 Tutorial) Medical Multimedia Information Systems (ACMMM17 Tutorial)
Medical Multimedia Information Systems (ACMMM17 Tutorial) klschoef
 
Computer Animation PowerPoint
Computer Animation PowerPointComputer Animation PowerPoint
Computer Animation PowerPointoacore2
 
Computer animation Computer Graphics
Computer animation Computer Graphics Computer animation Computer Graphics
Computer animation Computer Graphics University of Potsdam
 
Robots presentation
Robots presentationRobots presentation
Robots presentationaroobkazim
 
presentation on 3D Technology
presentation on 3D Technologypresentation on 3D Technology
presentation on 3D TechnologyMatti Ur Rehman
 
(Computer) Animation Technique
(Computer) Animation Technique(Computer) Animation Technique
(Computer) Animation Techniquejustinesolano
 
Animation's Life Cycle And Its Application
Animation's Life Cycle And Its ApplicationAnimation's Life Cycle And Its Application
Animation's Life Cycle And Its ApplicationShakaib Arif
 
3Ds Max presentation
3Ds Max presentation3Ds Max presentation
3Ds Max presentationNader Soubra
 
Introduction to Animation
Introduction to AnimationIntroduction to Animation
Introduction to Animationmrnasim
 
Computer technology power point
Computer technology power pointComputer technology power point
Computer technology power pointoacore2
 
Basic Concepts of Animation
Basic Concepts of AnimationBasic Concepts of Animation
Basic Concepts of Animationjamalharun
 
3D printer Technology _ A complete presentation
3D printer Technology _ A complete presentation3D printer Technology _ A complete presentation
3D printer Technology _ A complete presentationVijay Patil
 

Andere mochten auch (20)

Incorportating Multimedia Technology In The Preschool Classroom
Incorportating Multimedia Technology In The Preschool ClassroomIncorportating Multimedia Technology In The Preschool Classroom
Incorportating Multimedia Technology In The Preschool Classroom
 
Flexible Transport of 3D Videos over Networks
Flexible Transport of 3D Videos over NetworksFlexible Transport of 3D Videos over Networks
Flexible Transport of 3D Videos over Networks
 
Future technologies for business simulations
Future technologies for business simulationsFuture technologies for business simulations
Future technologies for business simulations
 
OpenGL Basics
OpenGL BasicsOpenGL Basics
OpenGL Basics
 
Nioc2015 final
Nioc2015 finalNioc2015 final
Nioc2015 final
 
Eγκατάσταση (Clean install) Λ.Σ. Windows
Eγκατάσταση (Clean install) Λ.Σ. WindowsEγκατάσταση (Clean install) Λ.Σ. Windows
Eγκατάσταση (Clean install) Λ.Σ. Windows
 
multimedia technologies Introduction
multimedia technologies Introductionmultimedia technologies Introduction
multimedia technologies Introduction
 
Medical Multimedia Information Systems (ACMMM17 Tutorial)
Medical Multimedia Information Systems (ACMMM17 Tutorial) Medical Multimedia Information Systems (ACMMM17 Tutorial)
Medical Multimedia Information Systems (ACMMM17 Tutorial)
 
Computer Animation PowerPoint
Computer Animation PowerPointComputer Animation PowerPoint
Computer Animation PowerPoint
 
Computer animation Computer Graphics
Computer animation Computer Graphics Computer animation Computer Graphics
Computer animation Computer Graphics
 
Robots presentation
Robots presentationRobots presentation
Robots presentation
 
presentation on 3D Technology
presentation on 3D Technologypresentation on 3D Technology
presentation on 3D Technology
 
(Computer) Animation Technique
(Computer) Animation Technique(Computer) Animation Technique
(Computer) Animation Technique
 
Animation's Life Cycle And Its Application
Animation's Life Cycle And Its ApplicationAnimation's Life Cycle And Its Application
Animation's Life Cycle And Its Application
 
3Ds Max presentation
3Ds Max presentation3Ds Max presentation
3Ds Max presentation
 
Introduction to Animation
Introduction to AnimationIntroduction to Animation
Introduction to Animation
 
Computer technology power point
Computer technology power pointComputer technology power point
Computer technology power point
 
Basic Concepts of Animation
Basic Concepts of AnimationBasic Concepts of Animation
Basic Concepts of Animation
 
3D printer Technology _ A complete presentation
3D printer Technology _ A complete presentation3D printer Technology _ A complete presentation
3D printer Technology _ A complete presentation
 
ANIMATION PPT
ANIMATION PPTANIMATION PPT
ANIMATION PPT
 

Ähnlich wie Multimedia Services: Audio

Audio and video compression
Audio and video compressionAudio and video compression
Audio and video compressionneeraj9217
 
Basics of audio coding
Basics of audio codingBasics of audio coding
Basics of audio codingsakshij91
 
C06-Broadcast_Systems1.ppt
C06-Broadcast_Systems1.pptC06-Broadcast_Systems1.ppt
C06-Broadcast_Systems1.pptPalanikumar72221
 
Linear Programming Case Study - Maximizing Audio Quality
Linear Programming Case Study - Maximizing Audio QualityLinear Programming Case Study - Maximizing Audio Quality
Linear Programming Case Study - Maximizing Audio QualitySharad Srivastava
 
MM_Conferencing.ppt
MM_Conferencing.pptMM_Conferencing.ppt
MM_Conferencing.pptVideoguy
 
Audio formats for phones
Audio formats for phonesAudio formats for phones
Audio formats for phonesJoe Nasr
 
Mm01 a vformat
Mm01 a vformatMm01 a vformat
Mm01 a vformatgotovikas
 
Development of Network Service Infrastructure for Transcoding ...
Development of Network Service Infrastructure for Transcoding ...Development of Network Service Infrastructure for Transcoding ...
Development of Network Service Infrastructure for Transcoding ...Videoguy
 

Ähnlich wie Multimedia Services: Audio (20)

Digital audio
Digital audioDigital audio
Digital audio
 
Digital audio
Digital audioDigital audio
Digital audio
 
Speech encoding techniques
Speech encoding techniquesSpeech encoding techniques
Speech encoding techniques
 
SPEECH CODING
SPEECH CODINGSPEECH CODING
SPEECH CODING
 
Lecture 12
Lecture 12Lecture 12
Lecture 12
 
Digital Audio in Multimedia
Digital Audio in MultimediaDigital Audio in Multimedia
Digital Audio in Multimedia
 
add9.5.ppt
add9.5.pptadd9.5.ppt
add9.5.ppt
 
Codecs
CodecsCodecs
Codecs
 
Audio and video compression
Audio and video compressionAudio and video compression
Audio and video compression
 
Basics of audio coding
Basics of audio codingBasics of audio coding
Basics of audio coding
 
C06-Broadcast_Systems1.ppt
C06-Broadcast_Systems1.pptC06-Broadcast_Systems1.ppt
C06-Broadcast_Systems1.ppt
 
MPEG 4
MPEG 4MPEG 4
MPEG 4
 
A glance-at-voip
A glance-at-voipA glance-at-voip
A glance-at-voip
 
Audio compression
Audio compressionAudio compression
Audio compression
 
Linear Programming Case Study - Maximizing Audio Quality
Linear Programming Case Study - Maximizing Audio QualityLinear Programming Case Study - Maximizing Audio Quality
Linear Programming Case Study - Maximizing Audio Quality
 
MM_Conferencing.ppt
MM_Conferencing.pptMM_Conferencing.ppt
MM_Conferencing.ppt
 
Audio formats for phones
Audio formats for phonesAudio formats for phones
Audio formats for phones
 
Mm01 a vformat
Mm01 a vformatMm01 a vformat
Mm01 a vformat
 
3 multimedia systems
3   multimedia systems3   multimedia systems
3 multimedia systems
 
Development of Network Service Infrastructure for Transcoding ...
Development of Network Service Infrastructure for Transcoding ...Development of Network Service Infrastructure for Transcoding ...
Development of Network Service Infrastructure for Transcoding ...
 

Mehr von Dani Gutiérrez Porset

Librecon 2016 - Emprendizaje digital y Software libre
Librecon 2016 - Emprendizaje digital y Software libreLibrecon 2016 - Emprendizaje digital y Software libre
Librecon 2016 - Emprendizaje digital y Software libreDani Gutiérrez Porset
 
Evolución de las Herramientas de aprendizaje online con licencia libre
Evolución de las Herramientas de aprendizaje online con licencia libreEvolución de las Herramientas de aprendizaje online con licencia libre
Evolución de las Herramientas de aprendizaje online con licencia libreDani Gutiérrez Porset
 
Introducción al diseño grafico con software libre
Introducción al diseño grafico con software libreIntroducción al diseño grafico con software libre
Introducción al diseño grafico con software libreDani Gutiérrez Porset
 
Intro a los sistemas operativos móviles
Intro a los sistemas operativos móvilesIntro a los sistemas operativos móviles
Intro a los sistemas operativos móvilesDani Gutiérrez Porset
 
Software libre para una ciudadanía libre
Software libre para una ciudadanía libreSoftware libre para una ciudadanía libre
Software libre para una ciudadanía libreDani Gutiérrez Porset
 
Amenazas en la red: ataques, ciberespionaje y malware
Amenazas en la red: ataques, ciberespionaje y malwareAmenazas en la red: ataques, ciberespionaje y malware
Amenazas en la red: ataques, ciberespionaje y malwareDani Gutiérrez Porset
 
Kubuntu - Instalación y Configuración
Kubuntu - Instalación y ConfiguraciónKubuntu - Instalación y Configuración
Kubuntu - Instalación y ConfiguraciónDani Gutiérrez Porset
 

Mehr von Dani Gutiérrez Porset (20)

Librecon 2016 - Emprendizaje digital y Software libre
Librecon 2016 - Emprendizaje digital y Software libreLibrecon 2016 - Emprendizaje digital y Software libre
Librecon 2016 - Emprendizaje digital y Software libre
 
Evolución de las Herramientas de aprendizaje online con licencia libre
Evolución de las Herramientas de aprendizaje online con licencia libreEvolución de las Herramientas de aprendizaje online con licencia libre
Evolución de las Herramientas de aprendizaje online con licencia libre
 
Sockets ipv4
Sockets ipv4Sockets ipv4
Sockets ipv4
 
Mecanismos IPC system V en Linux
Mecanismos IPC system V en LinuxMecanismos IPC system V en Linux
Mecanismos IPC system V en Linux
 
Señales en Linux
Señales en LinuxSeñales en Linux
Señales en Linux
 
Introducción al diseño grafico con software libre
Introducción al diseño grafico con software libreIntroducción al diseño grafico con software libre
Introducción al diseño grafico con software libre
 
Moodle avanzado - Julio 2012
Moodle avanzado - Julio 2012Moodle avanzado - Julio 2012
Moodle avanzado - Julio 2012
 
Akademy 2013 bilbao_proposal
Akademy 2013 bilbao_proposalAkademy 2013 bilbao_proposal
Akademy 2013 bilbao_proposal
 
Web 2.0 (2010ko abendua) euskeraz
Web 2.0 (2010ko abendua) euskerazWeb 2.0 (2010ko abendua) euskeraz
Web 2.0 (2010ko abendua) euskeraz
 
Sareen kudeaketa, SNMP eta RMON
Sareen kudeaketa, SNMP eta RMONSareen kudeaketa, SNMP eta RMON
Sareen kudeaketa, SNMP eta RMON
 
Cómo hacer una buena presentación
Cómo hacer una buena presentaciónCómo hacer una buena presentación
Cómo hacer una buena presentación
 
Ofimatica con Libreoffice
Ofimatica con LibreofficeOfimatica con Libreoffice
Ofimatica con Libreoffice
 
Gestión de redes, SNMP y RMON
Gestión de redes, SNMP y RMONGestión de redes, SNMP y RMON
Gestión de redes, SNMP y RMON
 
DHCP, DNS, whois
DHCP, DNS, whoisDHCP, DNS, whois
DHCP, DNS, whois
 
Web 2.0 (dic 2010)
Web 2.0 (dic 2010)Web 2.0 (dic 2010)
Web 2.0 (dic 2010)
 
Intro a los sistemas operativos móviles
Intro a los sistemas operativos móvilesIntro a los sistemas operativos móviles
Intro a los sistemas operativos móviles
 
kde on windows
kde on windowskde on windows
kde on windows
 
Software libre para una ciudadanía libre
Software libre para una ciudadanía libreSoftware libre para una ciudadanía libre
Software libre para una ciudadanía libre
 
Amenazas en la red: ataques, ciberespionaje y malware
Amenazas en la red: ataques, ciberespionaje y malwareAmenazas en la red: ataques, ciberespionaje y malware
Amenazas en la red: ataques, ciberespionaje y malware
 
Kubuntu - Instalación y Configuración
Kubuntu - Instalación y ConfiguraciónKubuntu - Instalación y Configuración
Kubuntu - Instalación y Configuración
 

Kürzlich hochgeladen

Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfSeasiaInfotech2
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesZilliz
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 

Kürzlich hochgeladen (20)

Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdf
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector Databases
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 

Multimedia Services: Audio

  • 1. Multimedia Services: Audio Sep-2015 Dani Gutiérrez Porset Associate Professor Communications Engineering Eman ta zabal zazu
  • 2. 2 2Multimedia Services: Audio Thanks, Licences and Tools ● Thanks to people and organizations who took or take part in free software and free knowledge projects, specially Wikimedia Foundation and KDE ● This presentation is licensed as CC BY-SA 3.0 ES http://creativecommons.org/licenses/by-sa/3.0/es/ ● This presentation has been made with KDE, LibreOffice, Inkscape, Gimp, Chromium, Firefox
  • 3. 3 3Multimedia Services: Audio Sources and References ● Images from Wikimedia Foundation, if not referenced other source. Logos and trademarks belong to respective organizations ● Texts: – Wikipedia pages and referenced articles and material – “Guide to Voice and Video over IP” - Sun, Mkwawa, Jammeh, Ifeachor – “Video over IP” - Wes Simpson – “Computer Networking, a top-down approach” - Kurose, Ross
  • 4. 4 4Multimedia Services: Audio Index ● Introduction ● Codecs ● Speech ● Files, Containers and Formats ● Audio wires and connectors
  • 5. 5 5Multimedia Services: Audio Human ear ● Time domain: – Not able to hear short time signals (< 1 msec) – Loud signals mask quieter signals near in time Introduction ● Frequency domain: – Range of audible frequencies – Loud signals at one pitch mask quieter signals at a near pitch
  • 6. 6 6Multimedia Services: Audio Audio Applications ● Speech, VoIP ● CD, DVD sound ● Digital Audio/Video Broadcasting (DAB, DVB) ● Internet streaming ● Studio/transmitter link ● Theatrical movie presentation ● MIDI (similar to vectorial images): Technical standard for musical instruments that describes pitch, durations, velocity,... of notes. An output hardware device or a software synthesizes real audio Introduction
  • 7. 7 7Multimedia Services: Audio Audio analog signals Waveform: for time-domain Spectrum: for freq-domain Audacity screenshots. Dani Gutiérrez Introduction
  • 8. 8 8Multimedia Services: Audio Modulation families Analog baseband signal Digital signal Analog bandpass channel Analog modulation e.g. AM, FM Digital modulation e.g. PSK, FSK, ASK, QAM Analog baseband channel Pulse modulation, analog over analog e.g. PAM, PWM Digital baseband modulation e.g. Unipolar, NRZ, Manchester Digital channel Pulse modulation, analog over digital e.g. PCM Introduction
  • 9. 9 9Multimedia Services: Audio Analog-over-digital modulations = Digitization ● Pulse-code modulation (PCM) – Differential PCM (DPCM) – Adaptive DPCM (ADPCM) ● Delta modulation (DM or Δ-modulation) ● Adaptive-delta modulation (ADM) or Continuously variable slope delta modulation (CVSDM) ● Delta-sigma modulation ( Δ)∑ ● Pulse-density modulation (PDM), e.g. used in Super Audio CD (“DSD” trademark from Sony and Philips) Introduction
  • 10. 10 10Multimedia Services: Audio Pulse Code Modulation 1.Sampling (>= 2 x bandwidth of the analog signal) Errors depend on Frecuency cut and clock accuracy 2.Quantization: uniform (LPCM=Linear PCM) or non-uniform (PCMA=A- law, PCMU= -law)μ Errors: Granularity 3.Coding: number of bits per sample Mode Bandwidth (Hz) Sampling (kHz) Narrowband (NB) 300–3400 8 Wideband (WB) 50–7000 16 Super-wideband (SWB) 50–14000 32 Fullband (FB) 20–20000 48 Introduction
  • 11. 11 11Multimedia Services: Audio Audio Codecs ● Aim of a Codec: to convert and to compress, for storage and transmission over distinct media, e.g. – AMR-NB: lossy, for speech – Dolby Digital: lossy, for cinema and HDTV broadcast – Dolby TrueHD: lossless, for home entertainment ● Conversion types: – Analog to Digital (+ Digital to Analog) – Digital to Digital ● Bitrate (kbits/s) at the codec output http://en.wikipedia.org/wiki/Analog-to-digital_converter http://en.wikipedia.org/wiki/Audio_coding_format Codecs
  • 12. 12 12Multimedia Services: Audio Classifications of Audio Codecs Codecs ● Nature of source: speech, music, cellular (2G GSM, 3G ARM)... ● Source signal bandwidth (NB, WB, SWB, FB) and Sampling rate ● Resulting bitrate (Most in 4,8 to 16 kbps) ● One or more bitrates (adaptive) ● Lossless or lossy ● Latency or delay (inherent to each algorithm) ● Quality ● Creator (ITU-T, IETF, ETSI, Skype,...) ● Licenses ● Costs for encoder and player ● Compression techniques and algorithms (depend on nature and bandwidth of source signal): Frame based or sample based, Delay, CBR or VBR, No. of channels,... ● Complexity (computation time) Source Processing Result Legals & Costs http://en.wikipedia.org/wiki/Comparison_of_audio_coding_formats
  • 13. 13 13Multimedia Services: Audio Audio compression ● Based on psychoacoustics: – Threshold of hearing (frequencies) – Simultaneous masking ● Lossy used algorithm families: – Time domain: Linear predictive coding (LPC), mainly for speech: CELP, ACELP, VSELP, LPC, RPE-LTP,... – Freq domain: ● Modified discrete cosine transform (MDCT), e.g. CELT ● Applied to full band or to sub-bands (SBC): break signal into freq bands, and encode each one independently, e.g. MP3 – Some codecs combine both, e.g. G.718 uses CELP and MDTC Codecs
  • 14. 14 14Multimedia Services: Audio Compression ratio Codec Digital Input Stream Output Stream ● f=Sampling freq (kHz) ● bs=Bits/sample ● b=Bitrate (kbps) f x bs b Compression ratio = (related to input) 64 b Compression ratio = (related to 64 kbps) Codecs
  • 15. 15 15Multimedia Services: Audio Framed based vs Sample based ● Sample-based: one sample each time e.g. PCM and ADPCM ● Frame-based: more than one sample is taken, to study correlation between near samples. Frame length can be fixed or variable e.g. G.723.1 and G.729 Codecs
  • 16. 16 16Multimedia Services: Audio Audio Codecs and Delays ● Delays more or less appropriate for some types of transmission: – Low latency: less compression, higher bitrate e.g. for real time in VoIP or satellite communications – High latency: higher compression, lower bitrate e.g. for stored media, broadcasting or recording ● Origin of latencies: – Processing, depends on hardware – Inherent to each algorithm or codec (buffering is needed): ● Frame size (msec): related to number of samples inside the frame ● Look-ahead time: when needed to study correlation between actual and next frame ● Delay calculations: – In sender: Algorithm delay = Frame length + Look-ahead time – In both: Codec delay = 2 x Frame length + Look-ahead time Codecs
  • 17. 17 17Multimedia Services: Audio Audio Codecs: CBR vs VBR ● CBR: Constant bitrate. Older ● VBR: Variable bitrate: – Frames of a file with distinct bitrates depending on variability of information, higher during more complex periods – Better quality vs size, but more complex to encode – Typical in lossless compression (e.g. FLAC, Apple Lossless) and in some lossy compressions (e.g. MP3, Opus, Vorbis, AAC) – Encoding in single-pass (“on the fly”) or multipass (not for real time or live streaming) – Input parameter: fixed quality, max/min bitrate, average bitrate, file size Codecs
  • 18. 18 18Multimedia Services: Audio Audio Codecs and Channels ● Mix two (stereo) or more channels of similar information reducing size but at high quality, instead of store and send independent channels ● Techiques (used in e.g. MP3, AAC, Vorbis) that may be combined for a signal: – Simple Stereo (SS): independent channels. No compression – Mid-side Stereo (MS): ● Middle = (L+R)/2, Side = (L-R)/2. ● Can benefit if signal is more “mono-like”, compressing new “Middle channel” – Intensity Stereo: ● Based on phychoacoustics, replaces both channels with a single signal plus directional information ● Better at low bitrates, worse at high bitrates Codecs
  • 19. 19 19Multimedia Services: Audio Audio Codecs comparisons Source: http://www.opus-codec.org/comparison/ Codecs
  • 20. 20 20Multimedia Services: Audio Example of Audio Codec: MP3 ● Versions: MPEG-1, MPEG-2, MPEG-2.5 Audio Layer III ● Specification defines decoder better than encoder. Distinct implementations for encoder, e.g. LAME ● Distinct bitrates and sampling rates depending on version ● Channels: 2 in MPEG-1 mode and up to 5.1 in MPEG-2 ● Algorithms: MDCT Hybrid Subband ● Supports CBR and VBR ● Licensing and patent war Codecs
  • 21. 21 21Multimedia Services: Audio Other examples of typical Audio Codecs ● AAC (Advanced Audio Coding), from ISO and IEC. Part of MPEG-2 and MPEG-4. Designed to replace MP3. Patent for coding, not for streaming or distributing contents ● Vorbis, from Xiph.Org foundation: typically inside Ogg or WebM containers. Based on MDCT. Open, Royalty- free ● Opus, from IETF. Suitable for interactive real-time. Based on CELT and SILK. Open, Royalty-free http://en.wikipedia.org/wiki/Category:Audio_codecs Codecs
  • 22. 22 22Multimedia Services: Audio Speech case ● Distinct to music ● Interactive ● Voiced speech: harmonics (at freq depending if male/female) ● Unvoiced signal: like white noise Speech
  • 23. 23 23Multimedia Services: Audio Speech Codecs ● Aim: intelligibility and speaker identification ● Specialized codecs, e.g.: – Better for music: Vorbis – Better for speech: GSM, Speex,... ● Distinct times: – Speech frame: time to encode a frame of speech – RTP Packet voice duration: time to packetize and send to the network e.g. for PCM: 20 msec ● Sometimes a VoIP tool provides several codecs to be selected manually or automatically, and can be changed during conversation Speech
  • 24. 24 24Multimedia Services: Audio Speech Codecs: Techniques and Codec Comparison ● Compression: remove short-term correlation (~ 1 msec) and long-term correlation (~ 5 to 10 msec). ● Techniques: Waveform, Parametric (Vocoders for speech), Hybrid Source: http://www-mobile.ecs.soton.ac.uk/speech_codecs/common_classes.html Speech
  • 25. 25 25Multimedia Services: Audio G.711 Codec ● Reference codec for comparison ● G.711 = “PCM of voice frequencies” 8k samples/sec x 8 bits/sample = 64 kbps ● Voice quantisation: non-uniform logarithmic quantisation because of its nature of voice: lower level speech signal has higher PDF (Probability Density Function) than higher speech ● Variations: – µ-law (North America, Japan): 14 bits to 8 bits – A-law (Europe): 13 bits to 8 bits Speech
  • 26. 26 26Multimedia Services: Audio Speech compression: Waveform based technique ● Method: Remove rendundancy in waveform and reconstruct. ● Complexity: low ● Results: 16 kpbs to 64 kbps ● Examples of Codecs: – PCM – ADPCM (Adaptative Differential PCM), for NB and WB Speech
  • 27. 27 27Multimedia Services: Audio Speech compression: Parametric based technique ● Method: – Take segments of short periods (~20 msec) and classifies them in voiced or unvoiced – The voice parameters of each segment are obtained via speech analysis, encoded and sent ● Complexity: high ● Results: better compression ratios, bad quality ● Examples of Codecs: – LPC (Linear Prediction Coding); 1,2 to 4,8 kbps. Used for secure wireless communications Speech
  • 28. 28 28Multimedia Services: Audio Speech compression: Hybrid based technique ● “Analysis-by-Synthesis coding” ● Method: Combines waveform and parametric ● Examples of Codecs: – CELP (Codebook Excitation Linear Prediction): 4,8 to 16 kbps. Mobile/wireless/satellite communications achieving toll quality (MOS over 4.0) – Other modern codecs: G.729, G.723.1, AMR, iLBC, SILK Speech
  • 29. 29 29Multimedia Services: Audio Speech codec examples Source: Cisco. Voice Over IP - Per Call Bandwidth Consumption Other important speech codecs: ● SILK, from Skype. Based on LPC. Not royalty-free ● iSAC (internet Speech Audio Codec): wideband and super wideband, open, royalty-free ● ILBC (internet Low Bitrate Codec): narrowband, open, royalty-free for WebRTC ● AMR (Adaptive Multi-Rate) or AMR-NB (Narrow Band) ● AMR-WB (Wideband) ● Speex http://en.wikipedia.org/wiki/Category:Speech_codecs Speech
  • 30. 30 30Multimedia Services: Audio Audio files ● No. of Channels: one (“mono”), two (“stereo”) or Multichannel ● Compression and codecs: – No compression: raw PCM,... – Lossless: FLAC, Apple Lossless .m4a, WMA lossless,... – Lossy: MP3, Vorbis, AAC,… Files, Containers and Formats http://en.wikipedia.org/wiki/Audio_file_format
  • 31. 31 31Multimedia Services: Audio Examples of Audio containers ● WAV: – Instance of [the more general] RIFF – Chunks: ● One or more chunks, e.g. 2 channels for stereo ● Can contain compressed audio data and non-audio data – Metadata for each chunk: ● Encoding (typically LPCM uncompressed), No. of channels, bits/channel, sample rate ● Labels: artist, comments,... ● From video: Ogg, MPEG-4 Part 14 or MP4 Files, Containers and Formats
  • 32. 32 32Multimedia Services: Audio Audio physical formats ● CD: – Reference: “Red book” – Digital audio encoding: ● 2-channel ● Signed 16-bit ● Linear PCM ● 44,100 Hz – Similar but distinct to WAV: no headers, tracks that match the CD's sector sizes ● Other supports and associated modulations and lossless codecs: Super Audio CD (SACD) Pulse density modulation + Direct Stream Transfer DVD-Audio, Blu-ray, (HD DVD) Meridian Lossless Packing Files, Containers and Formats
  • 33. 33 33Multimedia Services: Audio Audio hardware: wires and connectors ● Only for audio: – Analog: PC (colors scheme) – Digital: S/PDIF. Supports uncompressed PCM audio and compressed 5.1/7.1 surround sound ● Audio with video: – Analog: SCART – Digital: HDMI Wires and Connectors