Skip to main content

Lund University Publications

LUND UNIVERSITY LIBRARIES

On variable rate frame independent predictive speech coding: Re-engineering iLBC

Garrido, Christopher M. ; Murthi, Manohar N. and Andersen, Sören Vang LU (2006) IEEE International Conference on Acoustics, Speech, and Signal Processing, 2006 p.717-720
Abstract
The Internet Low Bit-rate Coder (iLBC) is now widely used for Voice over Internet Protocol (VoIP) applications. Unlike speech coders such as those based on Code Excited Linear Prediction (CELP), the iLBC achieves superior robustness to packet loss by avoiding inter-frame coding dependencies. While robustness to packet loss is essential, a VoIP codec should also possess the flexibility to change its source coding rate in order to counter network congestion and facilitate joint source channel coding for wireless channels. Previously, we presented a new variation of the iLBC encoding procedure which yielded a more efficient, rate-flexible result. In an effort to improve performance at lower source rates, we present various improvements to the... (More)
The Internet Low Bit-rate Coder (iLBC) is now widely used for Voice over Internet Protocol (VoIP) applications. Unlike speech coders such as those based on Code Excited Linear Prediction (CELP), the iLBC achieves superior robustness to packet loss by avoiding inter-frame coding dependencies. While robustness to packet loss is essential, a VoIP codec should also possess the flexibility to change its source coding rate in order to counter network congestion and facilitate joint source channel coding for wireless channels. Previously, we presented a new variation of the iLBC encoding procedure which yielded a more efficient, rate-flexible result. In an effort to improve performance at lower source rates, we present various improvements to the original framework. Specifically, we reallocate bits from the Adaptive Codebook) procedure; reduce the length of the start state vector; utilize an adaptive pulse gain quantization scheme; and extend the use of entropy coding. Overall, the various combined improvements result in the modified iLBC (with entropy coding) achieving a rate reduction of 2.0 to 2.9 kbps when compared to the original fixed-rate iLBC without any loss in quality. In comparisons with Adaptive MuitiRate (AMR), the modified iLBC coder remarkably exhibits equivalent Perceptual Evaluation of Speech Quality (PESQ) scores as the AMR coder at 10.2 and 12.2 kbps, and out-performs AMR for all packet loss rates. This is a significant result as the modified iLBC performs equivalent to AMR without exploiting inter-frame redundancies. (Less)
Please use this url to cite or link to this publication:
author
; and
publishing date
type
Chapter in Book/Report/Conference proceeding
publication status
published
subject
host publication
2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13
pages
717 - 720
publisher
IEEE - Institute of Electrical and Electronics Engineers Inc.
conference name
IEEE International Conference on Acoustics, Speech, and Signal Processing, 2006
conference location
Toulouse, France
conference dates
2006-05-14 - 2006-05-19
external identifiers
  • wos:000245559901062
  • scopus:33947707326
ISSN
1520-6149
language
English
LU publication?
no
id
57d825aa-a9fc-4689-9f43-3cc6c4285615 (old id 4092526)
date added to LUP
2016-04-01 16:30:50
date last changed
2022-04-22 22:34:58
@inproceedings{57d825aa-a9fc-4689-9f43-3cc6c4285615,
  abstract     = {{The Internet Low Bit-rate Coder (iLBC) is now widely used for Voice over Internet Protocol (VoIP) applications. Unlike speech coders such as those based on Code Excited Linear Prediction (CELP), the iLBC achieves superior robustness to packet loss by avoiding inter-frame coding dependencies. While robustness to packet loss is essential, a VoIP codec should also possess the flexibility to change its source coding rate in order to counter network congestion and facilitate joint source channel coding for wireless channels. Previously, we presented a new variation of the iLBC encoding procedure which yielded a more efficient, rate-flexible result. In an effort to improve performance at lower source rates, we present various improvements to the original framework. Specifically, we reallocate bits from the Adaptive Codebook) procedure; reduce the length of the start state vector; utilize an adaptive pulse gain quantization scheme; and extend the use of entropy coding. Overall, the various combined improvements result in the modified iLBC (with entropy coding) achieving a rate reduction of 2.0 to 2.9 kbps when compared to the original fixed-rate iLBC without any loss in quality. In comparisons with Adaptive MuitiRate (AMR), the modified iLBC coder remarkably exhibits equivalent Perceptual Evaluation of Speech Quality (PESQ) scores as the AMR coder at 10.2 and 12.2 kbps, and out-performs AMR for all packet loss rates. This is a significant result as the modified iLBC performs equivalent to AMR without exploiting inter-frame redundancies.}},
  author       = {{Garrido, Christopher M. and Murthi, Manohar N. and Andersen, Sören Vang}},
  booktitle    = {{2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13}},
  issn         = {{1520-6149}},
  language     = {{eng}},
  pages        = {{717--720}},
  publisher    = {{IEEE - Institute of Electrical and Electronics Engineers Inc.}},
  title        = {{On variable rate frame independent predictive speech coding: Re-engineering iLBC}},
  year         = {{2006}},
}