Advanced

Towards ILBC speech coding at lower rates through a new formulation of the start state search

Garrido, CM; Murthi, MN and Andersen, Sören Vang LU (2005) 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) In 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING p.769-772
Abstract
The Internet Low Bit-rate Coder (iLBC) has emerged as a candidate for Voice over Internet Protocol (VoIP) applications. By avoiding the inter-frame coding dependencies endemic to many speech coders such as those based on Code Excited Linear Prediction, iLBC is able to achieve superior robustness to packet loss. In addition to robustness to packet loss, a VoIP codec should possess the agility to adjust its source coding rate in order to react to network congestion and to be amenable to joint source channel coding for wireless channels. Towards this end, we develop a new formulation of the iLBC encoding process that allows for a variable rate iLBC. In particular, we demonstrate how the LP excitation signal is constructed from a much shorter... (More)
The Internet Low Bit-rate Coder (iLBC) has emerged as a candidate for Voice over Internet Protocol (VoIP) applications. By avoiding the inter-frame coding dependencies endemic to many speech coders such as those based on Code Excited Linear Prediction, iLBC is able to achieve superior robustness to packet loss. In addition to robustness to packet loss, a VoIP codec should possess the agility to adjust its source coding rate in order to react to network congestion and to be amenable to joint source channel coding for wireless channels. Towards this end, we develop a new formulation of the iLBC encoding process that allows for a variable rate iLBC. In particular, we demonstrate how the LP excitation signal is constructed from a much shorter vector of 'start state' samples through a non-square synthesis matrix that captures the effects of the Adaptive Codebook operations. With this new framework, the search and quantization of the start state is re-formulated as an Analysis by Synthesis matching problem. We demonstrate how a Multi-Pulse (MP) approach can be utilized to effect a variable rate coding solution for this new framework. A variable rate coder with the MP start state achieves better performance than the Adaptive Multi-Rate (AMR) coder at 12.2 and 10.2 kbps for packet loss rates greater than 4%. (Less)
Please use this url to cite or link to this publication:
author
publishing date
type
Chapter in Book/Report/Conference proceeding
publication status
published
subject
in
2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING
pages
769 - 772
publisher
IEEE--Institute of Electrical and Electronics Engineers Inc.
conference name
2005 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
external identifiers
  • wos:000229404200193
  • scopus:33646764229
ISSN
1520-6149
language
English
LU publication?
no
id
c8fe2f10-34fb-4b61-8f59-65a94e83db45 (old id 4092541)
date added to LUP
2013-10-17 10:42:40
date last changed
2017-03-12 04:13:04
@inproceedings{c8fe2f10-34fb-4b61-8f59-65a94e83db45,
  abstract     = {The Internet Low Bit-rate Coder (iLBC) has emerged as a candidate for Voice over Internet Protocol (VoIP) applications. By avoiding the inter-frame coding dependencies endemic to many speech coders such as those based on Code Excited Linear Prediction, iLBC is able to achieve superior robustness to packet loss. In addition to robustness to packet loss, a VoIP codec should possess the agility to adjust its source coding rate in order to react to network congestion and to be amenable to joint source channel coding for wireless channels. Towards this end, we develop a new formulation of the iLBC encoding process that allows for a variable rate iLBC. In particular, we demonstrate how the LP excitation signal is constructed from a much shorter vector of 'start state' samples through a non-square synthesis matrix that captures the effects of the Adaptive Codebook operations. With this new framework, the search and quantization of the start state is re-formulated as an Analysis by Synthesis matching problem. We demonstrate how a Multi-Pulse (MP) approach can be utilized to effect a variable rate coding solution for this new framework. A variable rate coder with the MP start state achieves better performance than the Adaptive Multi-Rate (AMR) coder at 12.2 and 10.2 kbps for packet loss rates greater than 4%.},
  author       = {Garrido, CM and Murthi, MN and Andersen, Sören Vang},
  booktitle    = {2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING},
  issn         = {1520-6149},
  language     = {eng},
  pages        = {769--772},
  publisher    = {IEEE--Institute of Electrical and Electronics Engineers Inc.},
  title        = {Towards ILBC speech coding at lower rates through a new formulation of the start state search},
  year         = {2005},
}