When errors become the rule: A survey of Transformation-Based Learning

Uneson, Marcus

Uneson, Marcus (LU) (2011) In LU-DV-EX DATM01 20111
Computer Science

Abstract: Transformation-based learning (TBL) is a machine learning method for sequential classification, invented by Eric Brill (1993, 1995). It is widely used within natural language processing (but surprisingly little in other areas).

TBL is a simple yet flexible paradigm, which achieves competitive or even state-of-the-art performance in several areas and does not overtrain easily. It is especially successful at catching local, fixed-distance dependencies. The learned representation -- an ordered list of transformation rules -- is compact and efficient, with clear, declarative semantics. Individual rules are interpretable and often meaningful to humans.

The present thesis has two main parts. First and foremost, we offer a survey of the most important theoretical work on TBL. It is intended to be informal but relatively comprehensive, addressing a perceived gap in the literature. Second, in a more practical part, we describe a recursive, parallelizable rephrasing, well suited for declarative languages, of a fast imperative learning algorithm proposed by Ngai and Florian (2001). We implement and test this algorithm in the functional language Haskell.
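
As an illustration of what "an ordered list of transformation rules" can look like in practice, here is a minimal Python sketch of applying such a list to a tag sequence. It is not taken from the thesis (whose implementation is in Haskell). The `Rule` template (change a tag when the previous tag matches), the tag names, and the choice to check each rule's conditions against the sequence as it stood before that rule fired are all assumptions made for the example.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Rule:
    """Hypothetical rule template: change from_tag to to_tag
    when the immediately preceding tag is prev_tag."""
    from_tag: str
    to_tag: str
    prev_tag: str


def apply_rule(rule: Rule, tags: List[str]) -> List[str]:
    # Assumption: conditions are checked against the tags as they were
    # before this rule fired, so a change at position i does not affect
    # the test at position i + 1 within the same pass.
    out = list(tags)
    for i in range(1, len(tags)):
        if tags[i] == rule.from_tag and tags[i - 1] == rule.prev_tag:
            out[i] = rule.to_tag
    return out


def apply_rules(rules: List[Rule], tags: List[str]) -> List[str]:
    # Rules are applied strictly in list order; each rule sees the
    # output of all rules before it.
    for rule in rules:
        tags = apply_rule(rule, tags)
    return tags


# Made-up baseline tags for a phrase such as "to run the race".
initial = ["TO", "NN", "DT", "VB"]
rules = [
    Rule(from_tag="NN", to_tag="VB", prev_tag="TO"),
    Rule(from_tag="VB", to_tag="NN", prev_tag="DT"),
]
print(apply_rules(rules, initial))
```

Each rule is a small declarative statement that can be read on its own, which is the sense in which the learned representation is interpretable.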

Please use this url to cite or link to this publication: http://lup.lub.lu.se/student-papers/record/2007414

author: Uneson, Marcus (LU)
supervisor: Christos Levcopoulos (LU)
organization: Computer Science
course: DATM01 20111
year: 2011
type: M1 - University Diploma
subject: Technology and Engineering
keywords: computer science, computational linguistics, machine learning, sequential classification, Brill tagging, transformation-based learning
publication/series: LU-DV-EX
report number: 2011-157
ISSN: 1651-6389
language: English
id: 2007414
date added to LUP: 2011-12-08 11:16:29
date last changed: 2011-12-08 11:16:29

@misc{2007414,
  abstract     = {{Transformation-based learning (TBL) is a machine learning method for sequential classification, invented by Eric Brill (1993, 1995). It is widely used within natural language processing (but surprisingly little in other areas).

TBL is a simple yet flexible paradigm, which achieves competitive or even state-of-the-art performance in several areas and does not overtrain easily. It is especially successful at catching local, fixed-distance dependencies. The learned representation -- an ordered list of transformation rules -- is compact and efficient, with clear, declarative semantics. Individual rules are interpretable and often meaningful to humans.

The present thesis has two main parts. First and foremost, we offer a survey of the most important theoretical work on TBL. It is intended to be informal but relatively comprehensive, addressing a perceived gap in the literature. Second, in a more practical part, we describe a recursive, parallelizable rephrasing, well suited for declarative languages, of a fast imperative learning algorithm proposed by Ngai and Florian (2001). We implement and test this algorithm in the functional language Haskell.}},
  author       = {{Uneson, Marcus}},
  issn         = {{1651-6389}},
  language     = {{eng}},
  note         = {{Student Paper}},
  series       = {{LU-DV-EX}},
  title        = {{When errors become the rule: A survey of Transformation-Based Learning}},
  year         = {{2011}},
}

LUP Student Papers

LUND UNIVERSITY LIBRARIES
