Skip to main content

Lund University Publications

LUND UNIVERSITY LIBRARIES

Variables are valuable : making a case for deductive modeling

Tizon-Couto, David and Lorenz, David LU orcid (2021) In Linguistics 59(5). p.1279-1309
Abstract
Following the quantitative turn in linguistics, the field appears to be in a
methodological “wild west” state where much is possible and new frontiers are
being explored, but there is relatively little guidance in terms of firm rules or
conventions. In this article, we focus on the issue of variable selection in regression modeling. It is common to aim for a “minimal adequate model” and eliminate “non-significant” variables by statistical procedures. We advocate an alternative, “deductive modeling” approach that retains a “full” model of variables generated from our research questions and objectives. Comparing the statistical model to a camera, i.e., a tool to produce an image of reality, we contrast the deductive and... (More)
Following the quantitative turn in linguistics, the field appears to be in a
methodological “wild west” state where much is possible and new frontiers are
being explored, but there is relatively little guidance in terms of firm rules or
conventions. In this article, we focus on the issue of variable selection in regression modeling. It is common to aim for a “minimal adequate model” and eliminate “non-significant” variables by statistical procedures. We advocate an alternative, “deductive modeling” approach that retains a “full” model of variables generated from our research questions and objectives. Comparing the statistical model to a camera, i.e., a tool to produce an image of reality, we contrast the deductive and predictive (minimal) modeling approaches on a dataset from a corpus study. While a minimal adequate model is more parsimonious, its selection procedure is blind to the research aim and may conceal relevant information. Deductive models, by contrast, are grounded in theory, have higher transparency (all relevant variables are reported) and potentially a greater accuracy of the reported effects. They are useful for answering research questions more directly, as they rely explicitly on prior knowledge and hypotheses, and allow for estimation and comparison across datasets. (Less)
Please use this url to cite or link to this publication:
author
and
publishing date
type
Contribution to journal
publication status
published
subject
keywords
effect estimation, statistical modeling, theory and data, variable selection
in
Linguistics
volume
59
issue
5
pages
31 pages
publisher
De Gruyter
external identifiers
  • scopus:85115664806
ISSN
1613-396X
DOI
10.1515/ling-2019-0050
language
English
LU publication?
no
id
c8565fba-857b-4dbc-9b3d-a0f9678eb0ac
date added to LUP
2023-11-06 19:51:29
date last changed
2023-11-28 14:04:58
@article{c8565fba-857b-4dbc-9b3d-a0f9678eb0ac,
  abstract     = {{Following the quantitative turn in linguistics, the field appears to be in a<br/>methodological “wild west” state where much is possible and new frontiers are<br/>being explored, but there is relatively little guidance in terms of firm rules or<br/>conventions. In this article, we focus on the issue of variable selection in regression modeling. It is common to aim for a “minimal adequate model” and eliminate “non-significant” variables by statistical procedures. We advocate an alternative, “deductive modeling” approach that retains a “full” model of variables generated from our research questions and objectives. Comparing the statistical model to a camera, i.e., a tool to produce an image of reality, we contrast the deductive and predictive (minimal) modeling approaches on a dataset from a corpus study. While a minimal adequate model is more parsimonious, its selection procedure is blind to the research aim and may conceal relevant information. Deductive models, by contrast, are grounded in theory, have higher transparency (all relevant variables are reported) and potentially a greater accuracy of the reported effects. They are useful for answering research questions more directly, as they rely explicitly on prior knowledge and hypotheses, and allow for estimation and comparison across datasets.}},
  author       = {{Tizon-Couto, David and Lorenz, David}},
  issn         = {{1613-396X}},
  keywords     = {{effect estimation; statistical modeling; theory and data; variable selection}},
  language     = {{eng}},
  month        = {{09}},
  number       = {{5}},
  pages        = {{1279--1309}},
  publisher    = {{De Gruyter}},
  series       = {{Linguistics}},
  title        = {{Variables are valuable : making a case for deductive modeling}},
  url          = {{http://dx.doi.org/10.1515/ling-2019-0050}},
  doi          = {{10.1515/ling-2019-0050}},
  volume       = {{59}},
  year         = {{2021}},
}