Variables are valuable : making a case for deductive modeling

Tizon-Couto, David; Lorenz, David

Variables are valuable : making a case for deductive modeling

Mark

Tizon-Couto, David and Lorenz, David ^LU

(2021) In Linguistics 59(5). p.1279-1309

Abstract: Following the quantitative turn in linguistics, the field appears to be in a
methodological “wild west” state where much is possible and new frontiers are
being explored, but there is relatively little guidance in terms of firm rules or
conventions. In this article, we focus on the issue of variable selection in regression modeling. It is common to aim for a “minimal adequate model” and eliminate “non-significant” variables by statistical procedures. We advocate an alternative, “deductive modeling” approach that retains a “full” model of variables generated from our research questions and objectives. Comparing the statistical model to a camera, i.e., a tool to produce an image of reality, we contrast the deductive and... (More); Following the quantitative turn in linguistics, the field appears to be in a
methodological “wild west” state where much is possible and new frontiers are
being explored, but there is relatively little guidance in terms of firm rules or
conventions. In this article, we focus on the issue of variable selection in regression modeling. It is common to aim for a “minimal adequate model” and eliminate “non-significant” variables by statistical procedures. We advocate an alternative, “deductive modeling” approach that retains a “full” model of variables generated from our research questions and objectives. Comparing the statistical model to a camera, i.e., a tool to produce an image of reality, we contrast the deductive and predictive (minimal) modeling approaches on a dataset from a corpus study. While a minimal adequate model is more parsimonious, its selection procedure is blind to the research aim and may conceal relevant information. Deductive models, by contrast, are grounded in theory, have higher transparency (all relevant variables are reported) and potentially a greater accuracy of the reported effects. They are useful for answering research questions more directly, as they rely explicitly on prior knowledge and hypotheses, and allow for estimation and comparison across datasets. (Less)

Please use this url to cite or link to this publication: https://lup.lub.lu.se/record/c8565fba-857b-4dbc-9b3d-a0f9678eb0ac

author

Tizon-Couto, David and Lorenz, David ^LU

publishing date

2021-09-02

type

Contribution to journal

publication status

published

subject

keywords

effect estimation, statistical modeling, theory and data, variable selection

in

Linguistics

volume

59

issue

5

pages

31 pages

publisher

De Gruyter

external identifiers

scopus:85115664806

ISSN

1613-396X

DOI

10.1515/ling-2019-0050

language

English

LU publication?

no

id

c8565fba-857b-4dbc-9b3d-a0f9678eb0ac

date added to LUP

2023-11-06 19:51:29

date last changed

2025-04-04 13:52:57

@article{c8565fba-857b-4dbc-9b3d-a0f9678eb0ac,
  abstract     = {{Following the quantitative turn in linguistics, the field appears to be in a<br/>methodological “wild west” state where much is possible and new frontiers are<br/>being explored, but there is relatively little guidance in terms of firm rules or<br/>conventions. In this article, we focus on the issue of variable selection in regression modeling. It is common to aim for a “minimal adequate model” and eliminate “non-significant” variables by statistical procedures. We advocate an alternative, “deductive modeling” approach that retains a “full” model of variables generated from our research questions and objectives. Comparing the statistical model to a camera, i.e., a tool to produce an image of reality, we contrast the deductive and predictive (minimal) modeling approaches on a dataset from a corpus study. While a minimal adequate model is more parsimonious, its selection procedure is blind to the research aim and may conceal relevant information. Deductive models, by contrast, are grounded in theory, have higher transparency (all relevant variables are reported) and potentially a greater accuracy of the reported effects. They are useful for answering research questions more directly, as they rely explicitly on prior knowledge and hypotheses, and allow for estimation and comparison across datasets.}},
  author       = {{Tizon-Couto, David and Lorenz, David}},
  issn         = {{1613-396X}},
  keywords     = {{effect estimation; statistical modeling; theory and data; variable selection}},
  language     = {{eng}},
  month        = {{09}},
  number       = {{5}},
  pages        = {{1279--1309}},
  publisher    = {{De Gruyter}},
  series       = {{Linguistics}},
  title        = {{Variables are valuable : making a case for deductive modeling}},
  url          = {{http://dx.doi.org/10.1515/ling-2019-0050}},
  doi          = {{10.1515/ling-2019-0050}},
  volume       = {{59}},
  year         = {{2021}},
}

Lund University Publications

LUND UNIVERSITY LIBRARIES

Variables are valuable : making a case for deductive modeling