Variables are valuable : making a case for deductive modeling
(2021) In Linguistics 59(5). p.1279-1309- Abstract
- Following the quantitative turn in linguistics, the field appears to be in a
methodological “wild west” state where much is possible and new frontiers are
being explored, but there is relatively little guidance in terms of firm rules or
conventions. In this article, we focus on the issue of variable selection in regression modeling. It is common to aim for a “minimal adequate model” and eliminate “non-significant” variables by statistical procedures. We advocate an alternative, “deductive modeling” approach that retains a “full” model of variables generated from our research questions and objectives. Comparing the statistical model to a camera, i.e., a tool to produce an image of reality, we contrast the deductive and... (More) - Following the quantitative turn in linguistics, the field appears to be in a
methodological “wild west” state where much is possible and new frontiers are
being explored, but there is relatively little guidance in terms of firm rules or
conventions. In this article, we focus on the issue of variable selection in regression modeling. It is common to aim for a “minimal adequate model” and eliminate “non-significant” variables by statistical procedures. We advocate an alternative, “deductive modeling” approach that retains a “full” model of variables generated from our research questions and objectives. Comparing the statistical model to a camera, i.e., a tool to produce an image of reality, we contrast the deductive and predictive (minimal) modeling approaches on a dataset from a corpus study. While a minimal adequate model is more parsimonious, its selection procedure is blind to the research aim and may conceal relevant information. Deductive models, by contrast, are grounded in theory, have higher transparency (all relevant variables are reported) and potentially a greater accuracy of the reported effects. They are useful for answering research questions more directly, as they rely explicitly on prior knowledge and hypotheses, and allow for estimation and comparison across datasets. (Less)
Please use this url to cite or link to this publication:
https://lup.lub.lu.se/record/c8565fba-857b-4dbc-9b3d-a0f9678eb0ac
- author
- Tizon-Couto, David and Lorenz, David LU
- publishing date
- 2021-09-02
- type
- Contribution to journal
- publication status
- published
- subject
- keywords
- effect estimation, statistical modeling, theory and data, variable selection
- in
- Linguistics
- volume
- 59
- issue
- 5
- pages
- 31 pages
- publisher
- De Gruyter
- external identifiers
-
- scopus:85115664806
- ISSN
- 1613-396X
- DOI
- 10.1515/ling-2019-0050
- language
- English
- LU publication?
- no
- id
- c8565fba-857b-4dbc-9b3d-a0f9678eb0ac
- date added to LUP
- 2023-11-06 19:51:29
- date last changed
- 2023-11-28 14:04:58
@article{c8565fba-857b-4dbc-9b3d-a0f9678eb0ac, abstract = {{Following the quantitative turn in linguistics, the field appears to be in a<br/>methodological “wild west” state where much is possible and new frontiers are<br/>being explored, but there is relatively little guidance in terms of firm rules or<br/>conventions. In this article, we focus on the issue of variable selection in regression modeling. It is common to aim for a “minimal adequate model” and eliminate “non-significant” variables by statistical procedures. We advocate an alternative, “deductive modeling” approach that retains a “full” model of variables generated from our research questions and objectives. Comparing the statistical model to a camera, i.e., a tool to produce an image of reality, we contrast the deductive and predictive (minimal) modeling approaches on a dataset from a corpus study. While a minimal adequate model is more parsimonious, its selection procedure is blind to the research aim and may conceal relevant information. Deductive models, by contrast, are grounded in theory, have higher transparency (all relevant variables are reported) and potentially a greater accuracy of the reported effects. They are useful for answering research questions more directly, as they rely explicitly on prior knowledge and hypotheses, and allow for estimation and comparison across datasets.}}, author = {{Tizon-Couto, David and Lorenz, David}}, issn = {{1613-396X}}, keywords = {{effect estimation; statistical modeling; theory and data; variable selection}}, language = {{eng}}, month = {{09}}, number = {{5}}, pages = {{1279--1309}}, publisher = {{De Gruyter}}, series = {{Linguistics}}, title = {{Variables are valuable : making a case for deductive modeling}}, url = {{http://dx.doi.org/10.1515/ling-2019-0050}}, doi = {{10.1515/ling-2019-0050}}, volume = {{59}}, year = {{2021}}, }