Sociolinguistic Features for Author Gender Identification : From Qualitative Evidence to Quantitative Analysis
(2017) In Journal of Quantitative Linguistics 24(1). p.65-84- Abstract
Theoretical and empirical studies prove the strong relationship between social factors and the individual linguistic attitudes. Different social categories, such as gender, age, education, profession and social status, are strongly related with the linguistic diversity of people’s everyday spoken and written interaction. In this paper, sociolinguistic studies addressed to gender differentiation are overviewed in order to identify how various linguistic characteristics differ between women and men. Thereafter, it is examined if and how these qualitative features can become quantitative metrics for the task of gender identification from texts on web blogs. The evaluation results showed that the “syntactic complexity”, the “tag questions”,... (More)
Theoretical and empirical studies prove the strong relationship between social factors and the individual linguistic attitudes. Different social categories, such as gender, age, education, profession and social status, are strongly related with the linguistic diversity of people’s everyday spoken and written interaction. In this paper, sociolinguistic studies addressed to gender differentiation are overviewed in order to identify how various linguistic characteristics differ between women and men. Thereafter, it is examined if and how these qualitative features can become quantitative metrics for the task of gender identification from texts on web blogs. The evaluation results showed that the “syntactic complexity”, the “tag questions”, the “period length”, the “adjectives” and the “vocabulary richness” characteristics seem to be significantly distinctive with respect to the author’s gender.
(Less)
- author
- Simaki, Vasiliki LU ; Aravantinou, Christina ; Mporas, Iosif ; Kondyli, Marianna and Megalooikonomou, Vasileios
- organization
- publishing date
- 2017-01
- type
- Contribution to journal
- publication status
- published
- subject
- in
- Journal of Quantitative Linguistics
- volume
- 24
- issue
- 1
- pages
- 65 - 84
- publisher
- Taylor & Francis
- external identifiers
-
- wos:000396571200004
- scopus:84990196911
- ISSN
- 0929-6174
- DOI
- 10.1080/09296174.2016.1226430
- language
- English
- LU publication?
- yes
- id
- f7a17a93-a046-4093-beea-8b8c71e395cb
- date added to LUP
- 2016-10-21 07:07:19
- date last changed
- 2025-01-12 13:35:42
@article{f7a17a93-a046-4093-beea-8b8c71e395cb, abstract = {{<p>Theoretical and empirical studies prove the strong relationship between social factors and the individual linguistic attitudes. Different social categories, such as gender, age, education, profession and social status, are strongly related with the linguistic diversity of people’s everyday spoken and written interaction. In this paper, sociolinguistic studies addressed to gender differentiation are overviewed in order to identify how various linguistic characteristics differ between women and men. Thereafter, it is examined if and how these qualitative features can become quantitative metrics for the task of gender identification from texts on web blogs. The evaluation results showed that the “syntactic complexity”, the “tag questions”, the “period length”, the “adjectives” and the “vocabulary richness” characteristics seem to be significantly distinctive with respect to the author’s gender.</p>}}, author = {{Simaki, Vasiliki and Aravantinou, Christina and Mporas, Iosif and Kondyli, Marianna and Megalooikonomou, Vasileios}}, issn = {{0929-6174}}, language = {{eng}}, number = {{1}}, pages = {{65--84}}, publisher = {{Taylor & Francis}}, series = {{Journal of Quantitative Linguistics}}, title = {{Sociolinguistic Features for Author Gender Identification : From Qualitative Evidence to Quantitative Analysis}}, url = {{http://dx.doi.org/10.1080/09296174.2016.1226430}}, doi = {{10.1080/09296174.2016.1226430}}, volume = {{24}}, year = {{2017}}, }