Sociolinguistic Features for Author Gender Identification: From Qualitative Evidence to Quantitative Analysis

Simaki, Vasiliki; Aravantinou, Christina; Mporas, Iosif; Kondyli, Marianna and Megalooikonomou, Vasileios (2017) In Journal of Quantitative Linguistics 24(1). p.65-84
Abstract

Theoretical and empirical studies prove the strong relationship between social factors and the individual linguistic attitudes. Different social categories, such as gender, age, education, profession and social status, are strongly related with the linguistic diversity of people’s everyday spoken and written interaction. In this paper, sociolinguistic studies addressed to gender differentiation are overviewed in order to identify how various linguistic characteristics differ between women and men. Thereafter, it is examined if and how these qualitative features can become quantitative metrics for the task of gender identification from texts on web blogs. The evaluation results showed that the “syntactic complexity”, the “tag questions”, the “period length”, the “adjectives” and the “vocabulary richness” characteristics seem to be significantly distinctive with respect to the author’s gender.

type
Contribution to journal
publication status
published
in
Journal of Quantitative Linguistics
volume
24
issue
1
pages
65 - 84
publisher
Taylor & Francis
external identifiers
  • scopus:84990196911
  • wos:000396571200004
ISSN
0929-6174
DOI
10.1080/09296174.2016.1226430
language
English
LU publication?
yes
id
f7a17a93-a046-4093-beea-8b8c71e395cb
date added to LUP
2016-10-21 07:07:19
date last changed
2018-04-29 04:32:22
@article{f7a17a93-a046-4093-beea-8b8c71e395cb,
  abstract     = {Theoretical and empirical studies prove the strong relationship between social factors and the individual linguistic attitudes. Different social categories, such as gender, age, education, profession and social status, are strongly related with the linguistic diversity of people’s everyday spoken and written interaction. In this paper, sociolinguistic studies addressed to gender differentiation are overviewed in order to identify how various linguistic characteristics differ between women and men. Thereafter, it is examined if and how these qualitative features can become quantitative metrics for the task of gender identification from texts on web blogs. The evaluation results showed that the “syntactic complexity”, the “tag questions”, the “period length”, the “adjectives” and the “vocabulary richness” characteristics seem to be significantly distinctive with respect to the author’s gender.},
  author       = {Simaki, Vasiliki and Aravantinou, Christina and Mporas, Iosif and Kondyli, Marianna and Megalooikonomou, Vasileios},
  issn         = {0929-6174},
  language     = {eng},
  number       = {1},
  pages        = {65--84},
  publisher    = {Taylor \& Francis},
  journal      = {Journal of Quantitative Linguistics},
  title        = {Sociolinguistic Features for Author Gender Identification: From Qualitative Evidence to Quantitative Analysis},
  url          = {http://dx.doi.org/10.1080/09296174.2016.1226430},
  volume       = {24},
  year         = {2017},
}