lnu.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Sociolinguistic Features for Author Gender Identification: From Qualitative Evidence to Quantitative Analysis
Linnaeus University, Faculty of Technology, Department of computer science and media technology (CM), Department of Computer Science. Lund University.ORCID iD: 0000-0002-8998-3618
University of Patras, Greece.
University of Hertfordshire, UK.
University of Patras, Greece.
Show others and affiliations
2017 (English)In: Journal of Quantitative Linguistics, ISSN 0929-6174, E-ISSN 1744-5035, Vol. 24, no 1, p. 65-84Article in journal (Refereed) Published
Abstract [en]

Theoretical and empirical studies prove the strong relationship between social factors and the individual linguistic attitudes. Different social categories, such as gender, age, education, profession and social status, are strongly related with the linguistic diversity of people's everyday spoken and written interaction. In this paper, sociolinguistic studies addressed to gender differentiation are overviewed in order to identify how various linguistic characteristics differ between women and men. Thereafter, it is examined if and how these qualitative features can become quantitative metrics for the task of gender identification from texts on web blogs. The evaluation results showed that the "syntactic complexity", the "tag questions", the "period length", the "adjectives" and the "vocabulary richness" characteristics seem to be significantly distinctive with respect to the author's gender.

Place, publisher, year, edition, pages
Routledge, 2017. Vol. 24, no 1, p. 65-84
National Category
Language Technology (Computational Linguistics) Computer Sciences
Research subject
Computer and Information Sciences Computer Science, Computer Science
Identifiers
URN: urn:nbn:se:lnu:diva-62067DOI: 10.1080/09296174.2016.1226430ISI: 000396571200004OAI: oai:DiVA.org:lnu-62067DiVA, id: diva2:1086632
Available from: 2017-04-03 Created: 2017-04-03 Last updated: 2018-06-01Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full text

Authority records BETA

Simaki, Vasiliki

Search in DiVA

By author/editor
Simaki, Vasiliki
By organisation
Department of Computer Science
In the same journal
Journal of Quantitative Linguistics
Language Technology (Computational Linguistics)Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 0 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf