lnu.sePublications
Change search
Link to record
Permanent link

Direct link
Publications (10 of 63) Show all publications
Fatemi, M., Sieranoja, S., Laitinen, M. & Fränti, P. (2025). Detecting Connectivity Patterns in Nordic Twittersphere by Cluster Analysis. SN Computer Science, 6(7), Article ID 815.
Open this publication in new window or tab >>Detecting Connectivity Patterns in Nordic Twittersphere by Cluster Analysis
2025 (English)In: SN Computer Science, ISSN 2662-995X, Vol. 6, no 7, article id 815Article in journal (Refereed) Published
Abstract [en]

We analyze Nordic social media users by clustering them based on their connections on Twitter. The data consists of 15,794 users in the five Nordic countries: Finland, Sweden, Norway, Denmark, and Iceland. We first create an undirected graph from the friendship relations (mutually following each other), then divide the graph into five clusters using a recent M-algorithm, and finally compare the results to users’ locations. The results demonstrate that the users are strongly clustered according to their home country. There is surprisingly little interaction across the countries despite the fact that they are, except for Iceland, physically close to each other and have cultural and linguistic similarities. The main language of the four countries belongs to the Germanic languages, while Finnish is typologically distinct. We further explore content from users in each country, analyzing its alignment with connectivity patterns. Our findings reveal a discrepancy between user-generated content similarity in the Nordic region and their connectivity patterns.

Place, publisher, year, edition, pages
Springer Nature, 2025
Keywords
Clustering, Community detection, Graph clustering, Nordic countries, Social networks, Twitter users
National Category
Computer and Information Sciences
Identifiers
urn:nbn:se:lnu:diva-142850 (URN)10.1007/s42979-025-04353-y (DOI)2-s2.0-105015490229 (Scopus ID)
Note

Correction published in: Fatemi, M., Sieranoja, S., Laitinen, M., & Fränti, P. (2025). Correction: Detecting Connectivity Patterns in Nordic Twittersphere by Cluster Analysis. SN Computer Science, 6(7), Article 882. https://doi.org/10.1007/s42979-025-04443-x

Available from: 2025-12-29 Created: 2025-12-29 Last updated: 2026-01-12Bibliographically approved
Laitinen, M., Rautionaho, P., Fatemi, M. & Halonen, M. (2025). Do we swear more with friends or with acquaintances? F#ck in social networks. Lingua, 320, Article ID 103931.
Open this publication in new window or tab >>Do we swear more with friends or with acquaintances? F#ck in social networks
2025 (English)In: Lingua, ISSN 0024-3841, E-ISSN 1872-6135, Vol. 320, article id 103931Article in journal (Refereed) Published
Abstract [en]

We investigate the uses of fuck in digital social networks from social media, Twitter/X in this case. Social media outlets have so far been predominantly treated as massive text collections, but they can be effectively used to investigate the role of social networks in shaping human communication. We use user-generated texts from 5,660 social networks (with 435,345 users and 7.8 billion words) from three settings (UK, US, and Australia). With embedded network information, this massive dataset enables us to investigate how network properties, that of the size and the strength of the network, influence the use of offensive words in these three settings. Our findings show that Americans use fuck most frequently, while Australians least frequently but they are highly creative with spelling variants of the word. Contrary to prior studies, we observe that people on this social media application swear more with acquaintances than with friends, but only in smaller networks − in larger networks of >100 people, the differences level out. Overall, this study highlights the benefits of using social media data that can be enriched to allow access to the social networks that people interact in.

Place, publisher, year, edition, pages
Elsevier, 2025
Keywords
Swearing in interaction, Social networks, Social media, Fuck, Sociolinguistics
National Category
Studies of Specific Languages
Research subject
Humanities, English
Identifiers
urn:nbn:se:lnu:diva-137666 (URN)10.1016/j.lingua.2025.103931 (DOI)001459245100001 ()2-s2.0-105001098112 (Scopus ID)
Funder
European CommissionAcademy of Finland, 345640Academy of Finland, 358725Academy of Finland, 364048Academy of Finland, 367757Academy of Finland, FIRI 2022\u201329
Available from: 2025-03-31 Created: 2025-03-31 Last updated: 2025-04-15Bibliographically approved
Laitinen, M. (2025). Finland, English in. In: Kingsley Bolton (Ed.), The Wiley Blackwell Encyclopedia of World Englishes: . Oxford: Blackwell Publishing
Open this publication in new window or tab >>Finland, English in
2025 (English)In: The Wiley Blackwell Encyclopedia of World Englishes / [ed] Kingsley Bolton, Oxford: Blackwell Publishing, 2025Chapter in book (Refereed)
Place, publisher, year, edition, pages
Oxford: Blackwell Publishing, 2025
National Category
Studies of Specific Languages
Research subject
Humanities, English
Identifiers
urn:nbn:se:lnu:diva-139905 (URN)10.1002/9781119518297.eowe00346 (DOI)9781119518310 (ISBN)9781119518297 (ISBN)
Available from: 2025-06-19 Created: 2025-06-19 Last updated: 2025-08-01Bibliographically approved
Lakaw, A. & Laitinen, M. (2025). Liverpool were superior tonight: Variation and change of verbal agreement patterns in L2 idiolects. In: 8th Conference of the International Society for the Linguistics of English: Book of abstracts. Paper presented at 8th Conference of the International Society for the Linguistics of English (ISLE-8), Santiago de Compostela, Spain, 1 - 4 September, 2025 (pp. 36-36).
Open this publication in new window or tab >>Liverpool were superior tonight: Variation and change of verbal agreement patterns in L2 idiolects
2025 (English)In: 8th Conference of the International Society for the Linguistics of English: Book of abstracts, 2025, p. 36-36Conference paper, Oral presentation with published abstract (Refereed)
Abstract [en]

Plural verbal agreement with sports teams (“Liverpool have beaten Manchester United 4-0 tonight”) is a distinct feature of British English. Prior corpus-based studies show that plural verbal agreement tends to be more frequent with collective nouns in sport texts than in other news (Hundt 1998; Levin 2001). However, beyond these genre-level differences, there are significant gaps in our understanding, particularly regarding sociolinguistic and individual-level variation in this grammatical domain.

This presentation investigates the verbal agreement patterns of individuals who use English as a second language (L2) in public settings. Our study focuses on foreign-born football managers working in England, specifically in the English Premier League, which has attracted managers from various European countries since the late 1990s. These managers, who are public figures and native speakers of languages other than English, conduct pre- and post-game press conferences in English as part of their managerial duties. The individual informants were chosen because the agreement patterns in their native languages favor singular verbal agreement.

We are interested in whether our these L2 speakers of English use singular or plural verbal agreement when talking about their own teams and their opponents. Additionally, we aim to determine if these agreement forms remain stable or change during their careers in England.

To explore this phenomenon, we utilize an underused source for accessing individual variation: a large set of interviews and press conferences available online. We use automatically-generated transcripts from YouTube recordings, with data obtained through national digital humanities infrastructure projects in Finland and in Sweden. This dataset includes recordings of managers’ interviews and press conferences spanning roughly a decade from the mid-2010s onwards.

The topic is of interest to various audiences, not limited to those studying agreement patterns. Our unique data collection also appeals to people interested in digital humanities. The corpus-based observations not only shed light on verbal agreement patterns in proper noun + verb structures but also contribute to debates on how idiolects change in the short-term post-adolescence (Sankoff 2018). The results could provide insights into how non-native adult speakers handle stable lexico-grammatical changes in society. Prior studies have identified three trajectories of individual change: stability, adoption, and swimming against the community current.

National Category
Languages and Literature
Research subject
Humanities, English
Identifiers
urn:nbn:se:lnu:diva-143935 (URN)
Conference
8th Conference of the International Society for the Linguistics of English (ISLE-8), Santiago de Compostela, Spain, 1 - 4 September, 2025
Available from: 2026-01-12 Created: 2026-01-12 Last updated: 2026-01-20Bibliographically approved
Laitinen, M. & Rautionaho, P. (2025). Reuse of social media data in corpus linguistics. International Journal of Corpus Linguistics, 30(2), 171-194
Open this publication in new window or tab >>Reuse of social media data in corpus linguistics
2025 (English)In: International Journal of Corpus Linguistics, ISSN 1384-6655, E-ISSN 1569-9811, Vol. 30, no 2, p. 171-194Article in journal (Refereed) Published
Abstract [en]

The use of very large social media datasets in corpus linguistics has obviousbenefits. Such data represent a novel source of evidence when compared with structured digital text corpora. However, there is a clear need to assess critically how the effective reuse of data can be handled, how findings can be reproduced, and how results can be generalized. A relevant question concerns the presentation of data to ensure reproducibility and replicability. This article surveys the state-of-the-art of descriptions of data collection and methodological transparency in 30 studies that used Twitter/X as their data. The empirical section investigates how easy it would be to reproduce a study based on these descriptions. While we concentrate on evidence from one social media application, the discussion continues to a presentation of concrete steps that might be used to improve data management related to the reuse, discovery, and evaluation of social media data in general.

Place, publisher, year, edition, pages
John Benjamins Publishing Company, 2025
Keywords
social media data, replicability, reproducibility, metadata, research infrastructures
National Category
Studies of Specific Languages
Research subject
Humanities, English
Identifiers
urn:nbn:se:lnu:diva-139904 (URN)10.1075/ijcl.24136.lai (DOI)001507856200001 ()2-s2.0-105009278659 (Scopus ID)
Available from: 2025-06-19 Created: 2025-06-19 Last updated: 2025-10-13Bibliographically approved
Laitinen, M. & Fatemi, M. (2024). Testing the weak-tie hypothesis with social media. In: Céline Poudat & Mathilde Guernut (Ed.), Proceedings of the 11th Conference on CMC and Social Media Corpora for the Humanities. 11th Conference on CMC and Social Media Corpora for the Humanities (CMC 2024), CORLI; Université Côte d’Azur, France, 2024: . Paper presented at 11th Conference on CMC and Social Media Corpora for the Humanities (CMC 2024) (pp. 46-51). Nice
Open this publication in new window or tab >>Testing the weak-tie hypothesis with social media
2024 (English)In: Proceedings of the 11th Conference on CMC and Social Media Corpora for the Humanities. 11th Conference on CMC and Social Media Corpora for the Humanities (CMC 2024), CORLI; Université Côte d’Azur, France, 2024 / [ed] Céline Poudat & Mathilde Guernut, Nice, 2024, p. 46-51Conference paper, Published paper (Refereed)
Abstract [en]

This article combines the study of large-scale social media data with social network theory in sociolinguistics. Given that the purpose of social media is to form networks and communities, big data from social media applications could have substantial potential in deepening the understanding of the role of networks in language variation and change. The study first presents an algorithmic method suitable for directed-graph ego networks in computer-mediated communication. This method measures network strength and enables us to enrich social media data with a network parameter that indexes how strongly (or loosely) people in a network are connected to each other. We then use a large dataset of c. 4.8 billion words from nearly four thousand networks to study how network strength conditions linguistic change. The results show that online networks are highly similar to traditional offline networks, a finding that enables fixing a major methodological limitation in the study of weak ties, namely that the method is less suited for studying socially and geographically mobile individuals. This finding makes it possible to apply the theory of social networks in sociolinguistics to very large digital networks in social media.

Place, publisher, year, edition, pages
Nice: , 2024
Series
Proceedings of the 11th International Conference on CMC and Social Media Corpora for the Humanities
Keywords
social networks, data-intensive methods, social media
National Category
Studies of Specific Languages
Research subject
Humanities, English
Identifiers
urn:nbn:se:lnu:diva-137669 (URN)
Conference
11th Conference on CMC and Social Media Corpora for the Humanities (CMC 2024)
Funder
Academy of Finland
Available from: 2025-03-31 Created: 2025-03-31 Last updated: 2025-04-08Bibliographically approved
Laitinen, M. (2023). A history of personal pronouns in Standard English (1ed.). In: Laura Patterson (Ed.), The Routledge Handbook of Personal Pronouns: (pp. 29-43). London: Routledge
Open this publication in new window or tab >>A history of personal pronouns in Standard English
2023 (English)In: The Routledge Handbook of Personal Pronouns / [ed] Laura Patterson, London: Routledge, 2023, 1, p. 29-43Chapter in book (Refereed)
Place, publisher, year, edition, pages
London: Routledge, 2023 Edition: 1
Series
Routledge Handbooks in Linguistics
National Category
Studies of Specific Languages
Research subject
Humanities, English
Identifiers
urn:nbn:se:lnu:diva-137667 (URN)10.4324/9781003349891-4 (DOI)2-s2.0-85176896087 (Scopus ID)9781032394749 (ISBN)9781003349891 (ISBN)
Available from: 2025-03-31 Created: 2025-03-31 Last updated: 2025-05-14Bibliographically approved
Laitinen, M. & Fatemi, M. (2023). Data-intensive sociolinguistics using social media. Annales Academiae Scientiarum Fennicae, 2023(2), 38-61
Open this publication in new window or tab >>Data-intensive sociolinguistics using social media
2023 (English)In: Annales Academiae Scientiarum Fennicae, ISSN 2953-9048, Vol. 2023, no 2, p. 38-61Article in journal (Refereed) Published
Abstract [en]

This article looks into using large-scale social media data in SSH research and in particular in studies of language variation and change. It presents a study that investigates the role of social networks in linguistic variability. Previous studies have convincingly shown that networks in which people are connected to each other in loose ties tend to contribute positively to language change. Conversely, networks in which people are closely connected to each other inhibit change. This conclusion is, however, based on small datasets from small networks, and this study tests if the difference is diluted when network size is closer to human average. The results from 3,935 networks suggest this to be the case. Towards the end, the article suggests numerous ways in which large-scale social media data and the use of data intensive methodologies could be increased and encouraged in SSH research.

Place, publisher, year, edition, pages
Suomalainen tiedeakatemia, 2023
Keywords
sociolinguistics, networks, big data, social media
National Category
Studies of Specific Languages
Identifiers
urn:nbn:se:lnu:diva-137668 (URN)10.57048/aasf.136177 (DOI)
Available from: 2025-03-31 Created: 2025-03-31 Last updated: 2025-04-15Bibliographically approved
Laitinen, M. & Fatemi, M. (2022). Big and rich social networks in computational sociolinguistics. In: Paula Rautionaho, Hanna Parviainen, Mark Kaunisto & Arja Nurmi (Ed.), Social and Regional Variation in World Englishes: Local and Global Perspectives (pp. 166-189). Routledge
Open this publication in new window or tab >>Big and rich social networks in computational sociolinguistics
2022 (English)In: Social and Regional Variation in World Englishes: Local and Global Perspectives / [ed] Paula Rautionaho, Hanna Parviainen, Mark Kaunisto & Arja Nurmi, Routledge, 2022, p. 166-189Chapter in book (Refereed)
Abstract [en]

Social media data have substantially enlarged the potential pools of evidence in the study of variation and change in English. They offer access to language use of large numbers of informants, but the downside is that they contain inadequate social background information. This seriously restricts the theoretical insight, limiting investigations to the study of actuation of change and leaving out those that focus on diffusion. It is widely held that while actuation is largely functional, diffusion requires information on the broader social structures within which speakers belong. We present an algorithmic method for adding social information to Twitter. Our method builds on interaction parameters (size and structure of networks, similarity, and communication frequency). Using such participant-centred information as a proxy for social information increases the empirical validity substantially. In addition, the article presents a case study of how networks of varying strength condition ongoing change. The data are drawn from five metropolitan centres in the UK and from English as a lingua franca setting in the Nordic region. The results suggest that network size plays an important, yet understudied, role in the social network theory in sociolinguistics.

Place, publisher, year, edition, pages
Routledge, 2022
Series
Paula Rautionaho, Hanna Parviainen, Mark Kaunisto & Arja Nurmi
National Category
General Language Studies and Linguistics
Research subject
Humanities, English
Identifiers
urn:nbn:se:lnu:diva-120019 (URN)10.4324/9781003227342-9 (DOI)2-s2.0-85138257958 (Scopus ID)9781032130361 (ISBN)9781032130392 (ISBN)9781003227342 (ISBN)
Available from: 2023-03-30 Created: 2023-03-30 Last updated: 2025-06-12Bibliographically approved
Taipale, I. & Laitinen, M. (2022). Individual Sensitivity to Change in the Lingua Franca Use of English. Frontiers in Communication, 6, 1-15, Article ID 737017.
Open this publication in new window or tab >>Individual Sensitivity to Change in the Lingua Franca Use of English
2022 (English)In: Frontiers in Communication, E-ISSN 2297-900X, Vol. 6, p. 1-15, article id 737017Article in journal (Refereed) Published
Abstract [en]

The study of ongoing change in English typically focuses on studying evidence from codified varieties of the language. Recent corpus studies show, however, that advanced non-native users of English may display heightened sensitivity to features undergoing frequency shifts similar to that experienced by native speakers. In addition, most studies aiming to detect patterns of linguistic regularity utilize large data sets that attempt to minimize the presence of the individual. In this study, we focus on change in ELF and place non-native individuals at the center of attention. Our empirical section examines how aggregated features that are currently undergoing change in codified varieties of English vary in the repertoires of ELF users of Twitter. To carry out this task, this study utilizes geo-tagged tweets retrieved from the Nordic Tweet Stream. The data obtained from this real-time monitor corpus are freely available for research and re-use at . For the analysis itself, we selected the idiolects of 150 individual users who actively tweet in English from geographically varying locations in Finland. As American English predominates with several patterns of linguistic change in codified varieties of English, a simplified dichotomy between American and British features is utilized as a conceptual tool for inspecting variation. The idiolects are analyzed from the perspective of spelling and lexico-grammatical and morphological variation, such as V + -ing |V + infinitive (e.g. start doing | start to do) and expanded predicates (e.g. take a look | have a look). The quantitative observations show that, particularly in the case of grammatical features, ELF speakers appear to have generally adhered to ongoing linguistic change.

Place, publisher, year, edition, pages
Frontiers Media S.A., 2022
Keywords
individual variation, idiolect, ongoing change, English as a lingua franca, second language, sensitivity, americanization
National Category
General Language Studies and Linguistics
Research subject
Humanities, English
Identifiers
urn:nbn:se:lnu:diva-110692 (URN)10.3389/fcomm.2021.737017 (DOI)000753642600001 ()2-s2.0-85124574132 (Scopus ID)2022 (Local ID)2022 (Archive number)2022 (OAI)
Available from: 2022-03-03 Created: 2022-03-03 Last updated: 2025-08-21Bibliographically approved
Organisations
Identifiers
ORCID iD: ORCID iD iconorcid.org/0000-0003-3123-6932

Search in DiVA

Show all publications