Publications (10 of 49)
Hiltunen, T. & Tyrkkö, J. (2019). Academic Vocabulary in Wikipedia Articles: Frequency and Dispersion in Uneven Datasets. In: Carl Suhr, Terttu Nevalainen, Irma Taavitsainen (Ed.), From Data to Evidence in English Language Research (pp. 282-306). Leiden: Brill Academic Publishers
2019 (English). In: From Data to Evidence in English Language Research / [ed] Carl Suhr, Terttu Nevalainen, Irma Taavitsainen, Leiden: Brill Academic Publishers, 2019, p. 282-306. Chapter in book (Refereed)
Abstract [en]

Despite its popularity, the status of Wikipedia in higher education settings remains somewhat controversial, and the linguistic characteristics of the genre have not been exhaustively described. This exploratory paper takes a data-driven approach to assessing the use of academic vocabulary in Wikipedia articles. Our analysis is based on Coxhead’s Academic Word List, and the data comes from the Westbury Lab Wikipedia Corpus. We employ methods of statistical data analysis to classify Wikipedia articles according to the frequencies of academic words, and apply the same procedure to a comparable set of texts representing another genre, published research articles. The unsupervised classification procedure groups the articles according to academic content regardless of topic, which allows us to measure genre-specific similarities. The findings of the study show that academic words are common in both genres in focus, and more interestingly, if we look at aggregate frequencies of academic words, Wikipedia articles are not markedly different from RAs within the same discipline. This being said, we can observe disciplinary differences in the distribution of academic words in Wikipedia, such that Economics writing contains more academic words than the other two disciplines in focus. Disciplinary differences can likewise be observed in the distribution of individual academic words.

Place, publisher, year, edition, pages
Leiden: Brill Academic Publishers, 2019
Series
Language and Computers, ISSN 0921-5034 ; 83
Keywords
wikipedia, corpus linguistics, dispersion, statistics
National Category
Specific Languages
Research subject
Humanities, English
Identifiers
urn:nbn:se:lnu:diva-79630 (URN); 10.1163/9789004390652_013 (DOI); 978-90-04-39065-2 (ISBN); 978-90-04-39064-5 (ISBN)
Available from: 2019-01-18. Created: 2019-01-18. Last updated: 2019-02-11. Bibliographically approved
Reski, N., Alissandrakis, A. & Tyrkkö, J. (2019). Collaborative exploration of rich corpus data using immersive virtual reality and non-immersive technologies. In: ADDA: Approaches to Digital Discourse Analysis – ADDA 2, Turku, Finland 23-25 May 2019; Book of abstracts. Paper presented at 2nd International Conference: Approaches to Digital Discourse Analysis (ADDA 2), 23-25 May, 2019, Turku, Finland (p. 7). Turku: University of Turku
2019 (English). In: ADDA: Approaches to Digital Discourse Analysis – ADDA 2, Turku, Finland 23-25 May 2019; Book of abstracts, Turku: University of Turku, 2019, p. 7. Conference paper, Oral presentation with published abstract (Other academic)
Abstract [en]

In recent years, large textual data sets, comprising many data points and rich metadata, have become a common object of investigation and analysis. Information Visualization and Visual Analytics provide practical tools for visual data analysis, most commonly as interactive two-dimensional (2D) visualizations that are displayed through normal computer monitors. At the same time, display technologies have evolved rapidly over the past decade. In particular, emerging technologies such as virtual reality (VR), augmented reality (AR), or mixed reality (MR) have become affordable and more user-friendly (LaValle 2016). Under the banner of “Immersive Analytics”, researchers started to explore the novel application of such immersive technologies for the purpose of data analysis (Marriott et al. 2018).

By using immersive technologies, researchers hope to increase motivation and user engagement in the overall data analysis activity and to provide different perspectives on the data. This can be particularly helpful in exploratory data analysis, when the researcher attempts to identify interesting points or anomalies in the data without prior knowledge of what exactly they are searching for. Furthermore, the data analysis process often involves the collaborative sharing of information and knowledge between multiple users, with the goal of interpreting and making sense of the explored data together (Isenberg et al. 2011). However, immersive technologies such as VR are often single-user experiences, where one user wears a head-mounted display (HMD) device and is thus visually isolated from the real-world surroundings. Consequently, new tools and approaches for co-located, synchronous collaboration in such immersive data analysis scenarios are needed.

In this software demonstration, we present our VR system, which enables two users to explore data at the same time: one inside an immersive VR environment, and one outside VR using a non-immersive companion application. The demonstrated data analysis activity centers on the exploration of language variability in tweets from the perspectives of multilingualism and sociolinguistics (see, e.g., Coats 2017 and Grieve et al. 2017). Our primary data come from the Nordic Tweet Stream (NTS) corpus (Laitinen et al. 2018, Tyrkkö 2018), and the immersive VR application visualizes in three dimensions (3D) the clustered Twitter traffic within the Nordic region as stacked cuboids according to their geospatial position, where each stack represents a color-coded language share (Alissandrakis et al. 2018). Through 3D gestural input, the VR user can interact with the data using hand postures and gestures in order to move through the virtual 3D space, select clusters, display more detailed information, and navigate through time (Reski and Alissandrakis 2019) (https://vrxar.lnu.se/apps/odxvrxnts-360/). A non-immersive companion application, running in a normal web browser, presents an overview map of the Nordic region as well as other supplemental information about the data that is better suited to display using non-immersive technologies.

We will present two complementary applications, each with a different objective within the collaborative data analysis framework. The design and implementation of certain connectivity and collaboration features within these applications facilitate co-located, synchronous exploration and sensemaking. For instance, the VR user's position and orientation are displayed and updated in real time within the overview map of the non-immersive application. Conversely, the cluster selected by the non-immersive user is highlighted for the user in VR. Initial tests with pairs of language students validated the proof of concept of the collaborative system and encourage further investigation in this direction.

Place, publisher, year, edition, pages
Turku: University of Turku, 2019
Keywords
virtual reality, Nordic Tweet Stream, digital humanities, immersive analytics
National Category
Human Computer Interaction; General Language Studies and Linguistics; Language Technology (Computational Linguistics)
Research subject
Computer and Information Sciences Computer Science, Computer Science; Computer Science, Information and software visualization; Humanities, Linguistics
Identifiers
urn:nbn:se:lnu:diva-83858 (URN)
Conference
2nd International Conference: Approaches to Digital Discourse Analysis (ADDA 2), 23-25 May, 2019, Turku, Finland
Projects
DISA-DH; Open Data Exploration in Virtual Reality (ODxVR)
Available from: 2019-05-28. Created: 2019-05-28. Last updated: 2019-06-03. Bibliographically approved
Tyrkkö, J. (2019). Kinship References in the British Parliament, 1800-2005. In: Ursula Lutzky, Minna Nevala (Ed.), Reference and Identity in Public Discourses (pp. 97-124). Amsterdam: John Benjamins Publishing Company
2019 (English). In: Reference and Identity in Public Discourses / [ed] Ursula Lutzky, Minna Nevala, Amsterdam: John Benjamins Publishing Company, 2019, p. 97-124. Chapter in book (Refereed)
Abstract [en]

Family and kinship are fundamental concepts of human society and of political governance. Proceeding from the notion that frequency of reference can be taken as a simple measure of visibility and representation, this study examines diachronic trends of kinship reference in British parliamentary debates from 1800 to 2005. Using the Hansard Corpus and pattern-driven corpus linguistic methods, I show that changes in the frequencies of reference to kin reflect societal attitudes to gender roles.

Place, publisher, year, edition, pages
Amsterdam: John Benjamins Publishing Company, 2019
Series
Pragmatics & Beyond New Series, ISSN 0922-842X ; 306
Keywords
Hansard, parliamentary language, politics, language change, sociolinguistics, corpus
National Category
Specific Languages
Research subject
Humanities, English; Humanities, History
Identifiers
urn:nbn:se:lnu:diva-89772 (URN); 10.1075/pbns.306.04tyr (DOI); 9789027204202 (ISBN); 9789027262059 (ISBN)
Available from: 2019-10-22. Created: 2019-10-22. Last updated: 2020-01-08. Bibliographically approved
Tyrkkö, J. (2019). Lasikatto ja lasilattia: Monikielisyyden dynamiikkaa 1800-luvun Tampereella. In: Arja Nurmi, Saija Isomaa, Päivi Pahta (Ed.), Kielten ja kirjallisuuksien mosaiikki — Valta, periferia ja arki (pp. 133-164). Helsinki: Suomalaisen Kirjallisuuden Seura
2019 (Finnish). In: Kielten ja kirjallisuuksien mosaiikki — Valta, periferia ja arki / [ed] Arja Nurmi, Saija Isomaa, Päivi Pahta, Helsinki: Suomalaisen Kirjallisuuden Seura, 2019, p. 133-164. Chapter in book (Refereed)
Place, publisher, year, edition, pages
Helsinki: Suomalaisen Kirjallisuuden Seura, 2019
Series
Tietolipas ; 263
Keywords
multilingualism, history, Tampere, Finland, Finlayson
National Category
General Language Studies and Linguistics
Research subject
Humanities, Linguistics; Humanities, History
Identifiers
urn:nbn:se:lnu:diva-90238 (URN)
Available from: 2019-11-23. Created: 2019-11-23. Last updated: 2019-11-23
Hiltunen, T. & Tyrkkö, J. (2019). Manual to the LMEMT corpus. In: Irma Taavitsainen, Turo Hiltunen (Ed.), Late Modern English Medical Texts: Writing medicine in the eighteenth century (pp. 337-358). Amsterdam: John Benjamins Publishing Company
2019 (English). In: Late Modern English Medical Texts: Writing medicine in the eighteenth century / [ed] Irma Taavitsainen, Turo Hiltunen, Amsterdam: John Benjamins Publishing Company, 2019, p. 337-358. Chapter in book (Other academic)
Place, publisher, year, edition, pages
Amsterdam: John Benjamins Publishing Company, 2019
Keywords
corpus, manual, XML, annotation
National Category
Specific Languages
Research subject
Humanities, English; Humanities, Linguistics
Identifiers
urn:nbn:se:lnu:diva-90241 (URN); 10.1075/z.221 (DOI); 9789027203229 (ISBN)
Available from: 2019-11-23. Created: 2019-11-23. Last updated: 2019-11-23
Golub, K., Ahlström, I., Hansson, J. & Tyrkkö, J. (2019). Subject metadata for humanities journal articles: Indexing consistency between a local repository and an external bibliographic database. In: Presented at DCMI 2019: Metadata Innovation. Seoul, South Korea - September 23rd-26th, 2019. Paper presented at DCMI 2019: Metadata Innovation. Seoul, South Korea - September 23rd-26th, 2019. Dublin Core Metadata Initiative
2019 (English). In: Presented at DCMI 2019: Metadata Innovation. Seoul, South Korea - September 23rd-26th, 2019, Dublin Core Metadata Initiative, 2019. Conference paper, Oral presentation with published abstract (Refereed)
Place, publisher, year, edition, pages
Dublin Core Metadata Initiative, 2019
National Category
Information Studies
Research subject
Humanities, Library and Information Science
Identifiers
urn:nbn:se:lnu:diva-89738 (URN)
Conference
DCMI 2019: Metadata Innovation. Seoul, South Korea - September 23rd-26th, 2019
Available from: 2019-10-18. Created: 2019-10-18. Last updated: 2019-10-24. Bibliographically approved
Tyrkkö, J. (2019). Surgical and anatomical texts. In: Irma Taavitsainen, Turo Hiltunen (Ed.), Late Modern English Medical Texts: Writing medicine in the eighteenth century (pp. 299-306). Amsterdam: John Benjamins Publishing Company
2019 (English). In: Late Modern English Medical Texts: Writing medicine in the eighteenth century / [ed] Irma Taavitsainen, Turo Hiltunen, Amsterdam: John Benjamins Publishing Company, 2019, p. 299-306. Chapter in book (Refereed)
Place, publisher, year, edition, pages
Amsterdam: John Benjamins Publishing Company, 2019
Keywords
late modern, medicine, surgery, English, book history
National Category
Specific Languages
Research subject
Humanities, English; Humanities, History
Identifiers
urn:nbn:se:lnu:diva-90240 (URN); 10.1075/z.221.11.5.tyr (DOI); 9789027203229 (ISBN)
Available from: 2019-11-23. Created: 2019-11-23. Last updated: 2020-01-08
Tyrkkö, J. (2019). The symptom comes of age: Sign semantics from the Late Middle to the Late Modern English. In: Irma Taavitsainen, Turo Hiltunen (Ed.), Late Modern English Medical Texts: Writing medicine in the eighteenth century (pp. 199-227). Amsterdam: John Benjamins Publishing Company
2019 (English). In: Late Modern English Medical Texts: Writing medicine in the eighteenth century / [ed] Irma Taavitsainen, Turo Hiltunen, Amsterdam: John Benjamins Publishing Company, 2019, p. 199-227. Chapter in book (Refereed)
Abstract [en]

The observation, recognition and interpretation of signs of illness has been recognised as one of the cornerstones of medical practice since Hippocrates. Over the centuries, the repertoire of signifier terms used by medical writers has undergone numerous changes. This chapter traces the history of signifier terms used in English vernacular medicine from the Late Middle period to the Late Modern English period. Commenting on the frequencies of the individual terms as well as their many uses and changing meanings, this study demonstrates how a superficially generic lexical field can reflect changes in scientific thought style and index the individual authors’ intellectual and professional affiliations.

Place, publisher, year, edition, pages
Amsterdam: John Benjamins Publishing Company, 2019
Keywords
terminology, late modern, medicine, English
National Category
Specific Languages
Research subject
Humanities, English
Identifiers
urn:nbn:se:lnu:diva-90239 (URN); 10.1075/z.221.10tyr (DOI); 9789027203229 (ISBN)
Available from: 2019-11-23. Created: 2019-11-23. Last updated: 2020-01-08. Bibliographically approved
Alissandrakis, A. (2019). Visualizing rich corpus data using virtual reality. Studies in Variation, Contacts and Change in English, 20
2019 (English). In: Studies in Variation, Contacts and Change in English, ISSN 1797-4453, E-ISSN 1797-4453, Vol. 20. Article in journal (Refereed), Published
Abstract [en]

We demonstrate an approach that utilizes immersive virtual reality (VR) to explore and interact with corpus linguistics data. Our case study focuses on the language identification parameter in the Nordic Tweet Stream corpus, a dynamic corpus of Twitter data where each tweet originated within the Nordic countries. We demonstrate how VR can provide previously unexplored perspectives into the use of English and other non-indigenous languages in the Nordic countries alongside the native languages of the region and showcase its geospatial variation. We utilize a head-mounted display (HMD) for a room-scale VR scenario that allows 3D interaction by using hand gestures. In addition to spatial movement through the Nordic areas, the interface enables exploration of the Twitter data based on time (days, weeks, months, or time of predefined special events), making it particularly useful for diachronic investigations.

In addition to demonstrating how the VR methods aid data visualization and exploration, we briefly discuss the pedagogical implications of using VR to showcase linguistic diversity. Our empirical results detail students’ reactions to working in this environment. The discussion part examines the benefits, prospects and limitations of using VR in visualizing corpus data.

Place, publisher, year, edition, pages
Helsinki: VARIENG, 2019
Keywords
virtual reality, Nordic Tweet Stream, digital humanities, immersive analytics
National Category
Human Computer Interaction; Language Technology (Computational Linguistics); General Language Studies and Linguistics
Research subject
Computer and Information Sciences Computer Science; Computer and Information Sciences Computer Science, Computer Science; Computer Science, Information and software visualization; Humanities, Linguistics
Identifiers
urn:nbn:se:lnu:diva-90516 (URN)
Projects
DISA-DH; Open Data Exploration in Virtual Reality (ODxVR)
Available from: 2019-12-12. Created: 2019-12-12. Last updated: 2019-12-18. Bibliographically approved
Tyrkkö, J. & Limatius, H. (2019). "When did I do dangerous driving then?": Structures and functions of Formula One race radio messages. In: Marcus Callies, Magnus Levin (Ed.), Corpus Approaches to the Language of Sports: Texts, Media, Modalities (pp. 111-138). London: Bloomsbury Academic
2019 (English). In: Corpus Approaches to the Language of Sports: Texts, Media, Modalities / [ed] Marcus Callies, Magnus Levin, London: Bloomsbury Academic, 2019, p. 111-138. Chapter in book (Refereed)
Place, publisher, year, edition, pages
London: Bloomsbury Academic, 2019
Series
Corpus and Discourse
Keywords
complexity, stress, corpus, performance, sports, formula-1
National Category
General Language Studies and Linguistics
Research subject
Humanities, Linguistics
Identifiers
urn:nbn:se:lnu:diva-89771 (URN); 9781350088207 (ISBN); 9781350088221 (ISBN); 9781350088214 (ISBN)
Available from: 2019-10-22. Created: 2019-10-22. Last updated: 2020-01-08. Bibliographically approved
Identifiers
ORCID iD: orcid.org/0000-0001-5251-5338