lnu.sePublications
Change search
Link to record
Permanent link

Direct link
Publications (10 of 30) Show all publications
Golub, K., Hagelbäck, J. & Ardö, A. (2020). Automatic Classification of Swedish Metadata Using Dewey Decimal Classification: A Comparison of Approaches. Journal of Data and Information Science, 5(1), 18-38
Open this publication in new window or tab >>Automatic Classification of Swedish Metadata Using Dewey Decimal Classification: A Comparison of Approaches
2020 (English)In: Journal of Data and Information Science, ISSN 2096-157X, Vol. 5, no 1, p. 18-38Article in journal (Refereed) Published
Abstract [en]

Purpose

With more and more digital collections of various information resources becoming available, also increasing is the challenge of assigning subject index terms and classes from quality knowledge organization systems. While the ultimate purpose is to understand the value of automatically produced Dewey Decimal Classification (DDC) classes for Swedish digital collections, the paper aims to evaluate the performance of six machine learning algorithms as well as a string-matching algorithm based on characteristics of DDC.

Design/methodology/approach

State-of-the-art machine learning algorithms require at least 1,000 training examples per class. The complete data set at the time of research involved 143,838 records which had to be reduced to top three hierarchical levels of DDC in order to provide sufficient training data (totaling 802 classes in the training and testing sample, out of 14,413 classes at all levels).

Findings

Evaluation shows that Support Vector Machine with linear kernel outperforms other machine learning algorithms as well as the string-matching algorithm on average; the string-matching algorithm outperforms machine learning for specific classes when characteristics of DDC are most suitable for the task. Word embeddings combined with different types of neural networks (simple linear network, standard neural network, 1D convolutional neural network, and recurrent neural network) produced worse results than Support Vector Machine, but reach close results, with the benefit of a smaller representation size. Impact of features in machine learning shows that using keywords or combining titles and keywords gives better results than using only titles as input. Stemming only marginally improves the results. Removed stop-words reduced accuracy in most cases, while removing less frequent words increased it marginally. The greatest impact is produced by the number of training examples: 81.90% accuracy on the training set is achieved when at least 1,000 records per class are available in the training set, and 66.13% when too few records (often less than 100 per class) on which to train are available—and these hold only for top 3 hierarchical levels (803 instead of 14,413 classes).

Research limitations

Having to reduce the number of hierarchical levels to top three levels of DDC because of the lack of training data for all classes, skews the results so that they work in experimental conditions but barely for end users in operational retrieval systems.

Practical implications

In conclusion, for operative information retrieval systems applying purely automatic DDC does not work, either using machine learning (because of the lack of training data for the large number of DDC classes) or using string-matching algorithm (because DDC characteristics perform well for automatic classification only in a small number of classes). Over time, more training examples may become available, and DDC may be enriched with synonyms in order to enhance accuracy of automatic classification which may also benefit information retrieval performance based on DDC. In order for quality information services to reach the objective of highest possible precision and recall, automatic classification should never be implemented on its own; instead, machine-aided indexing that combines the efficiency of automatic suggestions with quality of human decisions at the final stage should be the way for the future.

Originality/value

The study explored machine learning on a large classification system of over 14,000 classes which is used in operational information retrieval systems. Due to lack of sufficient training data across the entire set of classes, an approach complementing machine learning, that of string matching, was applied. This combination should be explored further since it provides the potential for real-life applications with large target classification systems.

Place, publisher, year, edition, pages
National Science Library of Chinese Academy of Sciences, 2020
National Category
Information Studies
Research subject
Humanities, Library and Information Science
Identifiers
urn:nbn:se:lnu:diva-93444 (URN)10.2478/jdis-2020-0003 (DOI)000530065400002 ()2-s2.0-85085118766 (Scopus ID)
Available from: 2020-04-15 Created: 2020-04-15 Last updated: 2022-02-24Bibliographically approved
Jercic, P., Hagelbäck, J. & Lindley, C. (2019). An affective serious game for collaboration between humans and robots. Entertainment Computing, 32, 1-10, Article ID 100319.
Open this publication in new window or tab >>An affective serious game for collaboration between humans and robots
2019 (English)In: Entertainment Computing, ISSN 1875-9521, E-ISSN 1875-953X, Vol. 32, p. 1-10, article id 100319Article in journal (Refereed) Published
Abstract [en]

Elicited physiological affect in humans collaborating with their robot partners was investigated to determine its influence on decision-making performance in serious games. A turn-taking version of the Tower of Hanoi game was used, where physiological arousal and valence underlying such human-robot proximate collaboration were investigated. A comparable decision performance in the serious game was found between human and non-humanoid robot arm collaborator conditions, while higher physiological affect was found in humans collaborating with such robot collaborators. It is suggested that serious games which are carefully designed to take into consideration the elicited physiological arousal might witness a better decision-making performance and more positive valence using non-humanoid robot partners instead of human ones.

Place, publisher, year, edition, pages
Elsevier, 2019
Keywords
Autonomous robots, Serious games, Collaborative play, Robot-assisted play, Emotions, Physiology, ECG, GSR, Affect
National Category
Software Engineering
Research subject
Computer Science, Software Technology
Identifiers
urn:nbn:se:lnu:diva-91028 (URN)10.1016/j.entcom.2019.100319 (DOI)000504663900006 ()2-s2.0-85072574983 (Scopus ID)
Available from: 2020-01-20 Created: 2020-01-20 Last updated: 2020-12-14Bibliographically approved
Golub, K., Hagelbäck, J. & Ardö, A. (2019). Automatic classification Using DDC on the Swedish Union Catalogue. In: European DDC Users Group, EDUG, Annual Meeting 9-10 May 2019: National Library of Sweden, Stockholm, Sweden. Paper presented at European DDC Users Group, EDUG, Annual Meeting 9-10 May 2019: National Library of Sweden, Stockholm, Sweden.
Open this publication in new window or tab >>Automatic classification Using DDC on the Swedish Union Catalogue
2019 (English)In: European DDC Users Group, EDUG, Annual Meeting 9-10 May 2019: National Library of Sweden, Stockholm, Sweden, 2019Conference paper, Oral presentation only (Other academic)
National Category
Information Studies
Research subject
Humanities, Library and Information Science
Identifiers
urn:nbn:se:lnu:diva-84605 (URN)
Conference
European DDC Users Group, EDUG, Annual Meeting 9-10 May 2019: National Library of Sweden, Stockholm, Sweden
Available from: 2019-06-04 Created: 2019-06-04 Last updated: 2019-08-07Bibliographically approved
Golub, K., Hagelbäck, J. & Ardö, A. (2019). Automatic subject classification of Swedish DDC: Impact of tuning and training data set. In: 19th European NKOS Workshop, 23rd TPDL: Oslo, 12 September 2019. Paper presented at 19th European NKOS Workshop, 23rd TPDL. Oslo, 12 September 2019. Networked Knowledge Organization Systems/Services/Structures, NKOS
Open this publication in new window or tab >>Automatic subject classification of Swedish DDC: Impact of tuning and training data set
2019 (English)In: 19th European NKOS Workshop, 23rd TPDL: Oslo, 12 September 2019, Networked Knowledge Organization Systems/Services/Structures, NKOS , 2019Conference paper, Oral presentation with published abstract (Refereed)
Abstract [en]

The presentation builds on the NKOS 2018 presentation of automatically produced Dewey Decimal Classification (DDC) classes for Swedish union catalogue (LIBRIS). Based on a dataset of 143,838 records, Support Vector Machine with linear kernel outperforms Multinomial Naïve Bayes algorithm. Impact of features shows that using keywords or combining titles and keywords gives better results than using only titles as input. Stemming only marginally improves the results. Removed stop-words reduced accuracy in most cases, while removing less frequent words increased it marginally. Word embeddings combined with different types of neural networks (Simple linear network, Standard neural network, 1D convolutional neural network, Recurrent neural network) produced worse results than Naïve Bayes /Support Vector Machine, but reach close results. The greatest impact is produced by the number of training examples: 81.37% accuracy on the training set is achieved when at least 1,000 records per class are available, and 66.13% when few records on which to train are available.

Place, publisher, year, edition, pages
Networked Knowledge Organization Systems/Services/Structures, NKOS, 2019
National Category
Information Studies
Research subject
Humanities, Library and Information Science
Identifiers
urn:nbn:se:lnu:diva-89737 (URN)
Conference
19th European NKOS Workshop, 23rd TPDL. Oslo, 12 September 2019
Available from: 2019-10-18 Created: 2019-10-18 Last updated: 2020-01-08Bibliographically approved
Hagelbäck, J., Lincke, A., Löwe, W. & Rall, E. (2019). On the Agreement of Commodity 3D Cameras. In: Hamid R. Arabnia, Leonidas Deligiannidis, Fernando G. Tinetti (Ed.), Proceedings of the 2019 International Conference on Image Processing, Computer Vision, & Pattern Recognition: . Paper presented at 23rd International Conference on Image Processing, Computer Vision, & Pattern Recognition, July 29 - August 1, 2019, USA (pp. 36-42). CSREA Press
Open this publication in new window or tab >>On the Agreement of Commodity 3D Cameras
2019 (English)In: Proceedings of the 2019 International Conference on Image Processing, Computer Vision, & Pattern Recognition / [ed] Hamid R. Arabnia, Leonidas Deligiannidis, Fernando G. Tinetti, CSREA Press, 2019, p. 36-42Conference paper, Published paper (Refereed)
Abstract [en]

The advent of commodity 3D sensor technol- ogy has, amongst other things, enabled the efficient and effective assessment of human movements. Machine learning approaches do not rely manual definitions of gold standards for each new movement. However, to train models for the automated assessments of a new movement they still need a lot of data that map recorded movements to expert judg- ments. As camera technology changes, this training needs to be repeated if a new camera does not agree with the old one. The present paper presents an inexpensive method to check the agreement of cameras, which, in turn, would allow for a safe reuse of trained models regardless of the cameras. We apply the method to the Kinect, Astra Mini, and Real Sense cameras. The results show that these cameras do not agree and that the models cannot be reused without an unacceptable decay in accuracy. However, the suggested method works independent of movements and cameras and could potentially save effort when integrating new cameras in an existing assessment environment.

Place, publisher, year, edition, pages
CSREA Press, 2019
Keywords
3D camera agreement, human movement assessment
National Category
Computer Sciences
Research subject
Computer and Information Sciences Computer Science
Identifiers
urn:nbn:se:lnu:diva-89180 (URN)1-60132-506-1 (ISBN)
Conference
23rd International Conference on Image Processing, Computer Vision, & Pattern Recognition, July 29 - August 1, 2019, USA
Available from: 2019-09-18 Created: 2019-09-18 Last updated: 2020-11-26Bibliographically approved
Kastrati, Z., Kurti, A. & Hagelbäck, J. (2019). The Effect of a Flipped Classroom in a SPOC: Students' Perceptions and Attitudes. In: ICETC 2019: Proceedings of the 2019 11th International Conference on Education Technology and Computers: . Paper presented at 11th International Conference on Education Technology and Computers, October 2019, Amsterdam (pp. 246-249). ACM Publications
Open this publication in new window or tab >>The Effect of a Flipped Classroom in a SPOC: Students' Perceptions and Attitudes
2019 (English)In: ICETC 2019: Proceedings of the 2019 11th International Conference on Education Technology and Computers, ACM Publications, 2019, p. 246-249Conference paper, Published paper (Refereed)
Abstract [en]

The advent of Massive Open Online Courses (MOOCs) and Small Private Online Courses (SPOCs) has brought opportunities to higher education institutions. Despite this, one of the main drawbacks of MOOCs and SPOCs has been relatively low retention rate of the registered students. Having this in mind in this paper we report our research efforts with a SPOC on Applied Machine Learning specifically tailored for professional students. More concretely, we report our findings with regard to the effects of the flipped classroom approach on the students' perceptions and attitudes. The initial results show that flipping the class had direct effects on students' knowledge and skills compared to a fully online class setting. These findings have offered complementary explanations of the survey regression analysis which revealed that course structure/instructional approach followed by course content are the main drivers in accounting for the variance in students' overall perceptions of the course.

Place, publisher, year, edition, pages
ACM Publications, 2019
Series
International conference proceeding series (ICPS)
Keywords
Students' perceptions, Course effectiveness, Flipped classroom, SPOC
National Category
Other Computer and Information Science Didactics
Research subject
Computer and Information Sciences Computer Science, Computer Science
Identifiers
urn:nbn:se:lnu:diva-91129 (URN)10.1145/3369255.3369304 (DOI)2-s2.0-85079079495 (Scopus ID)978-1-4503-7254-1 (ISBN)
Conference
11th International Conference on Education Technology and Computers, October 2019, Amsterdam
Available from: 2020-01-22 Created: 2020-01-22 Last updated: 2021-02-04Bibliographically approved
Hagelbäck, J., Liapota, P., Lincke, A. & Löwe, W. (2019). The performance of some machine learning approaches in human movement assessment. In: Mário Macedo, L. Rodrigues (Ed.), 13th Multi Conference on Computer Science and Information Systems (MCCSIS): . Paper presented at 11th International Conference e-Health 2019, 17-19 July, Porto, Portugal (pp. 35-42). Porto, Portugal: IADIS Press
Open this publication in new window or tab >>The performance of some machine learning approaches in human movement assessment
2019 (English)In: 13th Multi Conference on Computer Science and Information Systems (MCCSIS) / [ed] Mário Macedo, L. Rodrigues, Porto, Portugal: IADIS Press, 2019, p. 35-42Conference paper, Published paper (Refereed)
Abstract [en]

The advent of commodity 3D sensor technology enabled, amongst other things, the efficient and effective assessment of human movements. Statistical and machine learning approaches map recorded movement instances to expert scores to train models for the automated assessment of new movements. However, there are many variations in selecting the approaches and setting the parameters for achieving high performance, i.e., high accuracy and low response time. The present paper researches the design space and the impact of approaches of statistical and machine learning on accuracy and response time in human movement assessment. Results show that a random forest regression approach outperforms linear regression, support vector regression and neuronal network approaches. Since the results do not rely on the movement specifics, they can help improving the performance of automated human movement assessment, in general.

Place, publisher, year, edition, pages
Porto, Portugal: IADIS Press, 2019
Keywords
Human Movement Assessment, Machine Learning, Statistical, Decision trees, Learning systems, Neurons, Regression analysis, Automated assessment, Human movements, Machine learning approaches, Neuronal networks, Paper research, Random forests, Support vector regression (SVR)
National Category
Computer Sciences Other Health Sciences
Research subject
Computer and Information Sciences Computer Science, Computer Science; Health and Caring Sciences
Identifiers
urn:nbn:se:lnu:diva-92121 (URN)10.33965/eh2019_201910l005 (DOI)2-s2.0-85073157836 (Scopus ID)978-989-8533-89-0 (ISBN)
Conference
11th International Conference e-Health 2019, 17-19 July, Porto, Portugal
Available from: 2020-02-17 Created: 2020-02-17 Last updated: 2022-07-14Bibliographically approved
Hagelbäck, J., Liapota, P., Lincke, A. & Löwe, W. (2019). Variants of Dynamic Time Warping and their Performance in Human Movement Assessment. In: 21st International Conference on Artificial Intelligence (ICAI'19: July 29 - August 1, 2019, Las Vegas, USA): . Paper presented at 21st International Conference on Artificial Intelligence, ICAI'19: July 29 - August 1, 2019, Las Vegas, USA (pp. 9-15). CSREA Press
Open this publication in new window or tab >>Variants of Dynamic Time Warping and their Performance in Human Movement Assessment
2019 (English)In: 21st International Conference on Artificial Intelligence (ICAI'19: July 29 - August 1, 2019, Las Vegas, USA), CSREA Press, 2019, p. 9-15Conference paper, Published paper (Refereed)
Abstract [en]

The advent of commodity 3D sensor technology enabled, amongst other things, the efficient and effective assessment of human movements. Statistical and machine learning approaches map recorded movement instances to expert scores to train models for the automated assessment of new movements. However, there are many variations in selecting the approaches and setting the parameters for achieving good performance, i.e., high scoring accuracy and low response time. The present paper researches the design space and the impact of sequence alignment on accuracy and response time. More specifically, we introduce variants of Dynamic Time Warping (DTW) for aligning the phases of slow and fast movement instances and assess their effect on the scoring accuracy and response time. Results show that an automated stripping of leading and trailing frames not belonging to the movement (using one DTW variant) followed by an alignment of selected frames in the movements (based on another DTW variant) outperforms the original DTW and other suggested variants thereof. Since these results are independent of the selected learning approach and do not rely on the movement specifics, the results can help improving the performance of automated human movement assessment, in general.

Place, publisher, year, edition, pages
CSREA Press, 2019
Keywords
Dynamic Time Warping variants, human movement assessment
National Category
Computer Sciences
Research subject
Computer and Information Sciences Computer Science
Identifiers
urn:nbn:se:lnu:diva-89181 (URN)1-60132-501-0 (ISBN)
Conference
21st International Conference on Artificial Intelligence, ICAI'19: July 29 - August 1, 2019, Las Vegas, USA
Available from: 2019-09-18 Created: 2019-09-18 Last updated: 2020-05-08Bibliographically approved
Golub, K., Hagelbäck, J. & Ardö, A. (2018). Automatic classification using DDC on the Swedish Union Catalogue. In: Philipp Mayr, Douglas Tudhope, Joseph Busch, Koraljka Golub, Marjorie Hlava & Marcia Zeng (Ed.), Proceedings of the 18th European Networked Knowledge Organization Systems (NKOS 2018) Workshop, Porto, Portugal, September 13, 2018: . Paper presented at 18th European Networked Knowledge Organization Systems Workshop (NKOS 2018), Porto, Portugal, September 13, 2018 (pp. 4-16). CEUR-WS.org
Open this publication in new window or tab >>Automatic classification using DDC on the Swedish Union Catalogue
2018 (English)In: Proceedings of the 18th European Networked Knowledge Organization Systems (NKOS 2018) Workshop, Porto, Portugal, September 13, 2018 / [ed] Philipp Mayr, Douglas Tudhope, Joseph Busch, Koraljka Golub, Marjorie Hlava & Marcia Zeng, CEUR-WS.org , 2018, p. 4-16Conference paper, Published paper (Refereed)
Abstract [en]

With more and more digital collections of various information re- sources becoming available, also increasing is the challenge of assigning subject index terms and classes from quality knowledge organization systems. While the ultimate purpose is to understand the value of automatically produced Dewey Decimal Classification (DDC) classes for Swedish digital collections, the paper aims to evaluate the performance of two machine learning algorithms for Swe- dish catalogue records from the Swedish union catalogue (LIBRIS). The algo- rithms are tested on the top three hierarchical levels of the DDC. Based on a data set of 143,838 records, evaluation shows that Support Vector Machine with linear kernel outperforms Multinomial Naïve Bayes algorithm. Also, using keywords or combining titles and keywords gives better results than using only titles as input. The class imbalance where many DDC classes only have few records greatly affects classification performance: 81.37% accuracy on the training set is achieved when at least 1,000 records per class are available, and 66.13% when few records on which to train are available. Proposed future research involves an exploration of the intellectual effort put into creating the DDC to further improve the algorithm performance as commonly applied in string matching, and to test the best approach on new digital collections that do not have DDC assigned.

Place, publisher, year, edition, pages
CEUR-WS.org, 2018
Series
CEUR Workshop Proceedings, ISSN 1613-0073 ; 2200
Keywords
LIBRIS, Dewey Decimal Classification, automatic classification, machine learning, Support Vector Machine, Multinomial Naïve Bayes, subject access
National Category
Information Studies
Research subject
Humanities, Library and Information Science
Identifiers
urn:nbn:se:lnu:diva-78378 (URN)2-s2.0-85053933816 (Scopus ID)
Conference
18th European Networked Knowledge Organization Systems Workshop (NKOS 2018), Porto, Portugal, September 13, 2018
Available from: 2018-10-19 Created: 2018-10-19 Last updated: 2020-03-16Bibliographically approved
Qureshi, S., Hagelbäck, J., Iqbal, S. M., Javaid, H. & Lindley, C. (2018). Evaluation of Classifiers for Emotion Detection while Performing Physical and Visual Tasks: Tower of Hanoi and IAPS. In: Kohei Arai, Supriya Kapoor, Rahul Bhatia (Ed.), Intelligent Systems and Applications. IntelliSys 2018: Proceedings of the 2018 Intelligent Systems Conference (IntelliSys) Volume 1. Paper presented at Intelligent Systems Conference (IntelliSys), 6-7 September, 2018, London (pp. 347-363). Springer
Open this publication in new window or tab >>Evaluation of Classifiers for Emotion Detection while Performing Physical and Visual Tasks: Tower of Hanoi and IAPS
Show others...
2018 (English)In: Intelligent Systems and Applications. IntelliSys 2018: Proceedings of the 2018 Intelligent Systems Conference (IntelliSys) Volume 1 / [ed] Kohei Arai, Supriya Kapoor, Rahul Bhatia, Springer, 2018, p. 347-363Conference paper, Published paper (Refereed)
Abstract [en]

With the advancement in robot technology, smart human-robot interaction is of increasing importance for allowing the more excellent use of robots integrated into human environments and activities. If a robot can identify emotions and intentions of a human interacting with it, interactions with humans can potentially become more natural and effective. However, mechanisms of perception and empathy used by humans to achieve this understanding may not be suitable or adequate for use within robots. Electroencephalography (EEG) can be used for recording signals revealing emotions and motivations from a human brain. This study aimed to evaluate different machine learning techniques to classify EEG data associated with specific affective/emotional states. For experimental purposes, we used visual (IAPS) and physical (Tower of Hanoi) tasks to record human emotional states in the form of EEG data. The obtained EEG data processed, formatted and evaluated using various machine learning techniques to find out which method can most accurately classify EEG data according to associated affective/emotional states. The experiment confirms the choice of a method for improving the accuracy of results. According to the results, Support Vector Machine was the first, and Regression Tree was the second best method for classifying EEG data associated with specific affective/emotional states with accuracies up to 70.00% and 60.00%, respectively. In both tasks, SVM was better in performance than RT. 

Place, publisher, year, edition, pages
Springer, 2018
Series
Advances in Intelligent Systems and Computing, ISSN 2194-5357, E-ISSN 2194-5365 ; 868
Keywords
K-Nearest Neighbor (KNN), Regression Tree (RT), Bayesian Network (BNT), Support Vector Machine (SVM), Artificial Neural Networks (ANN), Tower of Hanoi (ToH), Cognitive Psychology, Human Computer Interaction (HCI), Electroencephalography (EEG)
National Category
Computer Sciences
Research subject
Computer and Information Sciences Computer Science, Computer Science
Identifiers
urn:nbn:se:lnu:diva-78544 (URN)10.1007/978-3-030-01054-6_25 (DOI)000591525600025 ()2-s2.0-85057084220 (Scopus ID)978-3-030-01053-9 (ISBN)978-3-030-01054-6 (ISBN)
Conference
Intelligent Systems Conference (IntelliSys), 6-7 September, 2018, London
Available from: 2018-10-30 Created: 2018-10-30 Last updated: 2021-01-15Bibliographically approved
Organisations
Identifiers
ORCID iD: ORCID iD iconorcid.org/0000-0002-8591-1035

Search in DiVA

Show all publications