lnu.sePublications
Change search
Link to record
Permanent link

Direct link
Martins, Rafael Messias, Dr.ORCID iD iconorcid.org/0000-0002-2901-935X
Publications (10 of 54) Show all publications
Hilasaca, G. M., Marcílio-Jr, W. E., Eler, D. M., Martins, R. M. & Paulovich, F. V. (2024). A Grid-based Method for Removing Overlaps of Dimensionality Reduction Scatterplot Layouts. IEEE Transactions on Visualization and Computer Graphics, 30(8), 5733-5749
Open this publication in new window or tab >>A Grid-based Method for Removing Overlaps of Dimensionality Reduction Scatterplot Layouts
Show others...
2024 (English)In: IEEE Transactions on Visualization and Computer Graphics, ISSN 1077-2626, E-ISSN 1941-0506, Vol. 30, no 8, p. 5733-5749Article in journal (Refereed) Published
Abstract [en]

Dimensionality Reduction (DR) scatterplot layouts have become a ubiquitous visualization tool for analyzing multidimensional datasets. Despite their popularity, such scatterplots suffer from occlusion, especially when informative glyphs are used to represent data instances, potentially obfuscating critical information for the analysis under execution. Different strategies have been devised to address this issue, either producing overlap-free layouts that lack the powerful capabilities of contemporary DR techniques in uncovering interesting data patterns or eliminating overlaps as a post-processing strategy. Despite the good results of post-processing techniques, most of the best methods typically expand or distort the scatterplot area, thus reducing glyphs’ size (sometimes) to unreadable dimensions, defeating the purpose of removing overlaps. This article presents Distance Grid (DGrid) , a novel post-processing strategy to remove overlaps from DR layouts that faithfully preserves the original layout's characteristics and bounds the minimum glyph sizes. We show that DGrid surpasses the state-of-the-art in overlap removal (through an extensive comparative evaluation considering multiple different metrics) while also being one of the fastest techniques, especially for large datasets. A user study with 51 participants also shows that DGrid is consistently ranked among the top techniques for preserving the original scatterplots’ visual characteristics and the aesthetics of the final results.

Place, publisher, year, edition, pages
IEEE, 2024
National Category
Computer Sciences
Research subject
Computer and Information Sciences Computer Science, Computer Science
Identifiers
urn:nbn:se:lnu:diva-126406 (URN)10.1109/tvcg.2023.3309941 (DOI)001262914400072 ()2-s2.0-85169671474 (Scopus ID)
Available from: 2024-01-11 Created: 2024-01-11 Last updated: 2024-08-15Bibliographically approved
Mohseni, Z., Masiello, I. & Martins, R. M. (2024). A technical infrastructure for primary education data that contributes to data standardization. Education and Information Technologies: Official Journal of the IFIP technical committee on Education, 29, 21045-21061
Open this publication in new window or tab >>A technical infrastructure for primary education data that contributes to data standardization
2024 (English)In: Education and Information Technologies: Official Journal of the IFIP technical committee on Education, ISSN 1360-2357, E-ISSN 1573-7608, Vol. 29, p. 21045-21061Article in journal (Refereed) Published
Abstract [en]

There is a significant amount of data available about students and their learning activities in many educational systems today. However, these datasets are frequently spread across several different digital services, making it challenging to use them strategically. In addition, there are no established standards for collecting, processing, analyzing, and presenting such data. As a result, school leaders, teachers, and students do not capitalize on the possibility of making decisions based on data. This is a serious barrier to the improvement of work in schools, teacher and student progress, and the development of effective Educational Technology (EdTech) products and services. Data standards can be used as a protocol on how different IT systems communicate with each other. When working with data from different public and private institutions simultaneously (e.g., different municipalities and EdTech companies), having a trustworthy data pipeline for retrieving the data and storing it in a secure warehouse is critical. In this study, we propose a technical solution containing a data pipeline by employing a secure warehouse—the Swedish University Computer Network (SUNET), which is an interface for information exchange between operational processes in schools. We conducted a user study in collaboration with four municipalities and four EdTech companies based in Sweden. Our proposal involves introducing a data standard to facilitate the integration of educational data from diverse resources in our SUNET drive. To accomplish this, we created customized scripts for each stakeholder, tailored to their specific data formats, with the aim of merging the students’ data. The results of the first four steps show that our solution works. Once the results of the next three steps are in, we will contemplate scaling up our technical solution nationwide. With the implementation of the suggested data standard and the utilization of the proposed technical solution, diverse stakeholders can benefit from improved management, transportation, analysis, and visualization of educational data.

Place, publisher, year, edition, pages
Springer, 2024
Keywords
Data standard, Data pipeline, Secure data pipeline, Educational data, Primary education, Technical infrastructure, SUNET drive
National Category
Information Systems, Social aspects
Research subject
Computer and Information Sciences Computer Science, Information Systems
Identifiers
urn:nbn:se:lnu:diva-129073 (URN)10.1007/s10639-024-12683-2 (DOI)001208963600002 ()2-s2.0-85191712850 (Scopus ID)
Funder
Linnaeus University
Available from: 2024-04-28 Created: 2024-04-28 Last updated: 2024-12-10Bibliographically approved
Chatzimparmpas, A., Martins, R. M., Telea, A. C. & Kerren, A. (2024). DeforestVis: Behavior Analysis of Machine Learning Models with Surrogate Decision Stumps. Computer graphics forum (Print), 43(6), Article ID e15004.
Open this publication in new window or tab >>DeforestVis: Behavior Analysis of Machine Learning Models with Surrogate Decision Stumps
2024 (English)In: Computer graphics forum (Print), ISSN 0167-7055, E-ISSN 1467-8659, Vol. 43, no 6, article id e15004Article in journal (Refereed) Published
Abstract [en]

As the complexity of Machine Learning (ML) models increases and their application in different (and critical) domains grows, there is a strong demand for more interpretable and trustworthy ML. A direct, model-agnostic, way to interpret such models is to train surrogate models—such as rule sets and decision trees—that sufficiently approximate the original ones while being simpler and easier-to-explain. Yet, rule sets can become very lengthy, with many if-else statements, and decision tree depth grows rapidly when accurately emulating complex ML models. In such cases, both approaches can fail to meet their core goal—providing users with model interpretability. To tackle this, we propose DeforestVis, a visual analytics tool that offers summarization of the behavior of complex ML models by providing surrogate decision stumps (one-level decision trees) generated with the Adaptive Boosting (AdaBoost) technique. DeforestVis helps users to explore the complexity vs fidelity trade-off by incrementally generating more stumps, creating attribute-based explanations with weighted stumps to justify decision making, and analyzing the impact of rule overriding on training instance allocation between one or more stumps. An independent test set allows users to monitor the effectiveness of manual rule changes and form hypotheses based on case-by-case analyses. We show the applicability and usefulness of DeforestVis with two use cases and expert interviews with data analysts and model developers.

Place, publisher, year, edition, pages
John Wiley & Sons, 2024
Keywords
Surrogate model, model understanding, adaptive boosting, machine learning, visual analytics
National Category
Computer Sciences
Research subject
Computer Science, Information and software visualization
Identifiers
urn:nbn:se:lnu:diva-127909 (URN)10.1111/cgf.15004 (DOI)001174196500001 ()2-s2.0-85185930256 (Scopus ID)
Funder
ELLIIT - The Linköping‐Lund Initiative on IT and Mobile Communications
Available from: 2024-02-20 Created: 2024-02-20 Last updated: 2024-10-17Bibliographically approved
Mohseni, Z., Masiello, I., Martins, R. M. & Nordmark, S. (2024). Visual Learning Analytics for Educational Interventions in Primary and Secondary Schools: A Scoping Review. Journal of Learning Analytics, 11(2), 91-111
Open this publication in new window or tab >>Visual Learning Analytics for Educational Interventions in Primary and Secondary Schools: A Scoping Review
2024 (English)In: Journal of Learning Analytics, ISSN 1929-7750, Vol. 11, no 2, p. 91-111Article in journal (Refereed) Published
Abstract [en]

Visual Learning Analytics (VLA) uses analytics to monitor and assess educational data by combining visual and automated analysis to provide educational explanations. Such tools could aid teachers in primary and secondary schools in making pedagogical decisions, however, the evidence of their effectiveness and benefits is still limited. With this scoping review, we provide a comprehensive overview of related research on proposed VLA methods, as well as identifying any gaps in the literature that could assist in describing new and helpful directions to the field. This review searched all relevant articles in five accessible databases — Scopus, Web of Science, ERIC, ACM, and IEEE Xplore — using 40 keywords. These studies were mapped, categorized, and summarized based on their objectives, the collected data, the intervention approaches employed, and the results obtained. The results determined what affordances the VLA tools allowed, what kind of visualizations were used to inform teachers and students, and, more importantly, positive evidence of educational interventions. We conclude that there are moderate-to-clear learning improvements within the limit of the studies’ interventions to support the use of VLA tools. More systematic research is needed to determine whether any learning gains are translated into long-term improvements.

Place, publisher, year, edition, pages
Society for Learning Analytics Research (SoLAR), 2024
Keywords
Visual learning analytics, Learning analytics dashboard, Educational interventions, Primary school, Secondary school, Scoping review, Systematic review
National Category
Computer and Information Sciences Pedagogy
Research subject
Computer and Information Sciences Computer Science
Identifiers
urn:nbn:se:lnu:diva-131233 (URN)10.18608/jla.2024.8309 (DOI)001295934400006 ()2-s2.0-85202576381 (Scopus ID)
Available from: 2024-07-01 Created: 2024-07-01 Last updated: 2024-09-12Bibliographically approved
Mohseni, Z., Masiello, I. & Martins, R. M. (2023). Co-Developing an Easy-to-Use Learning Analytics Dashboard for Teachers in Primary/Secondary Education: A Human-Centered Design Approach. Education Sciences, 13(12), Article ID 1190.
Open this publication in new window or tab >>Co-Developing an Easy-to-Use Learning Analytics Dashboard for Teachers in Primary/Secondary Education: A Human-Centered Design Approach
2023 (English)In: Education Sciences, E-ISSN 2227-7102, Vol. 13, no 12, article id 1190Article in journal (Refereed) Published
Abstract [en]

Learning Analytics Dashboards (LADs) can help provide insights and inform pedagogical decisions by supporting the analysis of large amounts of educational data, obtained from sources such as Digital Learning Materials (DLMs). Extracting requirements is a crucial step in developing a LAD, as it helps identify the underlying design problem that needs to be addressed. In fact, determining the problem that requires a solution is one of the primary objectives of requirements extraction. Although there have been studies on the development of LADs for K12 education, these studies have not specifically emphasized the use of a Human-Centered Design (HCD) approach to better comprehend the teachers’ requirements and produce more stimulating insights. In this paper we apply prototyping, which is widely acknowledged as a successful way for rapidly implementing cost-effective designs and efficiently gathering stakeholder feedback, to elicit such requirements. We present a three-step HCD approach, involving a design cycle that employs paper and interactive prototypes to guide the systematic and effective design of LADs that truly meet teacher requirements in primary/secondary education, actively engaging them in the design process. We then conducted interviews and usability testing to co-design and develop a LAD that can be used in classroom’s everyday learning activities. Our results show that the visualizations of the interactive prototype were easily interpreted by the participants, verifying our initial goal of co-developing an easy-to-use LAD.

Place, publisher, year, edition, pages
MDPI, 2023
Keywords
learning analytics dashboard; human-centered design; paper prototype; interactive prototype; usability test; K12; educational data
National Category
Computer Systems
Research subject
Computer Science, Information and software visualization
Identifiers
urn:nbn:se:lnu:diva-125814 (URN)10.3390/educsci13121190 (DOI)001130760800001 ()2-s2.0-85180651196 (Scopus ID)
Available from: 2023-11-29 Created: 2023-11-29 Last updated: 2024-08-20Bibliographically approved
van den Elzen, S., Andrienko, G., Andrienko, N., Fisher, B. D., Martins, R. M., Peltonen, J., . . . Verleysen, M. (2023). The Flow of Trust: A Visualization Framework to Externalize, Explore, and Explain Trust in ML Applications. IEEE Computer Graphics and Applications, 43(2), 78-88
Open this publication in new window or tab >>The Flow of Trust: A Visualization Framework to Externalize, Explore, and Explain Trust in ML Applications
Show others...
2023 (English)In: IEEE Computer Graphics and Applications, ISSN 0272-1716, E-ISSN 1558-1756, Vol. 43, no 2, p. 78-88Article in journal (Refereed) Published
Abstract [en]

We present a conceptual framework for the development of visual interactive techniques to formalize and externalize trust in machine learning (ML) workflows. Currently, trust in ML applications is an implicit process that takes place in the user-s mind. As such, there is no method of feedback or communication of trust that can be acted upon. Our framework will be instrumental in developing interactive visualization approaches that will help users to efficiently and effectively build and communicate trust in ways that fit each of the ML process stages. We formulate several research questions and directions that include: 1) a typology/taxonomy of trust objects, trust issues, and possible reasons for (mis)trust; 2) formalisms to represent trust in machine-readable form; 3) means by which users can express their state of trust by interacting with a computer system (e.g., text, drawing, marking); 4) ways in which a system can facilitate users- expression and communication of the state of trust; and 5) creation of visual interactive techniques for representation and exploration of trust over all stages of an ML pipeline.

Place, publisher, year, edition, pages
IEEE Computer Society, 2023
Keywords
Conceptual frameworks, Interactive techniques, Interactive visualizations, Learning process, Machine learning applications, Machine-learning, Process stages, Research questions, Visualization framework, Work-flows, Visualization
National Category
Computer Sciences
Research subject
Computer and Information Sciences Computer Science, Computer Science
Identifiers
urn:nbn:se:lnu:diva-123752 (URN)10.1109/MCG.2023.3237286 (DOI)001186626100012 ()2-s2.0-85151377641 (Scopus ID)
Available from: 2023-08-16 Created: 2023-08-16 Last updated: 2024-04-23Bibliographically approved
Mohseni, Z., Masiello, I. & Martins, R. M. (2023). Towards a Teacher-Oriented Framework of Visual Learning Analytics by Scenario-Based Development. In: Daniele Di Mitri, Alejandro Ortega-Arranz, Oleksandra Poquet (Ed.), DCECTEL 2023 Doctoral Consortium of ECTEL 2023: Proceedings of the Doctoral Consortium of the 18th European Conference on Technology Enhanced Learning (DCECTEL 2023)co-located with the 18th European Conference on Technology Enhanced Learning (EC-TEL 2023)Aveiro, Portugal, 4-8 September 2023. Paper presented at Doctoral Consortium of the 18th European Conference on Technology Enhanced Learning (DCECTEL 2023), Aveiro, Portugal, 4-8 September 2023 (pp. 12-17). Technical University of Aachen
Open this publication in new window or tab >>Towards a Teacher-Oriented Framework of Visual Learning Analytics by Scenario-Based Development
2023 (English)In: DCECTEL 2023 Doctoral Consortium of ECTEL 2023: Proceedings of the Doctoral Consortium of the 18th European Conference on Technology Enhanced Learning (DCECTEL 2023)co-located with the 18th European Conference on Technology Enhanced Learning (EC-TEL 2023)Aveiro, Portugal, 4-8 September 2023 / [ed] Daniele Di Mitri, Alejandro Ortega-Arranz, Oleksandra Poquet, Technical University of Aachen , 2023, , p. 6p. 12-17Conference paper, Published paper (Refereed)
Abstract [en]

Visual Learning Analytics (VLA) tools (such as dashboards) serve as a centralized hub for monitoring and analyzing educational data. Dashboards can assist teachers in data-informed pedagogical decision-making and/or students in following their own learning progress. However, the design of VLA tools should include features of trust in order to make analytics overt among its users. In this study, we propose a framework for the development of VLA tools from beginning to end that describes how we intend to develop the digital and technical infrastructure in our project for teachers. With that aim, we offer one scenario describing how data is managed, transferred, analyzed, and visualized by teachers. The suggested framework intends to make it easier for developers to understand the various steps involved in co-designing and developing a reliable VLA tool and to comprehend the importance of the teacher’s participation in design. VLA tools developed based on the proposed framework have the potential to assist teachers in understanding and analyzing educational data, monitoring students’ learning paths based on their learning outcomes and activities, simplifying regular tasks, and giving teachers more time to support teaching/learning and growth.

Place, publisher, year, edition, pages
Technical University of Aachen, 2023. p. 6
Series
CEUR Workshop Proceedings, E-ISSN 1613-0073 ; 3539
Keywords
Visual Learning Analytics Tool, Scenario-based Development, VLA Development Framework, Educational Data
National Category
Electrical Engineering, Electronic Engineering, Information Engineering
Research subject
Computer and Information Sciences Computer Science
Identifiers
urn:nbn:se:lnu:diva-125545 (URN)2-s2.0-85178303190 (Scopus ID)
Conference
Doctoral Consortium of the 18th European Conference on Technology Enhanced Learning (DCECTEL 2023), Aveiro, Portugal, 4-8 September 2023
Available from: 2023-11-08 Created: 2023-11-08 Last updated: 2024-06-05Bibliographically approved
Chatzimparmpas, A., Martins, R. M. & Kerren, A. (2023). VisRuler: Visual Analytics for Extracting Decision Rules from Bagged and Boosted Decision Trees. Information Visualization, 22(2), 115-139
Open this publication in new window or tab >>VisRuler: Visual Analytics for Extracting Decision Rules from Bagged and Boosted Decision Trees
2023 (English)In: Information Visualization, ISSN 1473-8716, E-ISSN 1473-8724, Vol. 22, no 2, p. 115-139Article in journal (Refereed) Published
Abstract [en]

Bagging and boosting are two popular ensemble methods in machine learning (ML) that produce many individual decision trees. Due to the inherent ensemble characteristic of these methods, they typically outperform single decision trees or other ML models in predictive performance. However, numerous decision paths are generated for each decision tree, increasing the overall complexity of the model and hindering its use in domains that require trustworthy and explainable decisions, such as finance, social care, and health care. Thus, the interpretability of bagging and boosting algorithms—such as random forest and adaptive boosting—reduces as the number of decisions rises. In this paper, we propose a visual analytics tool that aims to assist users in extracting decisions from such ML models via a thorough visual inspection workflow that includes selecting a set of robust and diverse models (originating from different ensemble learning algorithms), choosing important features according to their global contribution, and deciding which decisions are essential for global explanation (or locally, for specific cases). The outcome is a final decision based on the class agreement of several models and the explored manual decisions exported by users. We evaluated the applicability and effectiveness of VisRuler via a use case, a usage scenario, and a user study. The evaluation revealed that most users managed to successfully use our system to explore decision rules visually, performing the proposed tasks and answering the given questions in a satisfying way.

Place, publisher, year, edition, pages
Sage Publications, 2023
Keywords
Decisions evaluation, rules interpretation, ensemble learning, visual analytics, visualization
National Category
Computer Sciences Human Computer Interaction
Research subject
Computer Science, Information and software visualization
Identifiers
urn:nbn:se:lnu:diva-117897 (URN)10.1177/14738716221142005 (DOI)000916155000001 ()2-s2.0-85144842887 (Scopus ID)
Funder
ELLIIT - The Linköping‐Lund Initiative on IT and Mobile Communications
Available from: 2022-12-13 Created: 2022-12-13 Last updated: 2023-03-06Bibliographically approved
Neves, T. T., Martins, R. M., Coimbra, D. B., Kucher, K., Kerren, A. & Paulovich, F. V. (2022). Fast and Reliable Incremental Dimensionality Reduction for Streaming Data. Computers & graphics, 102, 233-244
Open this publication in new window or tab >>Fast and Reliable Incremental Dimensionality Reduction for Streaming Data
Show others...
2022 (English)In: Computers & graphics, ISSN 0097-8493, E-ISSN 1873-7684, Vol. 102, p. 233-244Article in journal (Refereed) Published
Abstract [en]

Streaming data applications are becoming more common due to the ability ofdifferent information sources to continuously capture or produce data, such as sensors and social media. Although there are recent advances, most visualization approaches, particularly Dimensionality Reduction (DR) techniques, cannot be directly applied in such scenarios due to the transient nature of streaming data. A few DR methods currently address this limitation using online or incremental strategies, continuously updating the visualization as data is received. Despite their relative success, most impose the need to store and access the data multiple times to produce a complete projection, not being appropriate for streaming where data continuously grow. Others do not impose such requirements but cannot update the position of the data already projected, potentially resulting in visual artifacts. This paper presents Xtreaming, a novel incremental DR technique that continuously updates the visual representation to reflect new emerging structures or patterns without visiting the high-dimensional data more than once. Our tests show that in streaming scenarios where data is not fully stored in-memory, Xtreaming is competitive in terms of quality compared to other streaming and incremental techniques while being orders of magnitude faster.

Place, publisher, year, edition, pages
Elsevier, 2022
Keywords
Incremental Dimensionality Reduction, Streaming Dimensionality Reduction, Multidimensional Projection, Visualization, Visual Analytics
National Category
Computer Sciences
Research subject
Computer Science, Information and software visualization
Identifiers
urn:nbn:se:lnu:diva-106220 (URN)10.1016/j.cag.2021.08.009 (DOI)000802242700015 ()2-s2.0-85114639865 (Scopus ID)2021 (Local ID)2021 (Archive number)2021 (OAI)
Funder
ELLIIT - The Linköping‐Lund Initiative on IT and Mobile Communications
Available from: 2021-08-23 Created: 2021-08-23 Last updated: 2023-05-02Bibliographically approved
Chatzimparmpas, A., Martins, R. M., Kucher, K. & Kerren, A. (2022). FeatureEnVi: Visual Analytics for Feature Engineering Using Stepwise Selection and Semi-Automatic Extraction Approaches. IEEE Transactions on Visualization and Computer Graphics, 28(4), 1773-1791
Open this publication in new window or tab >>FeatureEnVi: Visual Analytics for Feature Engineering Using Stepwise Selection and Semi-Automatic Extraction Approaches
2022 (English)In: IEEE Transactions on Visualization and Computer Graphics, ISSN 1077-2626, E-ISSN 1941-0506, Vol. 28, no 4, p. 1773-1791Article in journal (Refereed) Published
Abstract [en]

The machine learning (ML) life cycle involves a series of iterative steps, from the effective gathering and preparation of the data—including complex feature engineering processes—to the presentation and improvement of results, with various algorithms to choose from in every step. Feature engineering in particular can be very beneficial for ML, leading to numerous improvements such as boosting the predictive results, decreasing computational times, reducing excessive noise, and increasing the transparency behind the decisions taken during the training. Despite that, while several visual analytics tools exist to monitor and control the different stages of the ML life cycle (especially those related to data and algorithms), feature engineering support remains inadequate. In this paper, we present FeatureEnVi, a visual analytics system specifically designed to assist with the feature engineering process. Our proposed system helps users to choose the most important feature, to transform the original features into powerful alternatives, and to experiment with different feature generation combinations. Additionally, data space slicing allows users to explore the impact of features on both local and global scales. FeatureEnVi utilizes multiple automatic feature selection techniques; furthermore, it visually guides users with statistical evidence about the influence of each feature (or subsets of features). The final outcome is the extraction of heavily engineered features, evaluated by multiple validation metrics. The usefulness and applicability of FeatureEnVi are demonstrated with two use cases and a case study. We also report feedback from interviews with two ML experts and a visualization researcher who assessed the effectiveness of our system.

Place, publisher, year, edition, pages
IEEE, 2022
Keywords
Feature selection, feature extraction, feature engineering, machine learning, visual analytics, visualization
National Category
Computer Sciences Human Computer Interaction
Research subject
Computer Science, Information and software visualization
Identifiers
urn:nbn:se:lnu:diva-108801 (URN)10.1109/TVCG.2022.3141040 (DOI)000761227900006 ()34990365 (PubMedID)2-s2.0-85122858225 (Scopus ID)
Funder
ELLIIT - The Linköping‐Lund Initiative on IT and Mobile Communications
Available from: 2022-01-05 Created: 2022-01-05 Last updated: 2022-03-29Bibliographically approved
Organisations
Identifiers
ORCID iD: ORCID iD iconorcid.org/0000-0002-2901-935X

Search in DiVA

Show all publications

Profile pages