lnu.sePublications
Change search
Link to record
Permanent link

Direct link
Publications (10 of 11) Show all publications
Jones, G., Ulan, M., Liziniewicz, M., Lindeberg, J. & Adamopoulos, S. (2024). Relating estimates of wood properties of birch to stem form, age and species. Journal of Forestry Research, 35(1), Article ID 14.
Open this publication in new window or tab >>Relating estimates of wood properties of birch to stem form, age and species
Show others...
2024 (English)In: Journal of Forestry Research, ISSN 1007-662X, E-ISSN 1993-0607, Vol. 35, no 1, article id 14Article in journal (Refereed) Published
Abstract [en]

Birch has long suffered from a lack of active forest management, leading many researchers to use material without a detailed management history. Data collected from three birch (Betula pendula Roth, B. pubescens Ehrh.) sites in southern Sweden were analyzed using regression analysis to detect any trends or differences in wood properties that could be explained by stand history, tree age and stem form. All sites were genetics trials established in the same way. Estimates of acoustic velocity (AV) from non-destructive testing (NDT) and predicted AV had a higher correlation if data was pooled across sites and other stem form factors were considered. A subsample of stems had radial profiles of X-ray wood density and ring width by year created, and wood density was related to ring number from the pith and ring width. It seemed likely that wood density was negatively related to ring width for both birch species. Linear models had slight improvements if site and species were included, but only the youngest site with trees at age 15 had both birch species. This paper indicated that NDT values need to be considered separately, and any predictive models will likely be improved if they are specific to the site and birch species measured.

Place, publisher, year, edition, pages
Springer, 2024
Keywords
Acoustic velocity, Non-destructive testing, Predictive models, Regression analysis, Wood density
National Category
Forest Science Wood Science
Research subject
Technology (byts ev till Engineering), Forestry and Wood Technology
Identifiers
urn:nbn:se:lnu:diva-126764 (URN)10.1007/s11676-023-01669-4 (DOI)001117570200007 ()2-s2.0-85178931158 (Scopus ID)
Available from: 2024-01-16 Created: 2024-01-16 Last updated: 2024-09-05Bibliographically approved
Ulan, M. (2021). Aggregation as Unsupervised Learning in Software Engineering and Beyond. (Doctoral dissertation). Växjö: Linnaeus University Press
Open this publication in new window or tab >>Aggregation as Unsupervised Learning in Software Engineering and Beyond
2021 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Ranking alternatives is fundamental to effective decision making. However, creating an overall ranking is difficult if there are multiple criteria, and no single alternative performs best across all criteria. Software engineering is no exception.

Software quality is usually decomposed hierarchically into characteristics, and their quality can be assessed by various direct and indirect metrics. Although such quality models provide a basic understanding of what data to collect and which metrics to use, it is not clear how the metrics should be combined to assess the overall quality. Due to different approaches for aggregation of metrics, the same quality model and the same metrics for assessing the same software artifact could still lead to different assessment results and even to different interpretations.

The proposed aggregation approach in this thesis is well-defined, interpretable, and applicable under realistic conditions. This approach can turn the quality- model- and metric-based assessment of (software) quality into a reliable and reproducible process. We express quality as the probability of detecting something with equal or worse quality, based on all software artifacts observed; good and bad quality is expressed in terms of lower and higher probabilities. 

We validated our approach theoretically and empirically. We conducted empirical studies on Bug prediction, Maintainability assessment, and Information Quality.

We used Software Visualization to analyze the usability of aggregation for analyzing multivariate data in general and the effect of different alternative aggregation approaches, i.e., we designed and implemented an exploratory multivariate data visualization tool.

Finally, we applied our approach to Multi-criteria Ranking to evaluate its transferability to other domains. We evaluated it on a real-world decision-making problem for assessment and ranking of alternatives. Moreover, we applied our approach to the context of Machine Learning. We created a benchmark from a collection of regression problems, and evaluated how well the aggregation output agrees with a ground truth, and how well it represents the properties of the input variables.

The results showed that our approach is not only theoretically sound, it is also accurate, sensitive, identifies anomalies, scales in performance, and can support multi-criteria decision making. Furthermore, our approach is transferable to other domains that require aggregation in hierarchically structured models, and it can be used as an agnostic unsupervised predictor in the absence of a ground truth.

Place, publisher, year, edition, pages
Växjö: Linnaeus University Press, 2021. p. 51
Series
Linnaeus University Dissertations ; 430
Keywords
quality assessment, quantitative methods, aggregation, multi-criteria decision making, unsupervised machine learning
National Category
Computer Sciences
Research subject
Computer and Information Sciences Computer Science
Identifiers
urn:nbn:se:lnu:diva-108115 (URN)9789189460409 (ISBN)9789189460416 (ISBN)
Public defence
2021-12-17, Weber, building K, Växjö, 13:00 (English)
Opponent
Supervisors
Available from: 2021-11-24 Created: 2021-11-19 Last updated: 2025-03-05Bibliographically approved
Ulan, M., Löwe, W., Ericsson, M. & Wingkvist, A. (2021). Copula-based software metrics aggregation. Software quality journal, 29, 863-899
Open this publication in new window or tab >>Copula-based software metrics aggregation
2021 (English)In: Software quality journal, ISSN 0963-9314, E-ISSN 1573-1367, Vol. 29, p. 863-899Article in journal (Refereed) Published
Abstract [en]

A quality model is a conceptual decomposition of an abstract notion of quality into relevant, possibly conflicting characteristics and further into measurable metrics. For quality assessment and decision making, metrics values are aggregated to characteristics and ultimately to quality scores. Aggregation has often been problematic as quality models do not provide the semantics of aggregation. This makes it hard to formally reason about metrics, characteristics, and quality. We argue that aggregation needs to be interpretable and mathematically well defined in order to assess, to compare, and to improve quality. To address this challenge, we propose a probabilistic approach to aggregation and define quality scores based on joint distributions of absolute metrics values. To evaluate the proposed approach and its implementation under realistic conditions, we conduct empirical studies on bug prediction of ca. 5000 software classes, maintainability of ca. 15000 open-source software systems, and on the information quality of ca. 100000 real-world technical documents. We found that our approach is feasible, accurate, and scalable in performance.

Place, publisher, year, edition, pages
Springer, 2021
Keywords
Quality assessment, Quantitative methods, Software metrics, Aggregation, Multivariate statistical methods, Probabilistic models, Copula
National Category
Software Engineering
Research subject
Computer Science, Software Technology
Identifiers
urn:nbn:se:lnu:diva-106779 (URN)10.1007/s11219-021-09568-9 (DOI)000687914800001 ()2-s2.0-85113308308 (Scopus ID)2021 (Local ID)2021 (Archive number)2021 (OAI)
Available from: 2021-09-03 Created: 2021-09-03 Last updated: 2021-12-23Bibliographically approved
Ulan, M., Löwe, W., Ericsson, M. & Wingkvist, A. (2021). Weighted software metrics aggregation and its application to defect prediction. Empirical Software Engineering, 26(5), Article ID 86.
Open this publication in new window or tab >>Weighted software metrics aggregation and its application to defect prediction
2021 (English)In: Empirical Software Engineering, ISSN 1382-3256, E-ISSN 1573-7616, Vol. 26, no 5, article id 86Article in journal (Refereed) Published
Abstract [en]

It is a well-known practice in software engineering to aggregate software metrics to assess software artifacts for various purposes, such as their maintainability or their proneness to contain bugs. For different purposes, different metrics might be relevant. However, weighting these software metrics according to their contribution to the respective purpose is a challenging task. Manual approaches based on experts do not scale with the number of metrics. Also, experts get confused if the metrics are not independent, which is rarely the case. Automated approaches based on supervised learning require reliable and generalizable training data, a ground truth, which is rarely available. We propose an automated approach to weighted metrics aggregation that is based on unsupervised learning. It sets metrics scores and their weights based on probability theory and aggregates them. To evaluate the effectiveness, we conducted two empirical studies on defect prediction, one on ca. 200 000 code changes, and another ca. 5 000 software classes. The results show that our approach can be used as an agnostic unsupervised predictor in the absence of a ground truth.

Place, publisher, year, edition, pages
Springer, 2021
Keywords
Software assessment, Quantitative methods, Defect prediction, Software metrics, Aggregation, Weighting
National Category
Software Engineering
Research subject
Computer Science, Software Technology
Identifiers
urn:nbn:se:lnu:diva-105905 (URN)10.1007/s10664-021-09984-2 (DOI)000665053900001 ()2-s2.0-85108807147 (Scopus ID)2021 (Local ID)2021 (Archive number)2021 (OAI)
Available from: 2021-07-14 Created: 2021-07-14 Last updated: 2024-08-29Bibliographically approved
Ulan, M., Löwe, W., Ericsson, M. & Wingkvist, A. (2020). Towards Meaningful Software Metrics Aggregation. In: Dario Di Nucci, Coen De Roover (Ed.), Proceedings of the 18th Belgium- Netherlands Software Evolution Workshop: . Paper presented at 8th Belgium-Netherlands Software Evolution Workshop (BENEVOL 2019), Brussels, Belgium, 28-29 November, 2019. CEUR-WS.org, 2605
Open this publication in new window or tab >>Towards Meaningful Software Metrics Aggregation
2020 (English)In: Proceedings of the 18th Belgium- Netherlands Software Evolution Workshop / [ed] Dario Di Nucci, Coen De Roover, CEUR-WS.org , 2020, Vol. 2605Conference paper, Published paper (Refereed)
Abstract [en]

Aggregation of software metrics is a challenging task, it is even more complex when it comes to considering weights to indicate the relative importance of software metrics. These weights are mostly determined manually, it results in subjective quality models, which are hard to interpret. To address this challenge, we propose an automated aggregation approach based on the joint distribution of software metrics. To evaluate the effectiveness of our approach, we conduct an empirical study on maintainability assessment for around 5000 classes from open source software systems written in Java and compare our approach with a classical weighted linear combination approach in the context of maintainability scoring and anomaly detection. The results show that approaches assign similar scores, while our approach is more interpretable, sensitive, and actionable.

Place, publisher, year, edition, pages
CEUR-WS.org, 2020
Series
CEUR Workshop Proceedings, E-ISSN 1613-0073
Keywords
Software metrics, Aggregation, Weights, Copula
National Category
Software Engineering
Research subject
Computer Science, Software Technology
Identifiers
urn:nbn:se:lnu:diva-95349 (URN)2-s2.0-85088393934 (Scopus ID)
Conference
8th Belgium-Netherlands Software Evolution Workshop (BENEVOL 2019), Brussels, Belgium, 28-29 November, 2019
Available from: 2020-06-02 Created: 2020-06-02 Last updated: 2024-08-29Bibliographically approved
Ulan, M. (2019). Foundation of Multi-Criteria Quality Scoring. (Licentiate dissertation). Växjö: Linnaeus University Press
Open this publication in new window or tab >>Foundation of Multi-Criteria Quality Scoring
2019 (English)Licentiate thesis, comprehensive summary (Other academic)
Abstract [en]

Software quality becomes more critical as our dependence on software increases. We need better quality assessment than ever. Comparison and ranking of software artifacts, detection of bad or good quality are important tasks for quality assessment.

Software quality models are widely used to support quality assessment. In general, they have a hierarchical structure and defines quality in terms of sub-qualities and metrics in a tree-like structure. Different metrics evaluate different quality criteria, and several metrics often needs to be assessed and aggregated to obtain a total quality score. The quality models standards of today do not enable numerical metrics aggregation. They leave aggregation to decision makers, and different methods of aggregation lead to different assessment results and interpretations. Hence, there is a need to define metrics aggregation formally based on well-known theories.

We propose to consider the probabilistic nature of quality as a solution. We consider metrics as random variables and define quality scores based on joint probabilities. The aggregation, and the quality model in extension, express quality as the probability of detecting something with equal or worse quality, based on all software projects observed; good and bad quality is expressed in terms of lower and higher probabilities. We analyze metrics dependencies using Bayesian networks and define quality models as directed acyclic graphs. Nodes correspond to metrics, and edges indicate dependencies. We propose an implementation using multi-threading to improve the efficiency of joint probabilities computations.

We validate our approach theoretically and in an empirical study on software quality assessment of approximately 100\,000 real-world software artifacts with approximately 4\,000\,000 measurements in total. The results show that our approach gives likely results and scales in performance to large projects.

We also applied our approach to a multi-criteria decision-making task to propose a ranking method to aid evaluation processes. We use a real-world funding allocation problem for a call that attracted approximately 600 applications to evaluate our approach. We compared our approach with the traditional weighted sum aggregation model and found that ranks are similar between the two methods, but our approach provides a more sound basis for a fair assessment.

Further, we implemented an exploratory multivariate data visualization tool, which visualizes the similarities between software artifacts based on joint distributions. We illustrate the usability of our tool with two case studies of real-world examples: a set of technical documents and an open source project written in Java.

Our overall results show that our approach for multi-criteria quality scoring is well-defined, has a clear interpretation, and is applicable under realistic conditions, generalizable, and transferable to other domains.

Place, publisher, year, edition, pages
Växjö: Linnaeus University Press, 2019
National Category
Computer and Information Sciences
Research subject
Computer and Information Sciences Computer Science
Identifiers
urn:nbn:se:lnu:diva-82619 (URN)
Presentation
2019-06-07, B2018, Hus B, Växjö, 10:15 (English)
Opponent
Supervisors
Available from: 2019-05-29 Created: 2019-05-20 Last updated: 2024-08-29Bibliographically approved
Ulan, M., Ericsson, M., Löwe, W. & Wingkvist, A. (2019). Multi-criteria Ranking Based on Joint Distributions: A Tool to Support Decision Making. In: Pańkowska M., Sandkuhl K (Ed.), Perspectives in Business Informatics Research.BIR 2019: 18th International Conference on Business Informatics Research. Paper presented at 18th International Conference, BIR 2019, Katowice, Poland, September 23-25, 2019 (pp. 74-88). Springer
Open this publication in new window or tab >>Multi-criteria Ranking Based on Joint Distributions: A Tool to Support Decision Making
2019 (English)In: Perspectives in Business Informatics Research.BIR 2019: 18th International Conference on Business Informatics Research / [ed] Pańkowska M., Sandkuhl K, Springer, 2019, p. 74-88Conference paper, Published paper (Refereed)
Abstract [en]

Sound assessment and ranking of alternatives are fundamental to effective decision making. Creating an overall ranking is not trivial if there are multiple criteria, and none of the alternatives is the best according to all criteria. To address this challenge, we propose an approach that aggregates criteria scores based on their joint (probability) distribution and obtains the ranking as a weighted product of these scores. We evaluate our approach in a real-world use case based on a funding allocation problem and compare it with the traditional weighted sum aggregation model. The results show that the approaches assign similar ranks, while our approach is more interpretable and sensitive.

Place, publisher, year, edition, pages
Springer, 2019
Series
Lecture Notes in Business Information Processing, ISSN 1865-1348, E-ISSN 1865-1356 ; 365
Keywords
Aggregation, Management by objectives, Ranking
National Category
Software Engineering
Research subject
Computer and Information Sciences Computer Science, Computer Science; Computer Science, Software Technology
Identifiers
urn:nbn:se:lnu:diva-89171 (URN)10.1007/978-3-030-31143-8_6 (DOI)2-s2.0-85075255852 (Scopus ID)978-3-030-31142-1 (ISBN)978-3-030-31143-8 (ISBN)
Conference
18th International Conference, BIR 2019, Katowice, Poland, September 23-25, 2019
Funder
Knowledge Foundation, 20150088
Available from: 2019-09-17 Created: 2019-09-17 Last updated: 2024-08-29Bibliographically approved
Ulan, M., Hönel, S., Martins, R. M., Ericsson, M., Löwe, W., Wingkvist, A. & Kerren, A. (2018). Artifact: Quality Models Inside Out: Interactive Visualization of Software Metrics by Means of Joint Probabilities.
Open this publication in new window or tab >>Artifact: Quality Models Inside Out: Interactive Visualization of Software Metrics by Means of Joint Probabilities
Show others...
2018 (English)Other (Refereed)
Abstract [en]

Assessing software quality, in general, is hard; each metric has a different interpretation, scale, range of values, or measurement method. Combining these metrics automatically is especially difficult, because they measure different aspects of software quality, and creating a single global final quality score limits the evaluation of the specific quality aspects and trade-offs that exist when looking at different metrics. We present a way to visualize multiple aspects of software quality. In general, software quality can be decomposed hierarchically into characteristics, which can be assessed by various direct and indirect metrics. These characteristics are then combined and aggregated to assess the quality of the software system as a whole. We introduce an approach for quality assessment based on joint distributions of metrics values. Visualizations of these distributions allow users to explore and compare the quality metrics of software systems and their artifacts, and to detect patterns, correlations, and anomalies. Furthermore, it is possible to identify common properties and flaws, as our visualization approach provides rich interactions for visual queries to the quality models’ multivariate data. We evaluate our approach in two use cases based on: 30 real-world technical documentation projects with 20,000 XML documents, and an open source project written in Java with 1000 classes. Our results show that the proposed approach allows an analyst to detect possible causes of bad or good quality.

Publisher
p. 1
Keywords
Hierarchical data exploration, Multivariate data visualization, Joint probabilities, t-SNE, Data abstraction
National Category
Computer Sciences
Research subject
Computer Science, Information and software visualization
Identifiers
urn:nbn:se:lnu:diva-98176 (URN)10.5281/zenodo.1311600 (DOI)
Available from: 2020-09-25 Created: 2020-09-25 Last updated: 2025-05-15Bibliographically approved
Ulan, M., Löwe, W., Ericsson, M. & Wingkvist, A. (2018). Introducing Quality Models Based On Joint Probabilities: Introducing Quality Models Based On Joint Probabilities. In: ICSE '18 Proceedings of the 40th International Conference on Software Engineering: Companion Proceeedings: . Paper presented at 40th ACM/IEEE International Conference on Software Engineering (ICSE), MAY 27-JUN 03, 2018, Gothenburg, SWEDEN (pp. 216-217). IEEE
Open this publication in new window or tab >>Introducing Quality Models Based On Joint Probabilities: Introducing Quality Models Based On Joint Probabilities
2018 (English)In: ICSE '18 Proceedings of the 40th International Conference on Software Engineering: Companion Proceeedings, IEEE, 2018, p. 216-217Conference paper, Published paper (Refereed)
Abstract [en]

Multi-dimensional goals can be formalized in so-called quality models. Often, each dimension is assessed with a set of metrics that are not comparable; they come with different units, scale types, and distributions of values. Aggregating the metrics to a single quality score in an ad-hoc manner cannot be expected to provide a reliable basis for decision making. Therefore, aggregation needs to be mathematically well-defined and interpretable. We present such a way of defining quality models based on joint probabilities. We exemplify our approach using a quality model with 30 standard metrics assessing technical documentation quality and study ca. 20,000 real-world files. We study the effect of several tests on the independence and results show that metrics are, in general, not independent. Finally, we exemplify our suggested definition of quality models in this domain.

Place, publisher, year, edition, pages
IEEE, 2018
Series
Proceedings of the IEEE-ACM International Conference on Software Engineering Companion, ISSN 2574-1926, E-ISSN 2574-1934
Keywords
Quality assessment, Software metrics, Bayesian networks
National Category
Software Engineering
Research subject
Computer Science, Software Technology
Identifiers
urn:nbn:se:lnu:diva-79015 (URN)10.1145/3183440.3195103 (DOI)000450109000078 ()2-s2.0-85049686901 (Scopus ID)978-1-4503-5663-3 (ISBN)
Conference
40th ACM/IEEE International Conference on Software Engineering (ICSE), MAY 27-JUN 03, 2018, Gothenburg, SWEDEN
Available from: 2018-12-06 Created: 2018-12-06 Last updated: 2024-08-29Bibliographically approved
Ulan, M., Hönel, S., Martins, R. M., Ericsson, M., Löwe, W., Wingkvist, A. & Kerren, A. (2018). Quality Models Inside Out: Interactive Visualization of Software Metrics by Means of Joint Probabilities. In: J. Ángel Velázquez Iturbide, Jaime Urquiza Fuentes, Andreas Kerren, and Mircea F. Lungu (Ed.), Proceedings of the 2018 Sixth IEEE Working Conference on Software Visualization, (VISSOFT), Madrid, Spain, 2018: . Paper presented at IEEE Working Conference on Software Visualization (VISSOFT), Madrid, Spain, 24-25 September, 2018 (pp. 65-75). IEEE
Open this publication in new window or tab >>Quality Models Inside Out: Interactive Visualization of Software Metrics by Means of Joint Probabilities
Show others...
2018 (English)In: Proceedings of the 2018 Sixth IEEE Working Conference on Software Visualization, (VISSOFT), Madrid, Spain, 2018 / [ed] J. Ángel Velázquez Iturbide, Jaime Urquiza Fuentes, Andreas Kerren, and Mircea F. Lungu, IEEE, 2018, p. 65-75Conference paper, Published paper (Refereed)
Abstract [en]

Assessing software quality, in general, is hard; each metric has a different interpretation, scale, range of values, or measurement method. Combining these metrics automatically is especially difficult, because they measure different aspects of software quality, and creating a single global final quality score limits the evaluation of the specific quality aspects and trade-offs that exist when looking at different metrics. We present a way to visualize multiple aspects of software quality. In general, software quality can be decomposed hierarchically into characteristics, which can be assessed by various direct and indirect metrics. These characteristics are then combined and aggregated to assess the quality of the software system as a whole. We introduce an approach for quality assessment based on joint distributions of metrics values. Visualizations of these distributions allow users to explore and compare the quality metrics of software systems and their artifacts, and to detect patterns, correlations, and anomalies. Furthermore, it is possible to identify common properties and flaws, as our visualization approach provides rich interactions for visual queries to the quality models’ multivariate data. We evaluate our approach in two use cases based on: 30 real-world technical documentation projects with 20,000 XML documents, and an open source project written in Java with 1000 classes. Our results show that the proposed approach allows an analyst to detect possible causes of bad or good quality.

Place, publisher, year, edition, pages
IEEE, 2018
Keywords
hierarchical data exploration, multivariate data visualization, joint probabilities, t-SNE, data abstraction
National Category
Human Computer Interaction Software Engineering
Research subject
Computer Science, Information and software visualization; Computer Science, Software Technology
Identifiers
urn:nbn:se:lnu:diva-78093 (URN)10.1109/VISSOFT.2018.00015 (DOI)000519580000007 ()2-s2.0-85058463111 (Scopus ID)978-1-5386-8292-0 (ISBN)978-1-5386-8293-7 (ISBN)
Conference
IEEE Working Conference on Software Visualization (VISSOFT), Madrid, Spain, 24-25 September, 2018
Projects
Software technology for self-adaptive systems
Funder
Knowledge Foundation, 20150088
Available from: 2018-10-01 Created: 2018-10-01 Last updated: 2025-05-15Bibliographically approved
Organisations
Identifiers
ORCID iD: ORCID iD iconorcid.org/0000-0002-3906-7611

Search in DiVA

Show all publications