Price Prediction of Vinyl Records Using Machine Learning Algorithms
2020 (English)Independent thesis Basic level (degree of Bachelor), 10 credits / 15 HE credits
Student thesis
Abstract [en]
Machine learning algorithms have been used for price prediction within several application areas. Examples include real estate, the stock market, tourist accommodation, electricity, art, cryptocurrencies, and fine wine. Common approaches in studies are to evaluate the accuracy of predictions and compare different algorithms, such as Linear Regression or Neural Networks. There is a thriving global second-hand market for vinyl records, but the research of price prediction within the area is very limited. The purpose of this project was to expand on existing knowledge within price prediction in general to evaluate some aspects of price prediction of vinyl records. That included investigating the possible level of accuracy and comparing the efficiency of algorithms. A dataset of 37000 samples of vinyl records was created with data from the Discogs website, and multiple machine learning algorithms were utilized in a controlled experiment. Among the conclusions drawn from the results was that the Random Forest algorithm generally generated the strongest results, that results can vary substantially between different artists or genres, and that a large part of the predictions had a good accuracy level, but that a relatively small amount of large errors had a considerable effect on the general results.
Place, publisher, year, edition, pages
2020. , p. 88
Keywords [en]
price prediction, price estimation, vinyl records, vinyl prices, regression, machine learning, machine learning algorithms, algorithm comparison, dataset, vinyl dataset, k-nearest neighbors, linear regression, neural network, random forest, discogs, ai, artificial intelligence
Keywords [sv]
prisuppskattning, vinyl, vinylskivor, maskininlärning, artificiell intelligens
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:lnu:diva-96464OAI: oai:DiVA.org:lnu-96464DiVA, id: diva2:1443317
Subject / course
Computer Science
Educational program
Datavetenskap, kandidatprogram, 60 hp
Supervisors
Examiners
2020-06-182020-06-182020-06-18Bibliographically approved