Analysis of the relation between RNA and RBPs using machine learning
2021 (English)Independent thesis Basic level (degree of Bachelor), 10 credits / 15 HE credits
Student thesisAlternative title
Analys av relationen mellan RNA och RBPs med hjälp av maskininlärning (Swedish)
Abstract [en]
The study of RNA-binding proteins has recently increased in importance due to discoveries of their larger role in cellular processes. One study currently conducted at Umeå University involves constructing a model that will be able to improve our knowledge about T-cells by explaining how these cells work in different diseases. But before this model can become a reality, Umeå Univerity needs to investigate the relation between RNA and RNA-binding proteins and find proteins of which highly contribute to the activity of the RNA-binding proteins. To do so, they have decided to use four penalized regression Machine Learning models to analyse protein sequences from CD4 cells. These models consist of a ridge penalized model, an elastic net model, a neural network model, and a Bayesian model. The results show that the models have a number of RNA-binding protein sequences in common which they list as highly decisive in their predictions.
Place, publisher, year, edition, pages
2021. , p. 38
Keywords [en]
Machine Learning, Supervised Learning, Linear Regression, RNA-binding Proteins, LIME, T-Cells, CD4 Cells, K-mer, Bag of Words, RBP Activity Prediction
National Category
Computer Sciences Cell Biology
Identifiers
URN: urn:nbn:se:lnu:diva-107092OAI: oai:DiVA.org:lnu-107092DiVA, id: diva2:1596786
External cooperation
Umeå universitet
Educational program
Computer Engineering Programme, 180 credits
Supervisors
Examiners
2021-09-232021-09-232021-09-23Bibliographically approved