lnu.sePublikasjoner
Endre søk
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annet format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Applicability domain dependent predictive uncertainty in QSAR regressions
Linnéuniversitetet, Fakulteten för Hälso- och livsvetenskap (FHL), Institutionen för biologi och miljö (BOM). Centre of Environmental and Climate Research, Lund University, Lund.
Ideaconsult Ltd, Sofia, Bulgaria.
Linnéuniversitetet, Fakulteten för Hälso- och livsvetenskap (FHL), Institutionen för biologi och miljö (BOM).ORCID-id: 0000-0001-9382-9296
2014 (engelsk)Inngår i: Molecular Informatics, ISSN 1868-1743, Vol. 33, nr 1, s. 26-35Artikkel i tidsskrift (Fagfellevurdert) Published
Abstract [en]

Predictive models used in decision making, such as QSARs in chemical regulation or drug discovery, call for evaluated approaches to quantitatively assess associated uncertainty in predictions. Uncertainty in less reliable predictions may be captured by locally varying predictive errors. In the current study, model-based bootstrapping was combined with analogy reasoning to generate predictive distributions varying in magnitude over a model’s domain of applicability. A resampling experiment based on PLS regressions on four QSAR data sets demonstrated that predictive errors assessed by k nearest neighbour or weighted PRedicted Error Sum of Squares (PRESS) on samples of external test data or by internal cross-validation improved the performance of the uncertainty assessment. Analogy using similarity defined by Euclidean distances, or differences in standard deviation in perturbed predictions, resulted in better performances than similarity defined by distance to, or density of, the training data. Locally assessed predictive distributions had on average at least as good coverage as Gaussian distribution with variance assessed from the PRESS. An R-code is provided that evaluates performances of the suggested algorithms to assess predictive error based on log likelihood scores and empirical coverage graphs, and which applies these to derive confidence intervals or samples from the predictive distributions of query compounds.

sted, utgiver, år, opplag, sider
2014. Vol. 33, nr 1, s. 26-35
HSV kategori
Forskningsprogram
Naturvetenskap, Miljövetenskap
Identifikatorer
URN: urn:nbn:se:lnu:diva-28032DOI: 10.1002/minf.201200131ISI: 000346768100004Scopus ID: 2-s2.0-84895165560OAI: oai:DiVA.org:lnu-28032DiVA, id: diva2:639899
Tilgjengelig fra: 2013-08-11 Laget: 2013-08-11 Sist oppdatert: 2019-11-25bibliografisk kontrollert

Open Access i DiVA

Fulltekst mangler i DiVA

Andre lenker

Forlagets fulltekstScopus

Person

Sahlin, UllrikaÖberg, Tomas

Søk i DiVA

Av forfatter/redaktør
Sahlin, UllrikaÖberg, Tomas
Av organisasjonen

Søk utenfor DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric

doi
urn-nbn
Totalt: 230 treff
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annet format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf