Supervised Ontology-Based Document Classification Model
Norwegian University of Science and Technology, Norway. ORCID iD: 0000-0002-0199-2377
Norwegian University of Science and Technology, Norway.
2017 (English) In: Proceedings of the International Conference on Compute and Data Analysis, ICCDA'17, ACM Publications, 2017, p. 245-251. Conference paper, Published paper (Refereed)
Abstract [en]

Ontology-based document classification relies on background knowledge provided by ontologies to represent documents. Background knowledge is embedded in a document using the exact matching technique. The basic idea of this technique is to map a term to a concept by searching only for the concept labels that explicitly occur in a document. Searching only for the presence of concept labels limits the ability to capture and exploit the whole conceptualization involved in user information and content meanings. To address this limitation, we propose a new document classification model based on ontologies. The proposed model uses background knowledge derived from ontologies for document representation. It associates a document with a set of concepts not only by using the exact matching technique but also by identifying and extracting new terms that can be semantically related to the concepts of ontologies. Additionally, the proposed model employs a new concept weighting technique that computes the weight of a concept from the relevance and the importance of the concept. We conducted several experiments using a real ontology and a dataset to test our proposed model. The experiments were run on three different classification algorithms using the baseline ontology, the improved concept vector space model with the new concept weighting technique, and the enriched ontology. The results show that our proposed model achieved a considerable improvement in classification performance.
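
The exact matching technique described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `exact_match_concepts`, the toy concept labels, and the whole-word, case-insensitive matching rule are all assumptions made for illustration only.

```python
import re
from collections import Counter


def exact_match_concepts(text, concept_labels):
    """Count verbatim occurrences of each concept label in a document.

    `concept_labels` maps a (hypothetical) concept name to its label.
    Matching is case-insensitive and on whole words only; this rule is
    an assumption of the sketch, not something stated in the abstract.
    """
    # Normalize the document to lowercase tokens joined by single spaces.
    normalized = " ".join(re.findall(r"[a-z0-9]+", text.lower()))
    counts = Counter()
    for concept, label in concept_labels.items():
        label_norm = " ".join(re.findall(r"[a-z0-9]+", label.lower()))
        if not label_norm:
            continue
        # Lookarounds keep "loan" from matching inside "loans".
        pattern = r"(?<![a-z0-9])" + re.escape(label_norm) + r"(?![a-z0-9])"
        hits = len(re.findall(pattern, normalized))
        if hits:
            counts[concept] = hits
    return dict(counts)


if __name__ == "__main__":
    # Toy labels, invented for this example.
    labels = {"Loan": "loan", "LoanAgreement": "loan agreement", "Grant": "grant"}
    doc = "The loan agreement defines the loan amount. A loan is repaid monthly."
    print(exact_match_concepts(doc, labels))
```

Under these assumptions, a document that mentions only "loans" or "borrowing" would be associated with no concept at all, which is the kind of limitation the abstract attributes to searching only for explicit concept labels.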

Place, publisher, year, edition, pages
ACM Publications, 2017. p. 245-251
Keywords [en]
Document classification, First Sense Heuristic, Maximizing Semantic Similarity, Ontology, SEMCON, iCVS
National subject category
Research subject
Computer and Information Sciences, Informatics
Identifiers
URN: urn:nbn:se:lnu:diva-88767
DOI: 10.1145/3093241.3107883
ISBN: 978-1-4503-5241-3 (print)
OAI: oai:DiVA.org:lnu-88767
DiVA, id: diva2:1346434
Conference
International Conference on Compute and Data Analysis - ICCDA, May 19 - 23, 2017, Lakeland, FL, USA
Available from: 2019-08-27 Created: 2019-08-27 Last updated: 2019-09-06. Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full text: http://doi.acm.org/10.1145/3093241.3107883

Person

Kastrati, Zenun
