lnu.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
A General Framework for Text Document Classification Using SEMCON and ACVSR
2015 (English)In: Human Interface and the Management of Information. Information and Knowledge Design, Springer International Publishing , 2015, p. 310-319Conference paper, Published paper (Refereed)
Abstract [en]

The text document classification employs either text based approach or semantic based approach to index and retrieve text documents. The former uses keywords and therefore provides limited capabilities to capture and exploit the conceptualization involved in user information needs and content meanings. The latter aims to solve these limitations using content meanings, rather than keywords. More formally, the semantic based approach uses the domain ontology to exploit the content meanings of a particular domain. This approach however has some drawbacks. It lacks enrichment of ontology concepts with new lexical resources and evaluation of the importance indicated by weights of those concepts. Therefore to address these issues, this paper proposes a new ontology based text document classification framework. The proposed framework incorporates a newly developed objective metric calledSEMCON to enrich the domain ontology with new concepts by combining contextual as well as semantic information of a term within a text document. The framework also introduces a new approach to automatically estimate the importance of ontology concepts which is indicated by the weights of these concepts, and to enhance the concept vector space model using automatically estimated weights.

Place, publisher, year, edition, pages
Springer International Publishing , 2015. p. 310-319
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:lnu:diva-89059OAI: oai:DiVA.org:lnu-89059DiVA, id: diva2:1350235
Conference
HCI International 2015
Available from: 2019-09-11 Created: 2019-09-11 Last updated: 2019-09-11

Open Access in DiVA

No full text in DiVA

Search in DiVA

By author/editor
Kastrati, Zenun
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar

urn-nbn

Altmetric score

urn-nbn
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf