lnu.sePublikationer
Ändra sökning
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Visualization of Text Duplicates in Documents
Växjö universitet, Fakulteten för matematik/naturvetenskap/teknik, Matematiska och systemtekniska institutionen.
Växjö universitet, Fakulteten för matematik/naturvetenskap/teknik, Matematiska och systemtekniska institutionen.
2009 (Engelska)Självständigt arbete på avancerad nivå (masterexamen), 15 poäng / 22,5 hpStudentuppsats (Examensarbete)
Abstract [en]

In this thesis, a tool to visualize duplicate parts in a series of given documents is developed.

Text duplicates are very common nowadays in all fields. This behavior severelyharms the rights of the original authors though it facilitates the work of those whocopy from them. Effective legal measures have been taken when it comes to copyrightissue. An increasing large number of people have paid serious attention to what theywrite when they refer to other people's works. Although references are properly madeby many who admire and respect others' achievements, plagiarism takes place all thetime. Therefore, an intuitive way of visualizing duplicate parts is needed so thatpeople can easily grasp the purpose and decide the legality of those duplicates. Whenit comes to computer science, software clone is very typical phenomenon amongdifferent development groups or even within one group. Since a piece of softwareusually have its hierarchy, it is also interesting to group members when they do aclone detection of their own or other software. For example, if a good overview of thehierarchies is provided in a tree representation, one can easily locate the clones of aparticular node in other trees. More interaction techniques can allow concrete codeaccesses through double clicking on a highlighted node.

To visualize duplicate parts in a nice and intuitive way, a visualization tool isdeveloped for this thesis project. By the time it is done, the following features shouldbe fulfilled. First, the tool can visualize similar or identical parts given a data set.Second, hierarchies of those files can be demonstrated with proper layout. Third, theuser can manipulate the data items on the screen in order to get a better insight of thedata set and help with analysis tasks. Forth, different levels of abstraction areprovided so that the user can either get an overview of all the files or specificallycheck the duplicate parts in the documents of interest.

Ort, förlag, år, upplaga, sidor
2009. , s. 77
Serie
Rapporter från MSI, ISSN 1650-2647
Nyckelord [en]
Duplicates, PREFUSE, Visualization, Treemap, Similarity, Interaction
Identifikatorer
URN: urn:nbn:se:vxu:diva-5408ISRN: VXU/MSI/DA/E/--09029/--SEOAI: oai:DiVA.org:vxu-5408DiVA, id: diva2:224192
Presentation
D1169, School of Mathematics and Systems Engineering, D building, Växjö University (Engelska)
Uppsök

Handledare
Examinatorer
Projekt
Visualization of Text Duplicates in DocumentsTillgänglig från: 2009-06-17 Skapad: 2009-06-17 Senast uppdaterad: 2010-03-10Bibliografiskt granskad

Open Access i DiVA

fulltext(3277 kB)359 nedladdningar
Filinformation
Filnamn FULLTEXT01.pdfFilstorlek 3277 kBChecksumma SHA-512
0346157fbf818e3e9e6e5932d36fb0b8b7b0d778dd2feec0e4d53ac6c9f6b861d541155884b5da91908529d6147a669b2edd358cbaa555697adac8a8a6f31575
Typ fulltextMimetyp application/pdf

Sök vidare i DiVA

Av författaren/redaktören
Wang, ChaoPan, Han
Av organisationen
Matematiska och systemtekniska institutionen

Sök vidare utanför DiVA

GoogleGoogle Scholar
Totalt: 359 nedladdningar
Antalet nedladdningar är summan av nedladdningar för alla fulltexter. Det kan inkludera t.ex tidigare versioner som nu inte längre är tillgängliga.

urn-nbn

Altmetricpoäng

urn-nbn
Totalt: 395 träffar
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf