lnu.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
PAL, a tool for Pre-annotation and Active Learning
Linnaeus University, Faculty of Technology, Department of Computer Science. (ISOVIS)ORCID iD: 0000-0001-6164-7762
Lund University.ORCID iD: 0000-0002-7240-9003
Linnaeus University, Faculty of Technology, Department of Computer Science. (ISOVIS)ORCID iD: 0000-0002-0519-2537
2017 (English)In: Journal for Language Technology and Computational Linguistics, ISSN 2190-6858, Vol. 31, no 1, 91-110 p.Article in journal (Refereed) Accepted
Abstract [en]

Many natural language processing systems rely on machine learning models that are trained on large amounts of manually annotated text data. The lack of sufficient amounts of annotated data is, however, a common obstacle for such systems, since manual annotation of text is often expensive and time-consuming.

The aim of “PAL, a tool for Pre-annotation and Active Learning” is to provide a ready-made package that can be used to simplify annotation and to reduce the amount of annotated data required to train a machine learning classifier. The package provides support for two techniques that have been shown to be successful in previous studies, namely active learning and pre-annotation.

The output of the pre-annotation is provided in the annotation format of the annotation tool BRAT, but PAL is a stand-alone package that can be adapted to other formats. 

Place, publisher, year, edition, pages
GSCL , 2017. Vol. 31, no 1, 91-110 p.
Keyword [en]
NLP, annotation, pre-annotation, active learning, machine learning, text data
National Category
Language Technology (Computational Linguistics)
Research subject
Computer Science, Information and software visualization; Computer and Information Sciences Computer Science, Computer Science
Identifiers
URN: urn:nbn:se:lnu:diva-63836OAI: oai:DiVA.org:lnu-63836DiVA: diva2:1095746
Projects
StaViCTA
Funder
Swedish Research Council, 2012-5659
Note

TO BE PUBLISHED!

Available from: 2017-05-15 Created: 2017-05-15 Last updated: 2017-05-15

Open Access in DiVA

No full text

Search in DiVA

By author/editor
Skeppstedt, MariaParadis, CaritaKerren, Andreas
By organisation
Department of Computer Science
Language Technology (Computational Linguistics)

Search outside of DiVA

GoogleGoogle Scholar

Total: 25 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf