PAL, a tool for Pre-annotation and Active Learning
Linnaeus University, Faculty of Technology, Department of Computer Science (ISOVIS). ORCID iD: 0000-0001-6164-7762
Lund University. ORCID iD: 0000-0002-7240-9003
Linnaeus University, Faculty of Technology, Department of Computer Science (ISOVIS). ORCID iD: 0000-0002-0519-2537
2016 (English). In: Journal for Language Technology and Computational Linguistics, ISSN 0175-1336, E-ISSN 2190-6858, Vol. 31, no. 1, pp. 81-100. Article in journal (Refereed), Published
Abstract [en]

Many natural language processing systems rely on machine learning models that are trained on large amounts of manually annotated text data. The lack of sufficient amounts of annotated data is, however, a common obstacle for such systems, since manual annotation of text is often expensive and time-consuming.

The aim of “PAL, a tool for Pre-annotation and Active Learning” is to provide a ready-made package that can be used to simplify annotation and to reduce the amount of annotated data required to train a machine learning classifier. The package provides support for two techniques that have been shown to be successful in previous studies, namely active learning and pre-annotation.

The output of the pre-annotation is provided in the annotation format of the annotation tool BRAT, but PAL is a stand-alone package that can be adapted to other formats. 
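
As an illustration of the two techniques named in the abstract, the following is a minimal sketch and not PAL's actual code. It assumes a binary classifier that yields one positive-class probability per unlabelled item, and it selects the items whose probability lies closest to 0.5 (one common active learning strategy, uncertainty sampling). It also shows how predicted character spans could be written as BRAT text-bound annotation lines. The function names `select_most_uncertain` and `to_brat_standoff`, the labels, and the example sentence are hypothetical.

```python
from typing import List, Tuple


def select_most_uncertain(probabilities: List[float], k: int) -> List[int]:
    """Return indices of the k items whose probability is closest to 0.5.

    Hypothetical helper: assumes one positive-class probability per item.
    """
    ranked = sorted(range(len(probabilities)),
                    key=lambda i: abs(probabilities[i] - 0.5))
    return ranked[:k]


def to_brat_standoff(text: str, spans: List[Tuple[int, int, str]]) -> str:
    """Format (start, end, label) character spans as BRAT text-bound lines.

    Each line has the form: T<n> TAB <label> <start> <end> TAB <covered text>
    """
    lines = []
    for n, (start, end, label) in enumerate(spans, start=1):
        lines.append(f"T{n}\t{label} {start} {end}\t{text[start:end]}")
    return "\n".join(lines)


if __name__ == "__main__":
    # Made-up classifier probabilities for four unlabelled items.
    print(select_most_uncertain([0.95, 0.52, 0.10, 0.45], k=2))

    # Made-up sentence and pre-annotated spans.
    sentence = "Aspirin relieves headache."
    print(to_brat_standoff(sentence, [(0, 7, "Drug"), (17, 25, "Symptom")]))
```

The items selected in this way would be the ones handed to the annotator first, with the pre-annotated spans loaded into BRAT for correction rather than annotation from scratch.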

Place, publisher, year, edition, pages
GSCL, 2016. Vol. 31, no. 1, pp. 81-100.
Keyword [en]
NLP, annotation, pre-annotation, active learning, machine learning, text data
National Category
Language Technology (Computational Linguistics)
Research subject
Computer Science, Information and software visualization; Computer and Information Sciences Computer Science, Computer Science
Identifiers
URN: urn:nbn:se:lnu:diva-63836
OAI: oai:DiVA.org:lnu-63836
DiVA: diva2:1095746
Projects
StaViCTA
Funder
Swedish Research Council, 2012-5659
Available from: 2017-05-15. Created: 2017-05-15. Last updated: 2017-09-15. Bibliographically approved.

Authors
Skeppstedt, Maria; Paradis, Carita; Kerren, Andreas