Multi-Criteria-based Active Learning for Named Entity Recognition

Dan Shen,Jie Zhang,Jian Su,Guodong Zhou,C. Tan

Published 2004 in Annual Meeting of the Association for Computational Linguistics

ABSTRACT

In this paper, we propose a multi-criteria-based active learning approach and effectively apply it to named entity recognition. Active learning targets to minimize the human annotation efforts by selecting examples for labeling. To maximize the contribution of the selected examples, we consider the multiple criteria: informativeness, representativeness and diversity and propose measures to quantify them. More comprehensively, we incorporate all the criteria using two selection strategies, both of which result in less labeling cost than single-criterion-based method. The results of the named entity recognition in both MUC-6 and GENIA show that the labeling cost can be reduced by at least 80% without degrading the performance.

PUBLICATION RECORD

  • Publication year

    2004

  • Venue

    Annual Meeting of the Association for Computational Linguistics

  • Publication date

    2004-07-21

  • Fields of study

    Computer Science

  • Identifiers
  • External record

    Open on Semantic Scholar

  • Source metadata

    Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

  • No claims are published for this paper.

CONCEPTS

  • No concepts are published for this paper.

REFERENCES

Showing 1-34 of 34 references · Page 1 of 1

CITED BY

Showing 1-100 of 257 citing papers · Page 1 of 3