A Bootstrapping Method for Learning Semantic Lexicons using Extraction Pattern Contexts

Michael Thelen,E. Riloff

Published 2002 in Conference on Empirical Methods in Natural Language Processing

ABSTRACT

This paper describes a bootstrapping algorithm called Basilisk that learns high-quality semantic lexicons for multiple categories. Basilisk begins with an unannotated corpus and seed words for each semantic category, which are then bootstrapped to learn new words for each category. Basilisk hypothesizes the semantic class of a word based on collective information over a large body of extraction pattern contexts. We evaluate Basilisk on six semantic categories. The semantic lexicons produced by Basilisk have higher precision than those produced by previous techniques, with several categories showing substantial improvement.

PUBLICATION RECORD

  • Publication year

    2002

  • Venue

    Conference on Empirical Methods in Natural Language Processing

  • Publication date

    2002-07-06

  • Fields of study

    Computer Science

  • Identifiers
  • External record

    Open on Semantic Scholar

  • Source metadata

    Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

  • No claims are published for this paper.

CONCEPTS

  • No concepts are published for this paper.

REFERENCES

Showing 1-18 of 18 references · Page 1 of 1

CITED BY

Showing 1-100 of 424 citing papers · Page 1 of 5