Motivation: State-of-the-art biomedical named entity recognition (BioNER) systems often require handcrafted features specific to each entity type, such as genes, chemicals and diseases. Although recent studies have explored neural network models for BioNER to free experts from manual feature engineering, performance remains limited by the training data available for each entity type.
Results: We propose a multi-task learning framework for BioNER that collectively uses the training data of different entity types to improve performance on each of them. In experiments on 15 benchmark BioNER datasets, our multi-task model achieves substantially better performance than state-of-the-art BioNER systems and baseline neural sequence labeling models. Further analysis shows that the large performance gains come from sharing character- and word-level information among relevant biomedical entities across differently labeled corpora.
Availability: Our source code is available at https://github.com/yuzhimanhua/lm-lstm-crf.
Contact: xwang174@illinois.edu, xiangren@usc.edu.
Cross-type Biomedical Named Entity Recognition with Deep Multi-Task Learning
Xuan Wang,Yu Zhang,Xiang Ren,Yuhao Zhang,M. Zitnik,Jingbo Shang,C. Langlotz,Jiawei Han
Published 2018 in bioRxiv
PUBLICATION RECORD
- Publication year
2018
- Venue
bioRxiv
- Publication date
2018-01-30
- Fields of study
Biology, Medicine, Computer Science, Mathematics
- Source metadata
Semantic Scholar, PubMed