We use a convolutional neural network to perform authorship identification on a very homogeneous dataset of scientific publications. In order to investigate the effect of domain biases, we obscure words below a certain frequency threshold, retaining only their POS-tags. This procedure improves test performance due to better generalization on unseen data. Using our method, we are able to predict the authors of scientific publications in the same discipline at levels well above chance.
Authorship Attribution with Convolutional Neural Networks and POS-Eliding
Julian Hitschler,Esther van den Berg,Ines Rehbein
Published 2017 in Unknown venue
ABSTRACT
PUBLICATION RECORD
- Publication year
2017
- Venue
Unknown venue
- Publication date
Unknown publication date
- Fields of study
Computer Science
- Identifiers
- External record
- Source metadata
Semantic Scholar
CITATION MAP
EXTRACTION MAP
CLAIMS
- No claims are published for this paper.
CONCEPTS
- No concepts are published for this paper.
REFERENCES
Showing 1-17 of 17 references · Page 1 of 1
CITED BY
Showing 1-48 of 48 citing papers · Page 1 of 1