Authorship Attribution with Convolutional Neural Networks and POS-Eliding

Julian Hitschler,Esther van den Berg,Ines Rehbein

Published 2017 in Unknown venue

ABSTRACT

We use a convolutional neural network to perform authorship identification on a very homogeneous dataset of scientific publications. In order to investigate the effect of domain biases, we obscure words below a certain frequency threshold, retaining only their POS-tags. This procedure improves test performance due to better generalization on unseen data. Using our method, we are able to predict the authors of scientific publications in the same discipline at levels well above chance.

PUBLICATION RECORD

Publication year
2017
Venue
Unknown venue
Publication date
Unknown publication date
Fields of study
Computer Science
Identifiers
DOI 10.18653/v1/W17-4907
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

Convolutional Neural Networks
2018cited by this paper
Overview of the Author Identification Task at PAN-2017: Style Breach Detection and Author Clustering
2017cited by this paper
Convolutional Neural Networks for Authorship Attribution of Short Texts
2017cited by this paper
Authorship Attribution Using Text Distortion
2017cited by this paper
Authorship Attribution Using a Neural Network Language Model
2016cited by this paper
Author Identification Using Multi-headed Recurrent Neural Networks
2015cited by this paper
Convolutional Neural Networks for Sentence Classification
2014influential reference
Overview of the Author Identification Task at PAN 2013
2013cited by this paper
Distributed Representations of Words and Phrases and their Compositionality
2013cited by this paper
Native language detection with 'cheap' learner corpora
2013cited by this paper
Stylometry and the interplay of topic and L1 in the different annotation layers in the FALKO corpus
2011cited by this paper
A survey of modern authorship attribution methods
2009cited by this paper
The ACL Anthology Reference Corpus: A Reference Dataset for Bibliographic Research in Computational Linguistics
2008cited by this paper
Investigating Topic Influence in Authorship Attribution
2007cited by this paper
Feature-Rich Part-of-Speech Tagging with a Cyclic Dependency Network
2003cited by this paper
Inference and Disputed Authorship: The Federalist
1966cited by this paper
ON SENTENCE- LENGTH AS A STATISTICAL CHARACTERISTIC OF STYLE IN PROSE: WITH APPLICATION TO TWO CASES OF DISPUTED AUTHORSHIP
1939cited by this paper

CITED BY

Responsible guidelines for authorship attribution tasks in NLP
2025cites this paper
Contrastive Disentanglement for Authorship Attribution
2024cites this paper
An Empirical Analysis of the Writing Styles of Persona-Assigned LLMs
2024cites this paper
Indonesian news article authorship attribution multilabel multiclass classification using IndoBERT
2024cites this paper
Understanding writing style in social media with a supervised contrastively pre-trained transformer
2023cites this paper
Deep Learning-based Method for Enhancing the Detection of Arabic Authorship Attribution using Acoustic and Textual-based Features
2023cites this paper
Evaluation of Feature Extraction Techniques in Automatic Authorship Attribution
2023cites this paper
Valla: Standardizing and Benchmarking Authorship Attribution and Verification Through Empirical Evaluation and Comparative Analysis
2023cites this paper
Authorship Attribution of Social Media Messages
2023cites this paper
Detection of news written by the ChatGPT through authorship attribution performed by a Bidirectional LSTM model
2023influential citation
Generating Authorship Embeddings with Transformers
2022cites this paper
On the State of the Art in Authorship Attribution and Authorship Verification
2022cites this paper
Post-Authorship Attribution Using Regularized Deep Neural Network
2022cites this paper
Detection of changes in literary writing style using N-grams as style markers and supervised machine learning
2022cites this paper
The Limits of Word Level Differential Privacy
2022cites this paper
Revealing the Demographic Attributes of the Authors from the Abstracts of Scientific Articles
2022cites this paper
Authorship Attribution in Bangla Literature (AABL) via Transfer Learning using ULMFiT
2022cites this paper
Computational Measures of Deceptive Language: Prospects and Issues
2022cites this paper
Authorship Attribution of Small Messages Through Language Models
2022cites this paper
Authorship Attribution of Scientific Abstracts
2022cites this paper
Evaluation of Deep Learning-based Authorship Attribution Methods on Hungarian Texts
2022cites this paper
A taxonomy and review of generalization research in NLP
2022cites this paper
PART: Pre-trained Authorship Representation Transformer
2022cites this paper
Unifying Lexical, Syntactic, and Structural Representations of Written Language for Authorship Attribution
2021cites this paper
The Topic Confusion Task: A Novel Scenario for Authorship Attribution
2021cites this paper
A language-independent authorship attribution approach for author identification of text documents
2021cites this paper
Authorship Attribution using Filtered N-grams as Features
2021cites this paper
TURINGBENCH: A Benchmark Environment for Turing Test in the Age of Neural Text Generation
2021cites this paper
POSNoise: An Effective Countermeasure Against Topic Biases in Authorship Analysis
2020cites this paper
Detecting Rumors on Social Media Based on a CNN Deep Learning Technique
2020cites this paper
Will Longformers PAN Out for Authorship Verification? Notebook for PAN at CLEF 2020
2020cites this paper
A Self-supervised Representation Learning of Sentence Structure for Authorship Attribution
2020cites this paper
Authorship Attribution for Neural Text Generation
2020influential citation
The Role of Traditional Features in Authorship Attribution
2020cites this paper
Gated POS-Level Language Model for Authorship Verification
2020cites this paper
Syntactic Neural Model for Authorship Attribution
2020cites this paper
An Improved Topic Masking Technique for Authorship Analysis
2020cites this paper
Similarity Learning for Authorship Verification in Social Media
2019cites this paper
Syntactic Recurrent Neural Network for Authorship Attribution
2019cites this paper
Multi-Task Learning for Authorship Attribution via Topic Approximation and Competitive Attention
2019cites this paper
Open Set Authorship Attribution toward Demystifying Victorian Periodicals
2019cites this paper
Reduce & Attribute: Two-Step Authorship Attribution for Large-Scale Problems
2019cites this paper
The Myth of Double-Blind Review Revisited: ACL vs. EMNLP
2019influential citation
Explainable Authorship Verification in Social Media via Attention-based Similarity Learning
2019cites this paper
Style-Aware Neural Model with Application in Authorship Attribution
2019cites this paper
Reconsidering authorship in the Ciceronian corpus through computational authorship attribution
2019cites this paper
Machine Learning Techniques for Detecting Identifying Linguistic Patterns in News Media
2019cites this paper
Authorship Attribution with Neural Networks and Multiple Features: Notebook for PAN at CLEF 2018
2018cites this paper