Exploiting Language Models to Classify Events from Twitter

Published 2015 in Computational Intelligence and Neuroscience

ABSTRACT

Classifying events on Twitter is challenging because tweet texts form a large volume of temporal data that is noisy and spans many kinds of topics. In this paper, we propose a method to classify events from Twitter. We first find the distinguishing terms between tweets in events and measure their similarity using language models such as ConceptNet and a latent Dirichlet allocation method for selectional preferences (LDA-SP), both of which have been widely studied on large text corpora for modeling linguistic relations. The relationships between terms in tweets are discovered by checking them under each model. We then propose a method to compute the similarity between tweets based on tweet features, including common terms and the relationships among their distinguishing terms. The resulting similarity is explicit and convenient to use with k-nearest neighbor techniques for classification. Experiments on the Edinburgh Twitter Corpus show that our method achieves competitive results for classifying events.
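
The abstract does not give the similarity formula, so the following is only a minimal sketch of the general idea under stated assumptions: similarity between two tweets combines the overlap of their common terms with a relatedness score between their distinguishing (non-shared) terms, and a k-nearest neighbor vote assigns the event label. The `RELATEDNESS` table, the weight `alpha`, the Jaccard overlap, and all function names are hypothetical stand-ins introduced for illustration; in the paper the term relationships come from ConceptNet and LDA-SP, which are not called here.

```python
from collections import Counter

# Hypothetical relatedness table standing in for ConceptNet / LDA-SP lookups.
# The term pairs and scores are invented for illustration only.
RELATEDNESS = {
    frozenset({"quake", "earthquake"}): 0.9,
    frozenset({"earthquake", "tsunami"}): 0.8,
    frozenset({"goal", "match"}): 0.7,
}


def relatedness(a, b):
    """Return the assumed relatedness of two terms, 0.0 if the pair is unknown."""
    return RELATEDNESS.get(frozenset({a, b}), 0.0)


def tweet_similarity(terms_a, terms_b, alpha=0.5):
    """Combine common-term overlap with relatedness of distinguishing terms.

    alpha is an assumed mixing weight, not a value taken from the paper.
    """
    a, b = set(terms_a), set(terms_b)
    if not a or not b:
        return 0.0
    common = a & b
    overlap = len(common) / len(a | b)  # Jaccard overlap of the two term sets
    only_a, only_b = a - common, b - common
    if only_a and only_b:
        # For each distinguishing term of the first tweet, take its best match
        # among the distinguishing terms of the second tweet, then average.
        best = [max(relatedness(x, y) for y in only_b) for x in only_a]
        related = sum(best) / len(best)
    else:
        related = 0.0
    return alpha * overlap + (1 - alpha) * related


def knn_classify(query_terms, labeled_tweets, k=3):
    """Label a tweet by majority vote among its k most similar labeled tweets."""
    ranked = sorted(
        labeled_tweets,
        key=lambda item: tweet_similarity(query_terms, item[0]),
        reverse=True,
    )
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]


# Toy labeled tweets (already tokenized); invented for illustration.
TRAIN = [
    (["earthquake", "japan", "damage"], "disaster"),
    (["tsunami", "warning", "coast"], "disaster"),
    (["goal", "final", "team"], "sports"),
    (["match", "team", "win"], "sports"),
]

print(knn_classify(["quake", "japan", "coast"], TRAIN, k=3))
```

In this toy setup, "quake" shares no surface form with "earthquake", so only the relatedness term links the query to the first training tweet; that is the role the abstract assigns to the language models. Note that this sketch averages over the first tweet's distinguishing terms only, so the score is not symmetric in general.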

PUBLICATION RECORD

Publication year
2015
Venue
Computational Intelligence and Neuroscience
Publication date
2015-09-14
Fields of study
Medicine, Computer Science
Identifiers
DOI 10.1155/2015/401024 PMID 26451139 PMCID 4584231
Source metadata
Semantic Scholar, PubMed

REFERENCES

Learning to classify short text from scientific documents using topic models with various types of knowledge
2015, cited by this paper
Information Spread of Emergency Events: Path Searching on Social Networks
2014, cited by this paper
ConceptNet 5: A Large Semantic Network for Relational Knowledge
2013, influential reference
Extraction of Semantic Relation Based on Feature Vector from Wikipedia
2012, cited by this paper
Object matching in tweets with spatial models
2012, cited by this paper
Analysis of Browsing Behaviors with Ant Colony Clustering Algorithm
2012, cited by this paper
Psychology and behavior mechanism of micro-blog information spreading
2012, cited by this paper
Finding Bursty Topics from Microblogs
2012, cited by this paper
Beyond Trending Topics: Real-World Event Identification on Twitter
2011, cited by this paper
Event Detection in Twitter
2011, cited by this paper
Event Discovery in Social Media Feeds
2011, cited by this paper
Summarizing a Document Stream
2011, cited by this paper
Tracking and Connecting Topics via Incremental Hierarchical Dirichlet Processes
2011, cited by this paper
Relevance Modeling for Microblog Summarization
2011, cited by this paper
Safety Information Mining — What can NLP do in a disaster—
2011, influential reference
Discovering Sociolinguistic Associations with Structured Sparsity
2011, cited by this paper
Extracting events and event descriptions from Twitter
2011, cited by this paper
Emergency Event: Internet Spread, Psychological Impacts and Emergency Management
2011, cited by this paper
Recognizing Named Entities in Tweets
2011, cited by this paper
A Hidden Topic-Based Framework toward Building Applications with Short Web Documents
2011, cited by this paper
Comparing Twitter Summarization Algorithms for Multiple Post Summaries
2011, cited by this paper
Automatic Summarization of Twitter Topics
2010, cited by this paper
Earthquake shakes Twitter users: real-time event detection by social sensors
2010, influential reference
A Latent Dirichlet Allocation Method for Selectional Preferences
2010, cited by this paper
Part-of-Speech Tagging for Twitter: Annotation, Features, and Experiments
2010, cited by this paper
The Edinburgh Twitter Corpus
2010, cited by this paper
Applications of Topics Models to Analysis of Disaster-Related Twitter Data
2009, influential reference
Normalized (pointwise) mutual information in collocation extraction
2009, cited by this paper
Latent Dirichlet Allocation
2009, cited by this paper
Integrating web-based intelligence retrieval and decision-making from the twitter trends knowledge base
2009, cited by this paper
Text classification based on multi-word with support vector machine
2008, cited by this paper
Latent semantic analysis
2008, cited by this paper
The Tradeoffs Between Open and Traditional Relation Extraction
2008, cited by this paper
Information diffusion through blogspace
2004, cited by this paper
Event threading within news topics
2004, cited by this paper
Mixed-membership models of scientific publications
2004, cited by this paper
BlogPulse: Automated Trend Discovery for Weblogs
2003, cited by this paper
Learning to classify text using support vector machines - methods, theory and algorithms
2002, cited by this paper
An introduction to latent semantic analysis
1998, cited by this paper
On-line new event detection and tracking
1998, cited by this paper
Identifying Temporal Patterns and Key Players in Document Collections
1995, cited by this paper
Word Association Norms, Mutual Information, and Lexicography
1989, cited by this paper

CITED BY

Reviewer recommendation method for scientific research proposals: a case for NSFC
2022, cites this paper
Hot Topic Recognition of Health Rumors Based on Anti-Rumor Articles on the WeChat Official Account Platform: Topic Modeling
2022, cites this paper
Hybrid Onion Layered System for the Analysis of Collective Subjectivity in Social Networks
2022, cites this paper
RevDet: Robust and Memory Efficient Event Detection and Tracking in Large News Feeds
2021, cites this paper
Continuous Similarity Learning with Shared Neural Semantic Representation for Joint Event Detection and Evolution
2020, cites this paper
Speak up, Fight Back! Detection of Social Media Disclosures of Sexual Harassment
2019, cites this paper
Criminal Event Ontology Population and Enrichment using Patterns Recognition from Text
2019, cites this paper
Mediagasms, Ironic Nerds, and Mainstream Geeks: A Multimethodological Ideographic Cluster Analysis of and on Twitter
2018, cites this paper
A learning framework for information block search based on probabilistic graphical models and Fisher Kernel
2017, cites this paper
Twitter as a Tool for Health Research: A Systematic Review
2017, cites this paper
Towards knowledge modeling and manipulation technologies: A survey
2016, cites this paper
Sub-story detection in Twitter with hierarchical Dirichlet processes
2016, cites this paper
Automated code compliance checking in the construction domain using semantic natural language processing and logic-based reasoning
2015, cites this paper