Mining motifs in massive time series databases

Pranav Patel, Eamonn J. Keogh, Jessica Lin, Stefano Lonardi

Published 2002 in IEEE International Conference on Data Mining. Proceedings

ABSTRACT

The problem of efficiently locating previously known patterns in a time series database (i.e., query by content) has received much attention and may now largely be regarded as a solved problem. However, from a knowledge discovery viewpoint, a more interesting problem is the enumeration of previously unknown, frequently occurring patterns. We call such patterns "motifs" because of their close analogy to their discrete counterparts in computational biology. An efficient motif discovery algorithm for time series would be useful as a tool for summarizing and visualizing massive time series databases. In addition, it could be used as a subroutine in various other data mining tasks, including the discovery of association rules, clustering, and classification. In this paper we carefully motivate, then introduce, a nontrivial definition of time series motifs. We propose an efficient algorithm to discover them, and we demonstrate the utility and efficiency of our approach on several real-world datasets.

PUBLICATION RECORD

Publication year: 2002
Venue: IEEE International Conference on Data Mining. Proceedings
Publication date: 2002-12-09
Fields of study: Computer Science
Identifiers: DOI 10.1109/ICDM.2002.1183925
Source metadata: Semantic Scholar

CITED BY

Learning to Reason: Temporal Saliency Distillation for Interpretable Knowledge Transfer
2026cites this paper
Exact and Efficient Similar Subtrajectory Search: Integrating Constraints and Simplification
2025cites this paper
Automatic Lifestate Identification for High-Dimensional Time Series Data
2025cites this paper
Compositionality in Time Series: A Proof of Concept using Symbolic Dynamics and Compositional Data Augmentation
2025cites this paper
Properties and predicted functions of large genes and proteins of apicomplexan parasites
2024cites this paper
AutoML for anomaly detection in a semi or unsupervised setting on time series
2023cites this paper
A machine learning-based Anomaly Detection Framework for building electricity consumption data
2023cites this paper
Data mining approach for production order identification in load profiles of machine tools: A change-point and clustering based analysis
2023cites this paper
Visualizing Large-Scale Spatial Time Series with GeoChron
2023cites this paper
Exploring and visualizing temporal relations in multivariate time series
2023cites this paper
ML-Assisted Optimization of Securities Lending
2023cites this paper
Efficiently Mining Frequent Representative Motifs in Large Collections of Time Series
2023cites this paper
Machine Learning Based Estimation of Buildings’ Characteristics Employing Electrical and Chilled Water Consumption Data: Pipeline Optimization
2023cites this paper
Motiflets - Simple and Accurate Detection of Motifs in Time Series
2022cites this paper
Synthetic Ground Truth Generation of an Electricity Consumption Dataset
2022cites this paper
Evaluating the influence of sleep quality and quantity on glycemic control in adults with type 1 diabetes
2022cites this paper
Motiflets - Fast and Accurate Detection of Motifs in Time Series
2022cites this paper
Real-time visual analytics for in-home medical rehabilitation of stroke patient—systematic review
2022cites this paper
TraTSA: A Transprecision Framework for Efficient Time Series Analysis
2022cites this paper
Discovering All-chain Set with Direction and Graduality Characteristics over Streaming Time Series
2022cites this paper
gsstSIM: A high‐performance and synchronized similarity analysis method of spatiotemporal trajectory based on grid model representation
2022cites this paper
Fast and Scalable Mining of Time Series Motifs with Probabilistic Guarantees
2022cites this paper
MATSA: An MRAM-Based Energy-Efficient Accelerator for Time Series Analysis
2022cites this paper
SMM: Leveraging Metadata for Contextually Salient Multi-Variate Motif Discovery
2021cites this paper
Discovering motifs restricted in space-time. (Découverte de motifs fréquents limités en espace-temps)
2021cites this paper
Modern Machine Learning Methods for Telemetry-Based Spacecraft Health Monitoring
2021cites this paper
ECG Prediction based on Bidirectional Time Series Chain Discovery Algorithm
2021cites this paper
Analysis of temporal patterns in animal movement networks
2021cites this paper
Variable-Length Latent Motif Discovery
2021cites this paper
An Invariance-guided Stability Criterion for Time Series Clustering Validation
2021cites this paper
Modelling and Reasoning for Indirect Sensing over Discrete-time via Markov Logic Networks
2021cites this paper
Machine Learning–based Cyber Attacks Targeting on Controlled Information
2021cites this paper
Strongly Sublinear Algorithms for Testing Pattern Freeness
2021cites this paper
Driving Maneuver Classification Using Domain Specific Knowledge and Transfer Learning
2021cites this paper
Classification and characterization of intra-day load curves of PV and non-PV households using interpretable feature extraction and feature-based clustering
2021cites this paper
Mining Graph-Fourier Transform Time Series for Anomaly Detection of Internet Traffic at Core and Metro Networks
2021cites this paper
Anytime Subgroup Discovery in High Dimensional Numerical Data
2021cites this paper
Neighbor Profile: Bagging Nearest Neighbors for Unsupervised Time Series Mining
2020cites this paper
Development of methods for analyzing patterns of current consumption in a system for wireless monitoring the effectiveness of metalworking production
2020cites this paper
Time Series Segmentation with Leg Analysis for Human Motion Analysis
2020cites this paper
Mining subsequent trend patterns from financial time series
2020cites this paper
Interval Feature Transformation for Time Series Classification Using Perceptually Important Points
2020cites this paper
Co-eye: a multi-resolution ensemble classifier for symbolically approximated time series
2020cites this paper
A Musical Similarity Metric based on Symbolic Aggregate Approximation
2020cites this paper
Motif Discovery Using Similarity-Constraints Deep Neural Networks
2020cites this paper
Co-eye: A Multi-resolution Symbolic Representation to TimeSeries Diversified Ensemble Classification
2020cites this paper
A Smartphone Lightweight Method for Human Activity Recognition Based on Information Theory
2020cites this paper
Semi-supervised time series classification method for quantum computing
2020cites this paper
Spatial-time motifs discovery
2020cites this paper
Latent Motif Discovery using Maximum Clique algorithms
2020cites this paper
Parameterless Semi-supervised Anomaly Detection in Univariate Time Series
2020cites this paper
SWSD: An Abnormal Detection Algorithm on Unequally Spaced Time Series for Disaster Prediction
2020cites this paper
NATSA: A Near-Data Processing Accelerator for Time Series Analysis
2020cites this paper
Multichannel Symbolic Aggregate Approximation Intelligent Icons: Application for Activity Recognition
2020cites this paper
Analysis of temporal patterns in animal movement networks
2020cites this paper
Exploiting the fine-grained similarity of a large-scale rice species using shape motif discovery
2020influential citation
Faster and simpler algorithms for finding large patterns in permutations
2019cites this paper
A survey of trajectory distance measures and performance evaluation
2019cites this paper
Exposé Master’s Thesis: A Synthetic Motif Generator
2019influential citation
Pattern detection for time series trajectories in human in the loop applications
2019cites this paper
Finding and Counting Permutations via CSPs
2019influential citation
Discovery of Time Series Motifs on Intel Many-Core Systems
2019cites this paper
Motif Density for Selecting Optimal Window Length in Motif Discovery
2019cites this paper
A Review on Time Series Motif Discovery Techniques an Application to ECG Signal Classification: ECG Signal Classification Using Time Series Motif Discovery Techniques
2019cites this paper
Topological Approach for Finding Nearest Neighbor Sequence in Time Series
2019cites this paper
Time Series Segmentation with Leg Analysis for Human Motion Analysis
2019cites this paper
Discovering All-Chain Set in Streaming Time Series
2019cites this paper
Motif Discovery and Anomaly Detection in an ECG Using Matrix Profile
2019cites this paper
Proceedings - 28. Workshop Computational Intelligence, Dortmund, 29. - 30. November 2018
2018cites this paper
GrammarViz 3.0
2018influential citation
Study on Visual Techniques of Potential Pattern Discovery for Time Series Data
2018cites this paper
Mining Rules from Real-Valued Time Series: A Relative Information-Gain-Based Approach
2018cites this paper
OS-level Side Channels without Procfs: Exploring Cross-App Information Leakage on iOS
2018cites this paper
Exposé for Student Research Project: Generator of Synthetic Time Series for Motif Discovery
2018cites this paper
Time Series Chains: A Novel Tool for Time Series Data Mining
2018cites this paper
Short-term trend prediction in financial time series data
2018cites this paper
In Search of Sustainable Design Patterns: Combining Data Mining and Semantic Data Modelling on Disparate Building Data
2018cites this paper
Modeling the Effects of Students' Interactions with Immersive Simulations using Markov Switching Systems
2018cites this paper
Exploring variable-length time series motifs in one hundred million length scale
2018cites this paper
Surgical motion analysis using discriminative interpretable patterns
2018cites this paper
A Fast Online Algorithm for Analyzing Magnitude Fluctuation of Time Series
2018cites this paper
Monitoring Range Motif on Streaming Time-Series
2018cites this paper
Evaluation of Similarity Measures for Shift-Invariant Image Motif Discovery
2018cites this paper
A Real Time Anomaly Detection Method Based on Variable N-Gram for Flight Data
2018cites this paper
Introducing time series chains: a new primitive for time series data mining
2018influential citation
Towards a Near Universal Time Series Data Mining Tool: Introducing the Matrix Profile
2018cites this paper
Representation learning in multi-dimensional clinical timeseries for risk and event prediction
2017cites this paper
A variable-length motifs discovery method in time series using hybrid approach
2017cites this paper
Matrix Profile VI: Meaningful Multidimensional Motif Discovery
2017cites this paper
How to Quantify the Impact of Lossy Transformations on Event Detection
2017cites this paper
Characterizing Product Lifecycle in Online Marketing: Sales, Trust, Revenue, and Competition Modeling
2017cites this paper
Matrix Profile VII: Time Series Chains: A New Primitive for Time Series Data Mining (Best Student Paper Award)
2017influential citation
Univariate and Multivariate Time Series Manifold Learning
2017influential citation
Towards a Plug-and-Play and Holistic Data Mining Framework for Understanding and Facilitating Operations in Smart Buildings
2017cites this paper
Research Activity Classification based on Time Series Bibliometrics
2017influential citation
Content-Based Multimedia Analytics: Rethinking the Speed and Accuracy of Information Retrieval for Threat Detection
2017cites this paper
Motif-based Rule Discovery for Predicting Real-valued Time Series
2017influential citation
Combining Unsupervised Anomaly Detection and Neural Networks for Driver Identification
2017cites this paper
Clustering Distributed Short Time Series with Dense Patterns
2017cites this paper
Mining electrical meter data to predict principal building use, performance class, and operations strategy for hundreds of non-residential buildings
2017cites this paper