Cross-Domain Pre-training with Language Models for Transferable Time Series Representations

Mingyue Cheng,Xiaoyu Tao,Qi Liu,Hao Zhang,Yiheng Chen,Defu Lian

Published 2024 in Web Search and Data Mining

ABSTRACT

Pre-training universal models across multiple domains to improve downstream tasks is a prevalent learning paradigm. However, there has been minimal progress in pre-training transferable models across domains for time series representation. Two key factors cause this difficulty: the limited availability of training data within each domain and the substantial differences in data characteristics between domains. To address these challenges, we present a novel framework, CrossTimeNet, designed to perform cross-domain self-supervised pre-training to benefit target tasks. Specifically, to address data scarcity, we use a pre-trained language model as the backbone network to capture the sequence dependencies of the input time series. Meanwhile, taking into account the locality of time series, we adopt the recovery of corrupted input regions as the self-supervised optimization objective. To address discrepancies in data characteristics, we introduce a novel tokenization module that converts continuous time series inputs into discrete token sequences using vector quantization. This approach facilitates the learning of transferable time series models across different domains. Extensive experiments on diverse time series tasks, including classification and forecasting, demonstrate the effectiveness of our approach. Our code is publicly available at https://github.com/Mingyue-Cheng/CrossTimeNet.
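
The two ideas named in the abstract, tokenizing a continuous series by vector quantization and corrupting a contiguous region for the model to recover, can be illustrated with a minimal sketch. This is not the CrossTimeNet implementation: the fixed three-entry codebook, the patch length, the nearest-neighbor lookup, and the helper names `make_patches`, `quantize`, `tokenize`, and `mask_span` are all assumptions chosen for illustration. A real codebook would be learned, and the backbone language model is omitted entirely.

```python
def make_patches(series, patch_len):
    """Split a 1-D series into non-overlapping patches, dropping any remainder."""
    n = len(series) // patch_len
    return [series[i * patch_len:(i + 1) * patch_len] for i in range(n)]


def quantize(patch, codebook):
    """Return the index of the codebook vector nearest to the patch
    (squared Euclidean distance)."""
    def dist(code):
        return sum((p - c) ** 2 for p, c in zip(patch, code))
    return min(range(len(codebook)), key=lambda k: dist(codebook[k]))


def tokenize(series, codebook, patch_len):
    """Map a continuous series to a sequence of discrete token ids."""
    return [quantize(p, codebook) for p in make_patches(series, patch_len)]


def mask_span(tokens, start, span_len, mask_id):
    """Replace one contiguous span with mask_id.

    Returns the corrupted sequence and a dict {position: original token},
    which is what a recovery objective would be trained to predict.
    """
    corrupted = list(tokens)
    targets = {}
    for i in range(start, min(start + span_len, len(tokens))):
        targets[i] = tokens[i]
        corrupted[i] = mask_id
    return corrupted, targets


# Hypothetical codebook: roughly "flat", "high", and "low" patches of length 2.
codebook = [[0.0, 0.0], [1.0, 1.0], [-1.0, -1.0]]
series = [0.1, -0.1, 0.9, 1.2, -0.8, -1.1, 1.0, 0.7]

tokens = tokenize(series, codebook, patch_len=2)
MASK_ID = len(codebook)  # assumed: one extra id reserved for the mask token
corrupted, targets = mask_span(tokens, start=1, span_len=2, mask_id=MASK_ID)

print(tokens)     # [0, 1, 2, 1]
print(corrupted)  # [0, 3, 3, 1]
print(targets)    # {1: 1, 2: 2}
```

Because every domain is mapped into the same discrete vocabulary, series with very different scales or sampling characteristics end up as comparable token sequences, which is the property the abstract relies on for cross-domain pre-training. Masking a contiguous span rather than isolated positions reflects the locality argument: neighboring points are highly correlated, so isolated masked positions would be too easy to interpolate.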

PUBLICATION RECORD

Publication year: 2024
Venue: Web Search and Data Mining
Publication date: 2024-03-19
Fields of study: Computer Science
Identifiers: DOI 10.1145/3701551.3703498; arXiv 2403.12372
Source metadata: Semantic Scholar

REFERENCES

Unified Training of Universal Time Series Forecasting Transformers (2024)
Exploring Adapter-based Transfer Learning for Recommender Systems: Empirical Studies and Practical Insights (2023)
Adaptive Normalization for Non-stationary Time Series Forecasting: A Temporal Slice Perspective (2023)
Bake off redux: a review and experimental evaluation of recent time series classification algorithms (2023)
Transformers in Time Series: A Survey (2022)
Understanding Masked Image Modeling via Learning Occlusion Invariant Feature (2022)
Training language models to follow instructions with human feedback (2022)
TS2Vec: Towards Universal Representation of Time Series (2021)
Time-Series Representation Learning via Temporal and Contextual Contrasting (2021)
Self-Supervised Transformer for Sparse and Irregularly Sampled Multivariate Clinical Time-Series (2021)
Is BERT a Cross-Disciplinary Knowledge Learner? A Surprising Finding of Pre-trained Models' Transferability (2021)
Unsupervised Representation Learning for Time Series with Temporal Neighborhood Coding (2021)
LoRA: Low-Rank Adaptation of Large Language Models (2021)
Autoformer: Decomposition Transformers with Auto-Correlation for Long-Term Series Forecasting (2021)
Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity (2021)
Gated Transformer Networks for Multivariate Time Series Classification (2021)
A Transformer-based Framework for Multivariate Time Series Representation Learning (2020)
Informer: Beyond Efficient Transformer for Long Sequence Time-Series Forecasting (2020)
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding (2019)
TS-CHIEF: a scalable and accurate forest algorithm for time series classification (2019)
Representation Learning with Contrastive Predictive Coding (2018)
The human infant brain: A neural architecture able to learn language (2017)
Attention is All you Need (2017)
Neural Discrete Representation Learning (2017)
Time series classification from scratch with deep neural networks: A strong baseline (2016)
The BOSS is concerned with time series classification in the presence of noise (2015)
Time Series Classification Using Multi-Channels Deep Convolutional Neural Networks (2014)
Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling (2014)
A time series forest for classification and feature extraction (2013)
SFA: a symbolic fourier approximation and index for similarity search in high dimensional datasets (2012)
Time series shapelets: a new primitive for data mining (2009)
Querying and mining of time series data: experimental comparison of representations and distance measures (2008)
Experiencing SAX: a novel symbolic representation of time series (2007)
Noise reduction in chaotic time-series data: A survey of common methods (1993)
Some Recent Advances in Forecasting and Control (1968)

CITED BY

PiXTime: A Model for Federated Time Series Forecasting with Heterogeneous Data Structures Across Nodes (2026)
CoGenCast: A Coupled Autoregressive-Flow Generative Framework for Time Series Forecasting (2026)
InstructTime++: Time Series Classification with Multimodal Language Modeling via Implicit Feature Enhancement (2026)
Time Series Forecasting as Reasoning: A Slow-Thinking Approach with Reinforced LLMs (2025)
Prioritizing Alignment Paradigms over Task-Specific Model Customization in Time-Series LLMs (2025)
From Values to Tokens: An LLM-Driven Framework for Context-aware Time Series Forecasting via Symbolic Discretization (2025)
OneCast: Structured Decomposition and Modular Generation for Cross-Domain Time Series Forecasting (2025)
Research on cross-domain large model transfer learning methods for natural language processing (2025)
Business English Intelligent Translation Model Based on Adversarial Adaptive Training and Terminology Consistency Optimization (2025)
Time Series Analysis in Frequency Domain: A Survey of Open Challenges, Opportunities and Benchmarks (2025)
Few-shot outliers classification in high-speed railway track geometry based on self-supervised transformer (2025)
MGTC: Multi-Granularity Temporal Aware Time Series Classification (2025)
TableTime: Reformulating Time Series Classification as Training-Free Table Understanding with Large Language Models (2024)
Hierarchical Multimodal LLMs with Semantic Space Alignment for Enhanced Time Series Classification (2024)
Leveraging Multi-AI Agents for Cross-Domain Knowledge Discovery (2024)
DisenTS: Disentangled Channel Evolving Pattern Modeling for Multivariate Time Series Forecasting (2024)
TimeMAE: Self-Supervised Representations of Time Series with Decoupled Masked Autoencoders (2023)