PinFM: Foundation Model for User Activity Sequences at a Billion-scale Visual Discovery Platform

Xiangyi Chen, Kousik Rajesh, Matthew Lawhon, Zelun Wang, Hanyu Li, Haomiao Li, Saurabh Vishwas Joshi, Pong Eksombatchai, Jaewon Yang, Yi-Ping Hsu, Jiajing Xu, Chuck Rosenberg

Published 2025 in ACM Conference on Recommender Systems

ABSTRACT

User activity sequences have emerged as one of the most important signals in recommender systems. We present a foundational model, PinFM, for understanding user activity sequences across multiple applications at a billion-scale visual discovery platform. We pretrain a transformer model with 20B+ parameters using extensive user activity data, then fine-tune it for specific applications, efficiently coupling it with existing models. While this pretraining-and-fine-tuning approach has been popular in other domains, such as Vision and NLP, its application in industrial recommender systems presents numerous challenges. The foundational model must be scalable enough to score millions of items every second while meeting the tight cost and latency constraints imposed by these systems. Additionally, it should capture the interactions between user activities and other features and handle new items that were not present during the pretraining stage. We developed innovative techniques to address these challenges. Our infrastructure and algorithmic optimizations, such as the Deduplicated Cross-Attention Transformer (DCAT), improved our throughput by 600% on Pinterest internal data. We demonstrate that PinFM can learn interactions between user sequences and candidate items by altering input sequences, leading to a 20% increase in engagement with new items. PinFM is now deployed to help improve the experience of more than half a billion users across various applications.

PUBLICATION RECORD

Publication year
2025
Venue
ACM Conference on Recommender Systems
Publication date
2025-07-17
Fields of study
Computer Science
Identifiers
DOI 10.1145/3705328.3748050
arXiv 2507.12704
Source metadata
Semantic Scholar

REFERENCES

360Brew: A Decoder-only Foundation Model for Personalized Ranking and Recommendation
2025, cited by this paper
SessionRec: Next Session Prediction Paradigm For Generative Sequential Recommendation
2025, cited by this paper
External Large Foundation Model: How to Efficiently Serve Trillions of Parameters for Online Ads Recommendation
2025, cited by this paper
Multi-Behavior Generative Recommendation
2024, cited by this paper
Taming the One-Epoch Phenomenon in Online Recommendation System by Two-stage Contrastive ID Pre-training
2024, cited by this paper
Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations
2024, influential reference
Text Is All You Need: Learning Language Representations for Sequential Recommendation
2023, cited by this paper
Scaling Law of Large Sequential Recommendation Models
2023, cited by this paper
FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning
2023, cited by this paper
TransAct: Transformer-based Realtime User Action Model for Recommendation at Pinterest
2023, cited by this paper
TWIN: TWo-stage Interest Network for Lifelong User Behavior Modeling in CTR Prediction at Kuaishou
2023, cited by this paper
Towards Universal Sequence Representation Learning for Recommender Systems
2022, cited by this paper
PinnerFormer: Sequence Modeling for User Representation at Pinterest
2022, cited by this paper
M6-Rec: Generative Pretrained Language Models are Open-Ended Recommender Systems
2022, cited by this paper
TorchRec: a PyTorch Domain Library for Recommendation Systems
2022, cited by this paper
FBGEMM: Enabling High-Performance Low-Precision Deep Learning Inference
2021, cited by this paper
Recommender Systems
2021, cited by this paper
On Layer Normalization in the Transformer Architecture
2020, cited by this paper
DCN V2: Improved Deep & Cross Network and Practical Lessons for Web-scale Learning to Rank Systems
2020, cited by this paper
PTUM: Pre-training User Model from Unlabeled User Behaviors via Self-supervision
2020, cited by this paper
Language Models are Unsupervised Multitask Learners
2019, cited by this paper
Self-Attentive Sequential Recommendation
2018, cited by this paper
Representation Learning with Contrastive Predictive Coding
2018, cited by this paper
Graph Convolutional Neural Networks for Web-Scale Recommender Systems
2018, cited by this paper
Deep Interest Network for Click-Through Rate Prediction
2017, cited by this paper
Recurrent Neural Networks
2015, cited by this paper
Factorization Machines
2010, cited by this paper

CITED BY

Deep Learning to Rank in Industrial Search Engines, Recommender Systems and Online Advertising: An Overview and New Perspectives
2026, cites this paper
LocalSUG: Geography-Aware LLM for Query Suggestion in Local-Life Services
2026, cites this paper
DenseRec: Revisiting Dense Content Embeddings for Sequential Transformer-based Recommendation
2025, cites this paper
PLUM: Adapting Pre-trained Language Models for Industrial-scale Generative Recommendations
2025, cites this paper