MUD: Towards a Large-Scale and Noise-Filtered UI Dataset for Modern Style UI Modeling

Sidong Feng,Suyu Ma,Han Wang,David Kong,Chunyang Chen

Published 2024 in International Conference on Human Factors in Computing Systems

ABSTRACT

The importance of computational modeling of mobile user interfaces (UIs) is undeniable. However, these require a high-quality UI dataset. Existing datasets are often outdated, collected years ago, and are frequently noisy with mismatches in their visual representation. This presents challenges in modeling UI understanding in the wild. This paper introduces a novel approach to automatically mine UI data from Android apps, leveraging Large Language Models (LLMs) to mimic human-like exploration. To ensure dataset quality, we employ the best practices in UI noise filtering and incorporate human annotation as a final validation step. Our results demonstrate the effectiveness of LLMs-enhanced app exploration in mining more meaningful UIs, resulting in a large dataset MUD of 18k human-annotated UIs from 3.3k apps. We highlight the usefulness of MUD in two common UI modeling tasks: element detection and UI retrieval, showcasing its potential to establish a foundation for future research into high-quality, modern UIs.

PUBLICATION RECORD

Publication year
2024
Venue
International Conference on Human Factors in Computing Systems
Publication date
2024-05-11
Fields of study
Computer Science
Identifiers
DOI 10.1145/3613904.3642350 arXiv 2405.07090
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

WebUI: A Dataset for Enhancing Visual UI Understanding with Web Semantics
2023cited by this paper
Towards Efficient Record and Replay: A Case Study in WeChat
2023cited by this paper
Unveiling the Tricks: Automated Detection of Dark Patterns in Mobile Applications
2023cited by this paper
Video2Action: Reducing Human Interactions in Action Annotation of App Tutorial Videos
2023cited by this paper
LLaMA: Open and Efficient Foundation Language Models
2023cited by this paper
Read It, Don't Watch It: Captioning Bug Recordings Automatically
2023cited by this paper
A Large-Scale Longitudinal Analysis of Missing Label Accessibility Failures in Android Apps
2022cited by this paper
Auto-Icon+: An Automated End-to-End Code Generation Tool for Icon Designs in UI Development
2022cited by this paper
Gallery D.C.: Auto-created GUI Component Gallery for Design Search and Knowledge Discovery
2022cited by this paper
PaLM: Scaling Language Modeling with Pathways
2022cited by this paper
GANSpiration: Balancing Targeted and Serendipitous Inspiration in User Interface Design with Style-Based Generative Adversarial Network
2022cited by this paper
Learning to Denoise Raw Mobile UI Layouts for Improving Datasets at Scale
2022cited by this paper
Guided Bug Crush: Assist Manual GUI Testing of Android Apps via Hint Moves
2022cited by this paper
Efficiency Matters: Speeding Up Automated Testing with GUI Rendering Inference
2022cited by this paper
Fill in the Blank: Context-aware Automated Text Input Generation for Mobile GUI Testing
2022cited by this paper
Enabling Conversational Interaction with Mobile UI using Large Language Models
2022cited by this paper
Psychologically-inspired, unsupervised inference of perceptual groups of GUI widgets from GUI images
2022cited by this paper
GIFdroid: An Automated Light-weight Tool for Replaying Visual Bug Reports
2022cited by this paper
Screen2Vec: Semantic Embedding of GUI Screens and GUI Components
2021cited by this paper
GIFdroid: Automated Replay of Visual Bug Reports for Android Apps
2021cited by this paper
Large-Scale Modeling of Mobile User Click Behaviors Using Deep Learning
2021cited by this paper
Screen2Words: Automatic Mobile UI Summarization with Multimodal Learning
2021cited by this paper
Auto-Icon: An Automated Code Generation Tool for Icon Designs Assisting in UI Development
2021cited by this paper
VINS: Visual Search for Mobile User Interface Design
2021cited by this paper
Screen Recognition: Creating Accessibility Metadata for Mobile Applications from Pixels
2021cited by this paper
Enrico: A Dataset for Topic Modeling of Mobile UI Designs
2020cited by this paper
Mapping Natural Language Instructions to Mobile UI Action Sequences
2020influential reference
Language Models are Few-Shot Learners
2020cited by this paper
Object detection for graphical user interface: old fashioned or deep learning or a combination?
2020cited by this paper
Widget Captioning: Generating Natural Language Description for Mobile User Interface Elements
2020cited by this paper
UIED: a hybrid tool for GUI element detection
2020cited by this paper
From Lost to Found
2020cited by this paper
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
2019cited by this paper
Swire: Sketch-based User Interface Retrieval
2019cited by this paper
Humanoid: A Deep Learning-Based Approach to Automated Black-box Android App Testing
2019cited by this paper
Gallery D.C.: Design Search and Knowledge Discovery through Auto-created GUI Component Gallery
2019influential reference
Practical GUI Testing of Android Applications Via Model Abstraction and Refinement
2019cited by this paper
StoryDroid: Automated Generation of Storyboard for Android Apps
2019cited by this paper
Guigle: A GUI Search Engine for Android Apps
2019cited by this paper
RoBERTa: A Robustly Optimized BERT Pretraining Approach
2019cited by this paper
Learning user interface element interactions
2019cited by this paper
Machine Learning-Based Prototyping of Graphical User Interfaces for Mobile Apps
2018cited by this paper
The Design of Everyday Things (Revised and Expanded Edition)
2018cited by this paper
From UI Design Image to GUI Skeleton: A Neural Machine Translator to Bootstrap Mobile GUI Implementation
2018cited by this paper
Rico: A Mobile App Dataset for Building Data-Driven Design Applications
2017cited by this paper
Guided, stochastic model-based GUI testing of Android apps
2017cited by this paper
DroidBot: A Lightweight UI-Guided Test Input Generator for Android
2017cited by this paper
ERICA: Interaction Mining Mobile Apps
2016cited by this paper
Sapienz: multi-objective automated testing for Android applications
2016cited by this paper
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
2015cited by this paper
Mobile Design Pattern Gallery: UI Patterns for Smartphone Apps
2014cited by this paper
Dynodroid: an input generation system for Android apps
2013cited by this paper
Insights into layout patterns of mobile user interfaces by an automatic analysis of android apps
2013cited by this paper
Webzeitgeist: design mining the web
2013cited by this paper
Using GUI ripping for automated testing of Android applications
2012cited by this paper
Leakage in data mining: formulation, detection, and avoidance
2011cited by this paper
Gestalt theory, engagement and interaction
2010cited by this paper
The Pascal Visual Object Classes (VOC) Challenge
2010influential reference
Prefab: implementing advanced behaviors using pixel-based reverse engineering of interface structure
2010cited by this paper
Sikuli: using GUI screenshots for search and automation
2009cited by this paper
Breadth-first crawling yields high-quality pages
2001cited by this paper

CITED BY

AutoStructGUI: Bridging Design and Implementation of GUI through Structured Layout Generation
2026cites this paper
A comparative analysis of generative AI adoption among design professionals in China and the United Kingdom: a UTAUT perspective
2026cites this paper
Controllable GUI Exploration
2025cites this paper
PixelWeb: The First Web GUI Dataset with Pixel-Wise Labels
2025cites this paper
Understanding the LLM-ification of CHI: Unpacking the Impact of LLMs at CHI through a Systematic Literature Review
2025cites this paper
Breaking Single-Tester Limits: Multi-Agent LLMs for Multi-User Feature Testing
2025cites this paper
DuetUI: A Bidirectional Context Loop for Human-Agent Co-Generation of Task-Oriented Interfaces
2025cites this paper
A Domain-Agnostic Framework for Visual Element Detection From High-Level Descriptions
2025cites this paper
UISearch: Graph-Based Embeddings for Multimodal Enterprise UI Screenshots Retrieval
2025influential citation
Early Accessibility: Automating Alt-Text Generation for UI Icons During App Development
2025cites this paper
Agent for User: Testing Multi - User Interactive Features in TikTok
2025cites this paper
Towards an Inclusive Mobile Web: A Dataset and Framework for Focusability in UI Accessibility
2025cites this paper
How CO2STLY Is CHI? The Carbon Footprint of Generative AI in HCI Research and What We Should Do About It
2025cites this paper
Examining the Impact of Large Language Models on Design: Functions, Strengths, Limitations, and Roles
2025cites this paper
AMEX: Android Multi-annotation Expo Dataset for Mobile GUI Agents
2024influential citation
AXIS: Efficient Human-Agent-Computer Interaction with API-First LLM-Based Agents
2024cites this paper
Enabling Cost-Effective UI Automation Testing with Retrieval-Based LLMs: A Case Study in WeChat
2024cites this paper
WaybackUI: A Dataset to Support the Longitudinal Analysis of User Interfaces
year unknowncites this paper