Contrasting Traditional and LLMs-Based Approaches For Holistic Analysis of Disparate Data Types

Abhishek Santra, Sharma Chakravarthy

Published 2025 in Proceedings of the 2025 International Conference on Artificial Intelligence, Big Data, Computing and Data Communication Systems

ABSTRACT

Research has addressed the modeling and analysis of individual data types fairly successfully, whether for mining, computing objectives, or knowledge discovery. However, handling disparate data types holistically is challenging and can benefit from new models and/or approaches. Holistic analysis also differs significantly from traditional data mining/analysis, which deals with a specific type of knowledge discovery using specific algorithms (e.g., clustering, association rule mining) on specific data types and formats. In contrast, holistic analysis is a broader umbrella concept and is likely to need an ensemble of modeling and analysis alternatives. Hence, the modeling and analysis of multiple data types with different characteristics requires a suite of existing approaches and their extensions, as well as new ones. When dealing with diverse complex data, all aspects (data preparation, modeling, analysis, drill-down, and visualization) become important for understanding the data as well as the knowledge discovered or objectives computed. In this position paper, we present our vision for the holistic modeling of data of different types from different sources to perform analysis and knowledge discovery. This can be seen as the first step towards big data analysis. The goal is to accommodate multiple data types (structured, unstructured, image/video, and natural-language text) using models that are amenable to expressive analysis and/or to infer knowledge using scalable approaches such as LLMs (Large Language Models). In this paper, we contrast two significantly different approaches: i) the use of graph and multilayer network (MLN) data models (termed the traditional approach) and ii) the use of LLMs (termed the AI-induced approach).
After presenting the motivation and the current state of the art, we elaborate on the details of the traditional approach, which we understand better, and contrast it with the AI-induced LLM-based approach. We highlight the challenges and, through two different representative applications, show how these two approaches can provide two different (and even orthogonal) paths towards information integration/fusion and the accompanying holistic analysis.

PUBLICATION RECORD

Publication date
2025-11-25
Identifiers
DOI 10.1145/3759023.3759115
Source metadata
Semantic Scholar

REFERENCES

Facilitating Holistic Evaluations with LLMs: Insights from Scenario-Based Experiments (2024)
YOLOv10: Real-Time End-to-End Object Detection (2024)
Substructure Discovery in Heterogeneous Multilayer Networks (2024)
YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information (2024)
Efficient community detection in multilayer networks using boolean compositions (2023)
Video Understanding With Large Language Models: A Survey (2023, influential reference)
YOLOv7: Trainable Bag-of-Freebies Sets New State-of-the-Art for Real-Time Object Detectors (2022)
What factors affect consumers’ dining sentiments and their ratings: Evidence from restaurant online review data (2021)
From Base Data To Knowledge Discovery - A Life Cycle Approach - Using Multilayer Networks (2021)
An improved YOLO-based road traffic monitoring system (2021)
A Survey of the Usages of Deep Learning for Natural Language Processing (2020)
Multi-Scale Multi-View Deep Feature Aggregation for Food Recognition (2020)
A Survey on Deep Learning for Multimodal Data Fusion (2020)
MIRIS: Fast Object Track Queries in Video (2020)
Probabilistic Databases for All (2020)
Ontology-Driven Food Category Classification in Images (2019)
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding (2019)
VidCEP: Complex Event Processing Framework to Detect Spatiotemporal Patterns in Video Streams (2019)
Real-Time Video Analytics: The Killer App for Edge Computing (2017)
Efficient Community Re-creation in Multilayer Networks Using Boolean Operations (2017)
HUBify: Efficient Estimation of Central Entities Across Multiplex Layer Compositions (2017)
Sentiment analysis using product review data (2015)
SVQL: A SQL Extended Query Language for Video Databases (2015)
Adapting Stream Processing Framework for Video Analysis (2015)
Sentiment Analysis of Short Informal Texts (2014)
New methods for MRI denoising based on sparseness and self-similarity (2012)
MiMAG: mining coherent subgraphs in multi-layer graphs with edge labels (2012)
C-SPARQL: a Continuous Query Language for RDF Data Streams (2010)
Multimodal fusion for multimedia analysis: a survey (2010)
A Survey of Text Mining Techniques and Applications (2009)
A framework for supporting quality of service requirements in a data stream management system (2005)
PLACE: A Query Processor for Handling Real-time Spatio-temporal Data Streams (2004)
Evaluation of shape similarity measurement methods for spine X-ray images (2004)
SnoopIB: Interval-based event specification and detection for active databases (2003)
A comparison of similarity measures for use in 2-D-3-D medical image registration (1998)
Fundamentals of Database Systems, 2nd Edition (1994)
The entity-relationship model: toward a unified view of data (1975)
A Relational Model for Large Shared Data Banks (1970)
A relational model of data for large shared data banks (1970)