Future terabit networks are committed to dramatically improving big data motion between geographically dispersed HPC data centers.The scientific community takes advantage of the terabit networks such as DOE's ESnet and accelerates the trend to build a small world of collaboration between geospatial HPC data centers. It improves information and resource sharing for joint simulation and analysis between the HPC data centers. In this paper, we propose to build SCISPACE (Scientific Collaboration Workspace) for collaborative data centers. It provides a global view of information shared from multiple geo-distributed HPC data centers under a single workspace. SCISPACE supports native data-access to gain high-performance when data read or write is required in native data center namespace. It is accomplished by integrating a metadata export protocol. To optimize scientific collaborations across HPC data centers, SCISPACE implements search and discovery service. To evaluate, we configured two geo-distributed small-scale HPC data centers connected via high-speed Infiniband network, equipped with LustreFS. We show the feasibility of SCISPACE using real scientific datasets and applications. The evaluation results show average 36\% performance boost when the proposed native-data access is employed in collaborations.
SCISPACE: A Scientific Collaboration Workspace for File Systems in Geo-Distributed HPC Data Centers
Awais Khan,Taeuk Kim,Hyunki Byun,Youngjae Kim,Sungyong Park,Hyogi Sim
Published 2018 in arXiv.org
ABSTRACT
PUBLICATION RECORD
- Publication year
2018
- Venue
arXiv.org
- Publication date
2018-03-22
- Fields of study
Computer Science, Engineering, Environmental Science
- Identifiers
- External record
- Source metadata
Semantic Scholar
CITATION MAP
EXTRACTION MAP
CLAIMS
- No claims are published for this paper.
CONCEPTS
- No concepts are published for this paper.
REFERENCES
Showing 1-21 of 21 references · Page 1 of 1
CITED BY
Showing 1-1 of 1 citing papers · Page 1 of 1