Unicorn: Unified resource orchestration for multi-domain, geo-distributed data analytics
Abstract
Data-intensive analytics is entering the era of multi-organizational, geographically-distributed, collaborative computing, where different organizations contribute various resources, e.g., sensing, computation, storage and networking resources, to collaboratively collect, share and analyze extremely large amounts of data. This new paradigm calls for a framework to manage a large set of distributively owned heterogeneous resources, with the fundamental objective of efficient resource utilization, following the autonomy and privacy of resource owners. In this paper, we design Unicorn, the first unified framework that accomplishes this goal. The foundation of Unicorn is RSDP, an autonomous, privacy-preserving resource discovery and representation system to provide accurate resource availability information. Its core is a novel abstraction called resource vector abstraction which describes the resource availability in a set of linear constraints. In addition, Unicorn also provides a series of advanced solutions to support automatic, efficient management of resource dynamics on both supply and demand sides, including an automatic workflow transformer, an intelligent resource demand estimator and an efficient, scalable multi-resource orchestrator. Being the first unified framework for this new paradigm, Unicorn plays a fundamental role in next-generation data-intensive collaborative computing systems.
Additional Information
© 2018 IEEE. We thank Justas Balcas, Shiwei Chen, Lili Liu, Maria Spiropulu and Jean-Roch Vlimant for helpful discussion during the work. The Yale team was supported in part by NSF grant #1440745, CC*IIE Integration: Dynamically Optimizing Research Data Workflow with a Software Defined Science Network; International Technology Alliance Agreement No W911NF-16-3-0002; Google Research Award, SDN Programming Using just Minimal Abstractions; NSFC #61672385, FAST Magellan. The Tongji team was supported by China Postdoctoral Science Foundation #2017M611618. The Caltech team was supported in part by DOE/ASCR project #000219898, SDN NGenIA; DOE award #DE-AC02-07CHI1359, SENSE, FNAL PO #626507; NSF award #1246133, ANES; NSF award #1341024, CHOPIN.Additional details
- Eprint ID
- 87597
- Resolver ID
- CaltechAUTHORS:20180706-110608219
- NSF
- OAC-1440745
- Army Research Laboratory
- W911NF-16-3-0002
- National Natural Science Foundation of China
- 61672385
- Chinese Postdoctoral Science Foundation
- 2017M611618
- Department of Energy (DOE)
- 000219898
- Department of Energy (DOE)
- DE-AC02-07CHI1359
- NSF
- OAC-1246133
- NSF
- OAC-1341024
- Created
-
2018-07-06Created from EPrint's datestamp field
- Updated
-
2021-11-15Created from EPrint's last_modified field