Publications

Books & Chapters

  1. Dataflow Model for Cloud Computing Frameworks in Big Data. Dong Dai, Yong Chen, and Gangyong Jia. High Performance Computing for Big Data: Methodologies and Applications. CRC Press.

Papers in Refereed Journals

  1. [ToS'22] A Study of Failure Recovery and Logging of High-Performance Parallel File Systems. Runzhou Han, Om Rameshwar Gatla, Mai Zheng, Jinrui Cao, Di Zhang, Dong Dai, Yong Chen, Jonathan Cook. Accepted to appear in ACM Transactions on Storage, 2022 [Paper]
  2. [TCC'21] Dynamic Resource Provisioning for Iterative Workloads on Apache Spark. Dazhao Cheng, Yu Wang, Dong Dai. IEEE Transactions on Cloud Computing, 2021. [Paper]
  3. [JPDC'21] I/O characteristic discovery for storage system optimizations. Jiang Zhou, Yong Chen, Dong Dai, Yu Zhuang, Weiping Wang. Journal of Parallel and Distributed Computing, 2021. [Paper]
  4. [TC'19] PRS: A Pattern-Directed Replication Scheme for Heterogeneous Object-Based Storage . Jiang Zhou, Yong Chen, Wei Xie, Dong Dai, Shuibing He, Weiping Wang. IEEE Transactions on Computers, 2019. [Paper]
  5. [TPDS'18] Managing Rich Metadata in High-Performance Computing Systems Using a Graph Model. Dong Dai, Yong Chen, Philip Carns, John Jenkins, Wei Zhang, and Robert Ross. Accepted to appear in the IEEE Transactions on Parallel and Distributed Systems, 2018. [Paper]
  6. [TCC'18] Trigger-based Incremental Data Processing with Unified Sync and Async Model. Dong Dai, Yong Chen, Dries Kimpe, and Robert Ross. Accepted to appear in Transaction on Cloud Computing, 2018. [Paper]
  7. [Parco'18] Vectorizing Disk Blocks for Efficient Storage Systems via Deep Learning. Dong Dai, Forrest Sheng Bao, Jiang Zhou, Xuanhua Shi, and Yong Chen. International Journal of Parallel Computing, 2018. [Paper]
  8. [Parco'16] An Asynchronous Traversal Engine for Graph-Based Rich Metadata Management. Dong Dai, Philip Carns, Robert Ross, John Jenkins, Nicholas Muirheada, and Yong Chen. International Journal of Parallel Computing, 2016. [Paper]
  9. [TCBB'16] Analyzing Large Biological Datasets in Bioinformatics with Maximal Information Coefficient . Chao Wang, Dong Dai, Xi Li, Aili Wang, and Xuehai Zhou. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 2016. [Paper]

Papers in Refereed Conference Proceedings

  1. [HPDC'22] SchedInspector: A Batch Job Scheduling Inspector Using Reinforcement Learning. Di Zhang, Dong Dai, Bing Xie. Accepted to appear in the 31st International ACM Symposium on High-Performance Parallel and Distributed Computing, 2022 [Paper] [Slides/PPT] [Code]
  2. [CCGRID'22] VCSR: Mutable CSR Graph Format Using Vertex-Centric Packed Memory Array. Abdullah Al Raqibul Islam, Dong Dai, Dazhao Cheng. Accepted to appear in The 22nd IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing, 2022 [Paper] [Slides/PDF] [Code]
  3. [HotStorage'21] SentiLog: Anomaly Detecting on Parallel File Systems via Log-based Sentiment Analysis. Di Zhang, Dong Dai, Runzhou Han, Mai Zheng. In Proceedings of the 13th ACM Workshop on Hot Topics in Storage and File Systems (HotStorage '21), 2021. (Best Paper Nominee) [Paper][Slides]
  4. [SC'20] RLScheduler: An AutomatedHPC Batch Job Scheduler Using Reinforcement Learning. Di Zhang, Dong Dai, Youbiao He, Forrest Sheng Bao, Bing Xie. Accepted to appear in the International Conference for High Performance Computing, Networking, Storage and Analysis, 2020. [Paper][Code][Slides]
  5. [MSST'20] A Performance Study of Optane Persistent Memory: From Indexing Data Structures’ Perspective. Abdullah Al Raqibul Islam, Anirudh Narayanan, Christopher York, Dong Dai. Accepted to appear in the 36th International Conference on Massive Storage Systems and Technology, 2020. [Paper][Slides][Code]
  6. [PPoPP'20] Understand the Overheads of Storage Data Structures on Persistent Memory (Poster). Abdullah Al Raqibul Islam, Dong Dai. In 25th ACM SIGPLAN Symposium on Principle and Practice of Parallel Programming, 2020. [Paper][Code][Slides]
  7. [MSST'19] A Performance Study of Lustre File System Checker: Bottlenecks and Potentials. Dong Dai, Om Rameshwar Gatla, and Mai Zheng. Accepted to appear in the 35th International Conference on Massive Storage Systems and Technology, 2019. [Paper][Slides]
  8. [CLOUD'18] I/O Characteristics Discovery in Cloud Storage Systems. Jiang Zhou, Dong Dai, Yu Mao, Xin Chen, Yu Zhuang, and Yong Chen. In 2018 IEEE 11th International Conference on Cloud Computing, 2018.
  9. [ICS'18] PFault: A General Framework for Analyzing the Reliability of High-Performance Parallel File Systems. Jinrui Cao, Om Rameshwar Gatla, Mai Zheng, Dong Dai, Vidya Eswarappa, Yan Mu and Yong Chen. In the proceedings of the 32nd ACM/SIGARCH International Conference on Supercomputing, 2018. (acceptance rate: 36/193=18.7%) [Paper]
  10. [CCGrid'18] AKIN: A Streaming Graph Partitioning Algorithm for Distributed Graph Storage Systems. Wei Zhang, Yong Chen, and Dong Dai. In the proceedings of the 18th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, 2018. (acceptance rate: 20.8%). [Paper]
  11. [PACT'17] Lightweight Provenance Service for High Performance Computing. Dong Dai, Yong Chen, Philip Carns, John Jenkins, and Robert Ross. In the proceedings of the 26th International Conference on Parallel Architectures and Compilation Techniques, 2017. (acceptance rate: 23%). [Paper][Slides][Codes]
  12. [HPDC'17] IOGP: An Incremental Online Graph Partitioning Algorithm for Distributed Graph Databases. Dong Dai, Wei Zhang, and Yong Chen. In the proceedings of the 26th ACM International Symposium on High Performance Parallel and Distributed Computing, 2017. (acceptance rate: 19%). [Paper][Slides] [Flyer] [Codes]
  13. [CCGrid'17] Pattern-Directed Replication Scheme for Heterogeneous Object-based Storage. Jiang Zhou, Wei Xie, Dong Dai, and Yong Chen. In The proceedings of the 17th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, 2017 [Paper]
  14. [PDSW-DISCS'16] A Generic Framework for Testing Parallel File Systems. Jinrui Cao, Simeng Wang, Dong Dai, Mai Zheng, and Yong Chen. In the proceedings of joint international workshop on Parallel Data Storage and Data Intensive Scalable Computing Systems held in Conjunction with SC'16, (acceptance rate: 27%). [Paper]
  15. [Cluster'16]GraphMeta: A Graph-based Engine for Managing Large-Scale HPC Rich Metadata. Dong Dai, Yong Chen, Philip Carns, John Jenkins, Wei Zhang, and Robert Ross. In Proceedings of the IEEE International Conference on Cluster Computing, 2016. (acceptance rate: 24.1%). [Paper] [Slides] [Video] [Codes]
  16. [P2S2'16] Block2Vec: A Deep Learning Strategy on Mining Block Correlations in Storage Systems. Dong Dai, Forrest Sheng Bao, Jiang Zhou, and Yong Chen. In proceedings of the ninth International Workshop on Parallel Programming Models and Systems Software for High-End Computing, held in conjunction with the 45th International Conference on Parallel Processing (ICPP), 2016 [Paper] [Slides] [Codes]
  17. [P2S2'16] Log-assisted Straggler-aware I/O Scheduler for High-End Computing. Neda Tavakoli, Dong Dai, and Yong Chen. In proceedings of the ninth International Workshop on Parallel Programming Models and Systems Software for High-End Computing, held in conjunction with the 45th International Conference on Parallel Processing (ICPP), 2016 [Paper] [Slides]
  18. [Cluster'15 ]GraphTrek: Asynchronous Graph Traversal for Property Graph Based Metadata Management. Dong Dai, Pilip Carns, Robert Ross, John Jenkins, Kyle Blauer, and Yong Chen. In proceedings of the IEEE International Conference on Cluster Computing, 2015. (acceptance rate: 38/157=24.2%). [Paper]
  19. [SC'14] Two-Choice Randomized Dynamic I/O Scheduler for Object Storage Systems . Dong Dai, Yong Chen, Dries Kimpe, and Robert B. Ross. In proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, 2014. (acceptance rate: 82/394=20.8%). [Paper]
  20. [PDSW'14] Using Property Graphs for Rich Metadata Management in HPC Systems. Dong Dai, Robert B. Ross, Philip Carns, Dries Kimpe, and Yong Chen. In 9th Parallel Data Storage Workshop held in Conjunction with SC14, 2014. [Paper] [Slides]
  21. [BigData'14] Provenance-Based Object Storage Prediction Scheme for Scientific Big Data Applications . Dong Dai, Yong Chen, Dries Kimpe, and Robert B. Ross. In proceedings of the 2014 IEEE International Conference on Big Data, (acceptance rate: 49/264=18.6%). [Paper]
  22. [HPDC'14] Domino: An Incremental Computing Framework in Cloud with Eventual Synchronization. Dong Dai, Xuehai Zhou, Dries Kimpe, Robert B. Ross and Yong Chen. In proceedings of the 23rd ACM International Symposium on High-Performance Parallel and Distributed Computing, 2014. Short paper (acceptance rate: 49 full papers and 57 short papers accepted out of 264 complete submissions). [Paper]
  23. [CLUSTERW'12] Sedna: A Memory Based Key-Value Storage System for Realtime Processing in Cloud. Dong Dai, Xi Li, Chao Wang, Mingming Sun, Xuehai Zhou. Cluster Computing Workshops, 2012. [Paper]

Technical Reports and Other Publications

  1. Prototyping, Testing, and Evaluating Scaling Incremental Applications and Frameworks in Cloud. Dong Dai, Yong Chen, and Robert B. Ross. NSFCloud Workshop (Position Paper), December 11-12, 2014, Arlington, VA.