Arif, Moiz, et al. "Application-Attuned Memory Management for Containerized HPC Workflows." Proceedings of the Proceedings of the 38th IEEE International Parallel & Distributed Processing Symposium (IPDPS). Ed. IEEE. San Francisco, California, USA: IEEE, 2024. Web.
Assogba, Kevin, Bogdan Nicolae, and M. Mustafa Rafique. "Optimizing the Training of Co-Located Deep Learning Models Using Cache-Aware Staggering." Proceedings of the In Proceedings of the 30th IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC). Ed. IEEE. Goa, India: IEEE, 2023. Web.
Maurya, Avinash, et al. "Towards Efficient I/O Pipelines using Accumulated Compression." Proceedings of the In Proceedings of the 30th IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC). Ed. IEEE. Goa, India: n.p., 2023. Web.
Assogba, Kevin, et al. "PredictDDL: Reusable Workload Performance Prediction for Distributed Deep Learning." Proceedings of the In Proceedings of the 25th IEEE International Conference on Cluster Computing (Cluster). Ed. IEEE. Santa Fe, New Mexico, USA: n.p., 2023. Web.
Maurya, Avinash, et al. "GPU-Enabled Asynchronous Multi-level Checkpoint Caching and Prefetching." Proceedings of the In Proceedings of the 32nd ACM International Symposium on High-Performance Parallel and Distributed Computing (HPDC). Ed. ACM. Orlando, Florida, USA: n.p., 2023. Web.
Maurya, Avinash, et al. "Towards Efficient Cache Allocation for High-Frequency Checkpointing." Proceedings of the In Proceedings of the 29th IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC). Ed. IEEE. Bangalore, India: n.p., 2022. Web.
Arif, Moiz, Kevin Assogba, and M. Mustafa Rafique. "Canary: Fault-tolerant FaaS for Stateful Time-sensitive Applications." Proceedings of the In Proceedings of the 35th ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis (SC). Ed. IEEE/ACM. Dallas, Texas, USA: n.p., 2022. Web.
Arif, Moiz, et al. "Exploiting CXL-based Memory for Distributed Deep Learning." Proceedings of the In Proceedings of the 51st International Conference on Parallel Processing (ICPP). Ed. ICPP. Bordeaux, France: n.p., 2022. Web.
Assogba, Kevin, et al. "On Realizing Efficient Deep Learning Using Serverless Computing." Proceedings of the In Proceedings of the 22nd IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing (CCGrid). Ed. IEEE. Taormina (Messina), Italy: n.p., 2022. Web.
Maurya, Avinash, et al. "Towards Efficient I/O Scheduling for Collaborative Multi-Level Checkpointing." Proceedings of the In Proceedings of the 29th IEEE International Symposium on the Modeling, Analysis, and Simulation of Computer and Telecommunication Systems (MASCOTS). Ed. IEEE. Virtual Conference, USA: n.p., 2021. Web.
Arif, Moiz, et al. "Infrastructure-Aware TensorFlow for Heterogeneous Datacenters." Proceedings of the In Proceedings of the 28th IEEE International Symposium on the Modeling, Analysis, and Simulation of Computer and Telecommunication Systems (MASCOTS). Ed. IEEE. Nice, France: n.p., 2020. Web.
Maurya, Avinash, et al. "CoSim: A Simulator for Co-Scheduling of Batch and On-Demand Jobs in HPC Datacenters." Proceedings of the In Proceedings of the 24th IEEE/ACM International Symposium on Distributed Simulation and Real Time Applications (DS-RT). Ed. IEEE. Prague, Czech Republic: n.p., 2020. Web.
Kwon, Minseok, et al. "CuVPP: Filter-based Longest Prefix Matching in Software Data Planes." Proceedings of the In Proceedings of the 22nd IEEE International Conference on Cluster Computing (Cluster). Ed. IEEE. Kobe, Japan: n.p., 2020. Web.
Han, Jingoo, et al. "MARBLE: A Multi-GPU Aware Job Scheduler for Deep Learning on HPC Systems." Proceedings of the In Proceedings of the 20th IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing (CCGrid). Ed. IEEE. Melbourne,, Victoria, Australia: n.p., 2020. Web.
Han, Jingoo, et al. "A Quantitative Study of Deep Learning Training on Heterogeneous Supercomputers." Proceedings of the In Proceedings of the 21st IEEE International Conference on Cluster Computing (Cluster). Ed. IEEE. Albuquerque, New Mexico, USA: n.p., 2019. Web.