I am a Research Scientist in the Dept. of Computer Science & Engineering at The Ohio State University. I am currently working with Dr. Dhabaleswar K. Panda in the Network-Based Computing Lab.

My areas of interest include parallel computer architecture, network-based computing, exascale computing, network-topology-aware computing, QoS, power-aware LAN-WAN communication, fault tolerance, virtualization, high-performance job startup, InfiniBand network analysis and monitoring, and cloud computing.

I have extensive experience in designing, developing and deploying solutions targeted at multiple HPC domains. I have published over 50 papers in major journals and international conferences. I have also served as Program Chair/Co-Chair and Program Committee Member for several international conferences and workshops. I have delivered a large number of tutorials, presentations and invited talks worldwide. I lead the design, development, testing and distribution of the MVAPICH2, MVAPICH2-X and OSU InfiniBand Network Analysis and Monitoring (OSU INAM) software stacks.


Currently, I am working on the following projects:

MVAPICH: MPI over InfiniBand, Omni-Path, Ethernet/iWARP, and RoCE

High-Performance Big Data (HiBD)

OSU InfiniBand Network Analysis and Monitoring (OSU INAM) Tool

Design of Scalable Data-Centers over InfiniBand and iWARP

Design and Evaluation of Communications Protocols over InfiniBand WAN

I earned my Bachelor's degree in Computer Science and Engineering from the University of Kerala at the University College of Engineering, Trivandrum.



Research Publications

[Technical Reports]


2016

  • Designing MPI Library with On-Demand Paging (ODP) of InfiniBand: Challenges and Benefits, M. Li, K. Hamidouche, X. Lu, H. Subramoni, J. Zhang, and D. K. Panda, The International Conference for High Performance Computing, Networking, Storage and Analysis (Supercomputing '16), Nov 2016
  • INAM2: InfiniBand Network Analysis & Monitoring with MPI, H. Subramoni, A. Augustine, M. Arnold, J. Perkins, X. Lu, K. Hamidouche, and D. K. Panda, International Supercomputing Conference (ISC '16), Jun 2016
  • Exploiting Maximal Overlap for Non-Contiguous Data Movement Processing on Modern GPU-enabled System, C. Chu, K. Hamidouche, A. Venkatesh, D. Banerjee, H. Subramoni, and D. K. Panda, The 30th IEEE International Parallel & Distributed Processing Symposium (IPDPS '16), May 2016
  • SHMEMPMI - Shared Memory based PMI for Improved Performance and Scalability, S. Chakraborty, H. Subramoni, J. Perkins, and D. K. Panda, 16th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid'16), May 2016

2015

  • Offloaded GPU Collectives using CORE-Direct and CUDA Capabilities on IB Clusters, A. Venkatesh, K. Hamidouche, H. Subramoni, and D. K. Panda, 22nd IEEE International Conference on High Performance Computing (HiPC'15), Dec 2015
  • GPU-Aware Design, Implementation, and Evaluation of Non-blocking Collective Benchmarks, A. Awan, K. Hamidouche, A. Venkatesh, J. Perkins, H. Subramoni, and D. K. Panda, Proceedings of the 22nd European MPI Users' Group Meeting (EuroMPI '15), Sep 2015
  • High Performance MPI Datatype Support with User-mode Memory Registration: Challenges, Designs and Benefits, M. Li, H. Subramoni, K. Hamidouche, X. Lu, and D. K. Panda, IEEE International Conference on Cluster Computing (Cluster'15), Sep 2015
  • Exploiting GPUDirect RDMA in Designing High Performance OpenSHMEM for NVIDIA GPU Clusters, K. Hamidouche, A. Venkatesh, A. Awan, H. Subramoni, and D. K. Panda, IEEE International Conference on Cluster Computing (Cluster'15), Sep 2015
  • Impact of InfiniBand DC Transport Protocol on Energy Consumption of All-to-all Collective Algorithms, H. Subramoni, A. Venkatesh, K. Hamidouche, K. Tomko, and D. K. Panda, 23rd International Symposium on High Performance Interconnects (HotI'15), Aug 2015
  • Designing Non-Blocking Personalized Collectives with Near Perfect Overlap for RDMA-Enabled Clusters, H. Subramoni, A. Awan, K. Hamidouche, D. Pekurovsky, A. Venkatesh, S. Chakraborty, K. Tomko, and D. K. Panda, International Supercomputing Conference (ISC'15), Jul 2015
  • On-demand Connection Management for OpenSHMEM and OpenSHMEM+MPI, S. Chakraborty, H. Subramoni, J. Perkins, A. Awan, and D. K. Panda, High-level Parallel Programming Models and Supportive Environments (HIPS'15), Published in Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium Workshop (IPDPSW '15), May 2015
  • Non-blocking PMI Extensions for Fast MPI Startup, S. Chakraborty, H. Subramoni, A. Moody, A. Venkatesh, J. Perkins, and D. K. Panda, IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid'15), May 2015

2014

  • A High Performance Broadcast Design with Hardware Multicast and GPUDirect RDMA for Streaming Applications on InfiniBand Clusters, A. Venkatesh, H. Subramoni, K. Hamidouche, and D. K. Panda, IEEE International Conference on High Performance Computing (HiPC '14), Dec 2014
  • Designing Scalable Out-of-core Sorting with Hybrid MPI+PGAS Programming Models, J. Jose, S. Potluri, H. Subramoni, X. Lu, K. Hamidouche, K. Schulz, H. Sundar, and D. K. Panda, International Conference on Partitioned Global Address Space Programming Models (PGAS '14), Oct 2014
  • H. Subramoni, K. C. Kandalla, J. Jose, K. Tomko, K. Schulz, D. Pekurovsky and D. K. Panda, Designing Topology Aware Communication Schedules for Alltoall Operations in Large InfiniBand Clusters, Int'l Conference on Parallel Processing (ICPP '14), October 2014.
  • S. Chakraborty, H. Subramoni, J. Perkins, A. Moody, M. Arnold, and D. K. Panda, PMI Extensions for Scalable MPI Startup, EuroMPI/ASIA 2014, September 2014.
  • H. Subramoni, K. Hamidouche, A. Venkatesh, S. Chakraborty, and D. K. Panda, Designing MPI Library with Dynamic Connected Transport (DCT) of InfiniBand: Early Experiences, International Supercomputing Conference (ISC'14), June 2014.

2013

  • S. Potluri, D. Bureddy, K. Hamidouche, A. Venkatesh, K. Kandalla, H. Subramoni and D. K. Panda, MVAPICH-PRISM: A Proxy-based Communication Framework using InfiniBand and SCIF for Intel MIC Clusters, Int'l Conference for High Performance Computing, Networking, Storage and Analysis (SC '13), November 2013.
  • K. C. Kandalla, H. Subramoni, K. Tomko, D. Pekurovsky and D. K. Panda, A Novel Functional Partitioning Approach to Design High-Performance MPI-3 Non-Blocking Alltoallv Collective on Multi-core Systems, Int'l Conference on Parallel Processing (ICPP '13), October 2013.
  • X. Lu, N. Islam, M. Wasi-ur-Rahman, J. Jose, H. Subramoni, H. Wang and D. K. Panda, High-Performance Design of Hadoop RPC with RDMA over InfiniBand, Int'l Conference on Parallel Processing (ICPP '13), October 2013.
  • H. Subramoni, D. Bureddy, K. Kandalla, K. Schulz, B. Barth, J. Perkins, M. Arnold and D. K. Panda, Design of Network Topology Aware Scheduling Services for Large InfiniBand Clusters, IEEE Cluster (Cluster '13), September 2013.
  • K. Hamidouche, S. Potluri, H. Subramoni, K. Kandalla and D. K. Panda, MIC-RO: Enabling Efficient Remote Offload on Heterogeneous Many Integrated Core (MIC) Clusters with InfiniBand, Int'l Conference on Supercomputing (ICS '13), June 2013.
  • S. Potluri, D. Bureddy, H. Wang, H. Subramoni and D. K. Panda, Extending OpenSHMEM for GPU Computing, Int'l Parallel and Distributed Processing Symposium (IPDPS '13), May 2013.

2012

  • H. Subramoni, S. Potluri, K. Kandalla, B. Barth, J. Vienne, J. Keasler, K. Tomko, K. Schulz, A. Moody and D. K. Panda, Design of a Scalable InfiniBand Topology Service to Enable Network-Topology-Aware Placement of Processes, Int'l Conference for High Performance Computing, Networking, Storage and Analysis (SC '12), November 2012. Best Paper and Best Student Paper Finalist.
  • N. S. Islam, M. W. Rahman, J. Jose, R. Rajachandrasekar, H. Wang, H. Subramoni, C. Murthy and D. K. Panda, High Performance RDMA-Based Design of HDFS over InfiniBand, Int'l Conference for High Performance Computing, Networking, Storage and Analysis (SC '12), November 2012.
  • R. Rajachandrasekar, J. Jaswani, H. Subramoni and D. K. Panda, Minimizing Network Contention in InfiniBand Clusters with a QoS-Aware Data-Staging Framework, IEEE Cluster (Cluster '12), September 2012.
  • K. Kandalla, A. Buluc, H. Subramoni, K. Tomko, J. Vienne, L. Oliker and D. K. Panda, Can Network-Offload based Non-Blocking Neighborhood MPI Collectives Improve Communication Overheads of Irregular Graph Algorithms? Int'l Workshop on Parallel Algorithm and Parallel Software (IWPAPS12), held in conjunction with IEEE Cluster (Cluster '12), September 2012.
  • H. Subramoni, J. Vienne and D. K. Panda, A Scalable InfiniBand Network-Topology-Aware Performance Analysis Tool for MPI, Int'l Workshop on Productivity and Performance (Proper '12), held in conjunction with EuroPar, August 2012.
  • J. Huang, X. Ouyang, J. Jose, M. W. Rahman, H. Wang, M. Luo, H. Subramoni, C. Murthy and D. K. Panda, High-Performance Design of HBase with RDMA over InfiniBand, Int'l Parallel and Distributed Processing Symposium (IPDPS '12), May 2012.
  • K. Kandalla, U. Yang, J. Keasler, T. Kolev, A. Moody, H. Subramoni, K. Tomko, J. Vienne and D. K. Panda, Designing Non-blocking Allreduce with Collective Offload on InfiniBand Clusters: A Case Study with Conjugate Gradient Solvers, Int'l Parallel and Distributed Processing Symposium (IPDPS '12), May 2012.
  • S. P. Raikar, H. Subramoni, K. Kandalla, J. Vienne and D. K. Panda, Designing Network Failover and Recovery in MPI for Multi-Rail InfiniBand Clusters, Int'l Workshop on System Management Techniques, Processes, and Services (SMTPS), in conjunction with Int'l Parallel and Distributed Processing Symposium (IPDPS '12), May 2012.
  • J. Jose, H. Subramoni, K. Kandalla, M. W. Rahman, H. Wang, S. Narravula and D. K. Panda, Scalable Memcached design for InfiniBand Clusters using Hybrid Transports, Int'l Symposium on Cluster, Cloud, and Grid Computing (CCGrid 2012), May 2012.

2011

  • H. Subramoni, K. Kandalla, J. Vienne, S. Sur, B. Barth, K. Tomko, R. McLay, K. Schulz and D. K. Panda, Design and Evaluation of Network Topology-/Speed-Aware Broadcast Algorithms for InfiniBand Clusters, IEEE Cluster '11, Sept. 2011.
  • J. Jose, H. Subramoni, M. Luo, M. Zhang, J. Huang, M. W. Rahman, N. S. Islam, X. Ouyang, H. Wang, S. Sur and D. K. Panda, Memcached Design on High Performance RDMA Capable Interconnects, Int'l Conference on Parallel Processing (ICPP '11), Sept. 2011.
  • N. Dandapanthula, H. Subramoni, J. Vienne, K. Kandalla, S. Sur, D. K. Panda, and R. Brightwell, INAM - A Scalable InfiniBand Network Analysis and Monitoring Tool, 4th Int'l Workshop on Productivity and Performance (PROPER 2011), in conjunction with EuroPar, Aug. 2011.
  • K. Kandalla, H. Subramoni, J. Vienne, K. Tomko, S. Sur and D. K. Panda, Designing Non-blocking Broadcast with Collective Offload on InfiniBand Clusters: A Case Study with HPL, Hot Interconnects '11, Aug. 2011.
  • K. Kandalla, H. Subramoni, K. Tomko, D. Pekurovsky, S. Sur and D. K. Panda, High-Performance and Scalable Non-Blocking All-to-All with Collective Offload on InfiniBand Clusters: A Study with Parallel 3D FFT, Int'l Supercomputing Conference (ISC), June 2011.

2010

2009

2008



subramon@cse.ohio-state.edu