I am a Research Scientist in the Dept. of Computer Science & Engineering at
the Ohio State University. I am currently working with
Dr. Dhabaleswar K. Panda in the
Network-Based Computing Lab.
My areas of interest include Parallel computer architecture,
network-based computing, exascale computing, network topology aware computing,
QoS, power-aware LAN-WAN communication, fault tolerance, virtualization, high
performance job startup, InfiniBand network analysis and monitoring, and cloud
computing.
I have extensive experience in designing, developing and deploying solutions
targeted at multiple HPC domains. I have published over 75 papers in major
journals and international conferences. I have also served as Program
Chair/Co-Chair and Program Committee Member for several international
conferences and workshops. I have delivered a large number of Tutorials,
Presentations and Invited talks Worldwide. I lead the design, development,
testing and distribution of the MVAPICH2, MVAPICH2-GDR, MVAPICH2-X and OSU InfiniBand
Network Analysis and Monitoring (OSU INAM)software stacks.
Currently, I am working on the following projects:
-
MVAPICH: MPI over InfiniBand, Omni-Path, Ethernet/iWARP, and RoCE
-
High-Performance Big Data (HiBD)
-
OSU InfiniBand Network Analysis and Monitoring (OSU INAM) Tool
-
Design of Scalable Data-Centers over InfiniBand and iWARP
-
Design and Evaluation of Communications Protocols over InfiniBand WAN
I earned my Bachelors degree in Computer Science and Engineering
from the
University of Kerala
at the
University College of Engineering,
Trivandrum.
Events and Announcements:
[
Technical Reports]
2016 ,
2015 ,
2014 ,
2013 ,
2012 ,
2011 ,
2010 ,
2009 ,
2008
-
Designing MPI Library with On-Demand Paging (ODP) of InfiniBand: Challenges
and Benefits, M. Li, K. Hamidouche, X. Lu, H. Subramoni, J. Zhang, and D. K.
Panda, The International Conference for High Performance Computing, Networking,
Storage and Analysis (Supercomputing.16), Nov 2016
-
INAM2: InfiniBand Network Analysis & Monitoring with MPI, H. Subramoni,
A. Augustine, M. Arnold, J. Perkins, X. Lu, K. Hamidouche, and D. K. Panda,
International Supercomputing Conference (ISC.16), Jun 2016
-
Exploiting Maximal Overlap for Non-Contiguous Data Movement Processing on
Modern GPU-enabled System, C. Chu, K. Hamidouche, A. Venkatesh, D. Banerjee,
H. Subramoni, and D. K. Panda, The 30th IEEE International Parallel & Distributed
Processing Symposium (IPDPS '16), May 2016
-
SHMEMPMI - Shared Memory based PMI for Improved Performance and Scalability,
S. Chakraborty, H. Subramoni, J. Perkins, and D. K. Panda, 16th IEEE/ACM
International Symposium on Cluster, Cloud and Grid Computing (CCGrid'16), May
2016
-
Offloaded GPU Collectives using CORE-Direct and CUDA Capabilities on IB
Clusters, A.Venkatesh, K. Hamidouche, H. Subramoni, and D. K. Panda, 22nd IEEE
International Conference on High Performance Computing (HiPC'15), Dec 2015
-
GPU-Aware Design, Implementation, and Evaluation of Non-blocking Collective
Benchmarks, A. Awan, K. Hamidouche, A. Venkatesh, J. Perkins, H. Subramoni, and
D. K. Panda, Proceedings of the 22nd European MPI Users' Group Meeting
(EuroMPI.15), Sep 2015
-
High Performance MPI Datatype Support with User-mode Memory Registration:
Challenges, Designs and Benefits, M. Li, H. Subramoni, K. Hamidouche, X. Lu, and
D. K. Panda, IEEE International Conference on Cluster Computing (Cluster'15),
Sep 2015
-
Exploiting GPUDirect RDMA in Designing High Performance OpenSHMEM for NVIDIA GPU
Clusters, K. Hamidouche, A. Venkatesh, A. Awan, H. Subramoni, and D. K. Panda,
IEEE International Conference on Cluster Computing (Cluster'15), Sep 2015
-
Impact of InfiniBand DC Transport Protocol on Energy Consumption of All-to-all
Collective Algorithms, H. Subramoni, A. Venkatesh, K. Hamidouche, K. Tomko, and
D. K. Panda, 23rd International Symposium on High Performance Interconnects
(HotI'15), Aug 2015
-
Designing Non-Blocking Personalized Collectives with Near Perfect Overlap for
RDMA-Enabled Clusters, H. Subramoni, A. Awan, K. Hamidouche, D. Pekurovsky, A.
Venkatesh, S. Chakraborty, K. Tomko, and D. K. Panda, International
Supercomputing Conference (ISC'15), Jul 2015
-
On-demand Connection Management for OpenSHMEM and OpenSHMEM+MPI, S. Chakraborty,
H. Subramoni, J. Perkins, A. Awan, and D. K. Panda, High-level Parallel
Programming Models and Supportive Environments (HIPS'15), Published in
Proceedings of the 2015 IEEE International Parallel and Distributed Processing
Symposium Workshop (IPDPSW '15), May 2015
-
Non-blocking PMI Extensions for Fast MPI Startup, S. Chakraborty, H. Subramoni,
A. Moody, A. Venkatesh, J. Perkins, and D. K. Panda, IEEE/ACM International
Symposium on Cluster, Cloud and Grid Computing (CCGrid'15), May 2015
-
A High Performance Broadcast Design with Hardware Multicast and GPUDirect RDMA
for Streaming Applications on Infiniband Clusters, A. Venkatesh, H. Subramoni,
K. Hamidouche, and D. K. Panda, IEEE International Conference on High
Performance Computing (HiPC '14), Dec 2014
-
Designing Scalable Out-of-core Sorting with Hybrid MPI+PGAS Programming Models,
J. Jose, S. Potluri, H. Subramoni, X. Lu, K. Hamidouche, K. Schulz, H. Sundar,
and D. K. Panda, International Conference on Partitioned Global Address Space
Programming Models (PGAS '14), Oct 2014
-
H. Subramoni, K. C. Kandalla, J. Jose, K. Tomko, K. Schulz,
D. Pekurovsky and D. K. Panda, Designing Topology Aware Communication
Schedules for Alltoall Operations in Large InfiniBand Clusters,
Int'l Conference on Parallel Processing (ICPP '14), October 2014.
-
S. Chakraborty, H. Subramoni, J. Perkins, A. Moody, M. Arnold,
and D. K. Panda, PMI Extensions for Scalable MPI Startup,
EuroMPI/ASIA 2014, September 2014.
-
H. Subramoni, K. Hamidouche, A. Venkatesh, S. Chakraborty,
and D. K. Panda, Designing MPI Library with Dynamic Connected
Transport (DCT) of InfiniBand: Early Experiences, International
Supercomputing Conference (ISC'14), June 2014.
-
S. Potluri, D. Bureddy, K. Hamidouche, A. Venkatesh, K. Kandalla,
H. Subramoni and D. K. Panda, MVAPICH-PRISM: A Proxy-based
Communication Framework using InfiniBand and SCIF for Intel MIC
Clusters, Int'l Conference on Supercomputing (SC '13), November 2013.
-
K. C. Kandalla, H. Subramoni, K. Tomko, D. Pekurovsky and D. K. Panda,
A Novel Functional Partitioning Approach to Design High-Performance MPI-3
Non-Blocking Alltoallv Collective on Multi-core Systems, Int'l Conference on
Parallel Processing (ICPP '13), October 2013.
-
X. Lu, N. Islam, M. Wasi-ur-Rahman, J. Jose, H. Subramoni, H. Wang and
D. K. Panda, High-Performance Design of Hadoop RPC with RDMA over InfiniBand,
Int'l Conference on Parallel Processing (ICPP '13), October 2013.
-
H. Subramoni, D. Bureddy, K. Kandalla, K. Schulz, B. Barth, J. Perkins,
M. Arnold and D. K. Panda, Design of Network Topology Aware Scheduling
Services for Large InfiniBand Clusters, IEEE Cluster (Cluster '13),
September 2013.
-
K. Hamidouche, S. Potluri, H. Subramoni, K. Kandalla and D. K. Panda,
MIC-RO: Enabling Efficient Remote Offload on Heterogeneous Many Integrated
Core (MIC) Clusters with InfiniBand, Int'l Conference on Supercomputing
(ICS '13), June 2013.
-
S. Potluri, D. Bureddy, H. Wang, H. Subramoni and D. K. Panda,
Extending OpenSHMEM for GPU Computing, Int'l Parallel and Distributed
Processing Symposium (IPDPS '13), May 2013.
-
H. Subramoni, S. Potluri, K. Kandalla, B. Barth, J. Vienne, J. Keasler,
K. Tomko, K. Schulz, A. Moody and D. K. Panda,
Design of a Scalable InfiniBand Topology Service to Enable
Network-Topology-Aware Placement of Processes,
Int'l Conference on Supercomputing (SC '12),
November 2012.
Best Paper and Best Student Paper Finalist.
-
N. S. Islam, M. W. Rahman, J. Jose, R. Rajachandrasekar,
H. Wang, H. Subramoni, C. Murthy and D. K. Panda,
High Performance RDMA-Based Design of HDFS over InfiniBand,
Int'l Conference on Supercomputing (SC '12),
November 2012.
-
R. Rajachandrasekar, J. Jaswani, H. Subramoni and D. K. Panda,
Minimizing Network Contention in InfiniBand Clusters with a
QoS-Aware Data-Staging Framework,
IEEE Cluster (Cluster '12),
September 2012.
-
K. Kandalla, A. Buluc, H. Subramoni, K. Tomko, J. Vienne, L. Oliker
and D. K. Panda,
Can Network-Offload based Non-Blocking Neighborhood MPI Collectives
Improve Communication Overheads of Irregular Graph Algorithms?
Int'l Workshop on Parallel Algorithm and Parallel Software (IWPAPS12),
held in conjunction with IEEE Cluster (Cluster '12),
September 2012.
-
H. Subramoni, J. Vienne and D. K. Panda,
A Scalable InfiniBand Network-Topology-Aware
Performance Analysis Tool for MPI,
Int'l Workshop on Productivity and Performance (Proper '12),
held in conjunction with EuroPar,
August 2012.
-
J. Huang, X. Ouyang, J. Jose, M. W. Rahman, H. Wang,
M. Luo, H. Subramoni, C. Murthy and D. K. Panda,
High-Performance Design of HBase with RDMA over InfiniBand,
Int'l Parallel and Distributed Processing Symposium (IPDPS '12),
May 2012.
-
K. Kandalla, U. Yang, J. Keasler, T. Kolev, A. Moody,
H. Subramoni, K. Tomko, J. Vienne and D. K. Panda,
Designing Non-blocking Allreduce with Collective Offload
on InfiniBand Clusters: A Case Study with
Conjugate Gradient Solvers
Int'l Parallel and Distributed Processing Symposium (IPDPS '12),
May 2012.
-
S. P. Raikar, H. Subramoni, K. Kandalla, J. Vienne and
D. K. Panda,
Designing Network Failover and Recovery in MPI for Multi-Rail
InfiniBand Clusters,
Int'l Workshop on System Management Techniques, Processes,
and Services (SMTPS), in conjunction with
Int'l Parallel and Distributed Processing Symposium (IPDPS '12),
May 2012.
-
J. Jose, H. Subramoni, K. Kandalla, M. W. Rahman, H. Wang,
S. Narravula and D. K. Panda,
Scalable Memcached design for InfiniBand Clusters using Hybrid Transports,
Int'l Symposium on Cluster, Cloud,
and Grid Computing (CCGrid 2012), May 2012.
-
H. Subramoni, K. Kandalla, J. Vienne, S. Sur, B. Barth, K. Tomko,
R. McLay, K. Schulz and D. K. Panda,
Design and Evaluation of Network Topology-/Speed-Aware Broadcast
Algorithms for InfiniBand Clusters,
IEEE Cluster '11, Sept. 2011.
-
J. Jose, H. Subramoni, M. Luo, M. Zhang, J. Huang, M. W. Rahman,
N. S. Islam, X. Ouyang, H. Wang, S. Sur and D. K. Panda,
Memcached Design on High Performance RDMA Capable Interconnects,
Int'l Conference on Parallel Processing (ICPP '11), Sept. 2011.
-
N. Dandapanthula, H. Subramoni, J. Vienne, K. Kandalla, S. Sur,
D. K. Panda, and R. Brightwell,
INAM - A Scalable InfiniBand Network Analysis and Monitoring Tool,
4th Int'l Workshop on Productivity and Performance (PROPER 2011),
in conjunction with EuroPar, Aug. 2011.
-
K. Kandalla, H. Subramoni, J. Vienne, K. Tomko, S. Sur
and D. K. Panda,
Designing Non-blocking Broadcast
with Collective Offload on InfiniBand Clusters: A Case Study with HPL,
Hot Interconnect '11, Aug. 2011.
-
K. Kandalla, H. Subramoni,, K. Tomko, D. Pekurovsky,
S. Sur and D. K. Panda,
High-Performance and Scalable Non-Blocking All-to-All with Collective
Offload on InfiniBand Clusters: A Study with Parallel 3D FFT,
Int'l Supercomputing Conference (ISC), June 2011.
-
H. Subramoni, P. Lai, S. Sur and D. K. Panda,
Improving Application Performance and Predictability using Multiple
Virtual Lanes in Modern Multi-Core InfiniBand Clusters,
International Conference on Parallel Processing (ICPP '10),
September 2010.
-
H. Subramoni, K. Kandalla, S. Sur and D. K. Panda,
Design and Evaluation of Generalized Collective Communication
Primitives with Overlap using ConnectX-2 Offload Engine ,
Int'l Symposium on Hot Interconnects (HotI), Aug. 2010,
Conference Slides.
-
H. Subramoni, P. Lai, R. Kettimuthu and D. K. Panda,
High Performance Data Transfer in Grid Environment Using GridFTP over
InfiniBand,
Int'l Symposium on Cluster Computing and the Grid (CCGrid '10),
May 2010,
Conference Slides.
-
K. Kandalla, H. Subramoni, A. Vishnu and D. K. Panda,
Designing Topology-Aware Collective Communication Algorithms for Large
Scale InfiniBand Clusters:
Case Studies with Scatter and Gather,
The 10th Workshop on Communication Architecture for Clusters (CAC '10),
April 2010.
-
P. Lai, H. Subramoni, S. Narravula, A. Mamidala and D. K. Panda,
Designing Efficient FTP Mechanisms for High Performance Data-Transfer
over InfiniBand,
Int'l Conference on Parallel Processing (ICPP '09), September 2009.
Conference Slides.
-
H. Subramoni, P. Lai, M. luo, Dhabaleswar K. Panda,
RDMA over Ethernet - A Preliminary Study,
Workshop on High Performance Interconnects for Distributed Computing
(HPIDC '09), September 2009.
Conference Slides.
-
H. Subramoni, M. Koop, and D. K. Panda,
Designing Next Generation Clusters: Evaluation of InfiniBand DDR/QDR
on Intel Computing Platforms,
17th Annual Symposium on High-Performance Interconnects (HotI '09),
August 2009.
Technical Report.
Conference Slides.
-
K. Kandalla, H. Subramoni, G. Santhanaraman, M. Koop and
D. K. Panda,
Designing Multi-Leader-Based Allgather Algorithms for Multi-Core
Clusters,
The 9th Workshop on Communication Architecture for Clusters (CAC '09),
May 2009.
Technical Report.
Conference Slides.
-
H. Subramoni, G. Marsh, S. Narravula, P. Lai and D.K. Panda,
Design and Evaluation of Benchmarks for Financial Applications using Advanced Message Queuing Protocol (AMQP) over InfiniBand,
Workshop on High Performance Computational Finance (In conjunction with SC '08), Austin, TX, November 2008.
Technical Report,
Conference Slides.
-
S. Narravula, H. Subramoni, P. Lai, R. Noronha and D. K. Panda
Performance of HPC Middleware over InfiniBand WAN,
Int'l Conference on Parallel Processing (ICPP '08), Portland, Oregon, September 2008.
Technical Report.
Conference Slides
subramon@cse.ohio-state.edu
|