[BANANA] Berkeley Lab - Scientific Computing Seminar, September 21,
2006
Parry Husbands
pjrhusbands at lbl.gov
Mon Sep 18 22:21:05 PDT 2006
Date: Thursday, September 21, 2006
Time: 1:00pm-2:00pm
Location: Building 50F-1647
Seminar Speaker: Yijian Wang
Department of Electrical and Computer Engineering
Northeastern University
http://www.ece.neu.edu/students/yiwang/
Title: Accelerating I/O-Bound Parallel Scientific Applications
Abstract:
In the area of high performance parallel computation, there is a growing
need to process very large, complex datasets. When executing these
workloads
on cluster-based systems, performance cannot scale by simply increasing the
number of compute nodes. To effectively exploit parallel resources, we
need to
parallelize file I/O. The potential impact of exploiting parallel I/O
grows as the gap
between CPU and disk speeds continues to increase. While parallel I/O
middleware
systems (e.g., MPI I/O) provide users with environments where large
datasets can
be shared among multiple distributed processes, the performance of
I/O-bound
applications depends heavily on how the data is accessed and where the
data is
physically located on disk. I/O operations need to be parallelized both at
the
application level and at the disk level.
In this talk, we present an I/O partitioning scheme that utilizes both
local and centralized
I/O storage within a cluster to improve I/O parallelism and scalability.
We characterize
file access patterns by studying I/O profiles. We propose and evaluate a
new file-partitioning
algorithm to optimize data layout in cluster I/O subsystems. We also
present a workload
balancing mechanism that targets on heterogeneous clusters and tunes
workload assignments
to adapt the underlying storage devices. By migrating I/O workloads from
heavily loaded nodes
to lightly loaded nodes, we are able to balance I/O workloads and achieve
higher parallel I/O
throughput. Experimental results show that by utilizing our optimization
approaches, I/O
performance can be improved significantly.
Sponsor of Seminar: David Skinner
More information about the BANANA
mailing list