h5part - [H5part] Performance

h5part AT

Subject: H5Part development and discussion

  • From: John Biddiscombe <biddisco AT>
  • To: h5part AT
  • Subject: [H5part] Performance
  • Date: Wed, 19 Mar 2008 11:09:26 +0100 (CET)
  • List-archive: <>
  • List-id: H5Part development and discussion <>
  • Organization: cscs


I recently ran some I/O tests on a Blue Gene machine, using 32 to 2048
processors to write the data. The tests were designed to mimic the type
of usage we expect from SPH codes, which have approximately 5000 particles
per processor (about 10 million in total for the 2048-node case).
Performance using H5Part was very slow compared to a single block write
using raw HDF5 calls. The main difference was that H5Part writes each
scalar array independently (more calls to write, more cache hit/miss).
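
To make the comparison concrete, here is a minimal sketch of the two write patterns, per-array writes versus one packed block. It is not H5Part or HDF5 code: an in-memory sink that counts write calls stands in for the file, and the field list (FIELDS) is a hypothetical choice, since the set of scalar arrays is not given above. Only the particle and processor counts come from the tests described.

```python
import io
from array import array

PARTICLES_PER_RANK = 5000   # from the tests described above
RANKS = 2048                # largest processor count in the tests
# Hypothetical field list: the actual scalar arrays are an assumption.
FIELDS = ["x", "y", "z", "vx", "vy", "vz", "rho"]


class CountingSink(io.BytesIO):
    """In-memory stand-in for a file that counts write calls."""

    def __init__(self):
        super().__init__()
        self.calls = 0

    def write(self, data):
        self.calls += 1
        return super().write(data)


def make_fields(n):
    """Build one array of doubles per scalar field (dummy values)."""
    return {name: array("d", [float(i) for i in range(n)]) for name in FIELDS}


def write_per_field(sink, fields):
    """One write call per scalar array."""
    for name in FIELDS:
        sink.write(fields[name].tobytes())


def write_single_block(sink, fields):
    """Pack all scalar arrays into one buffer, then issue one write call."""
    block = array("d")
    for name in FIELDS:
        block.extend(fields[name])
    sink.write(block.tobytes())


fields = make_fields(PARTICLES_PER_RANK)
per_field_sink, block_sink = CountingSink(), CountingSink()
write_per_field(per_field_sink, fields)
write_single_block(block_sink, fields)

total_particles = PARTICLES_PER_RANK * RANKS
print("total particles:", total_particles)
print("write calls per rank:", per_field_sink.calls, "vs", block_sink.calls)
print("same bytes written:", per_field_sink.getvalue() == block_sink.getvalue())
```

Both patterns write identical bytes; only the number of calls differs, and that difference is multiplied by the number of processors. Whether call count alone explains the slowdown on a real parallel file system is not something this sketch can show.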

Has anyone done any performance tests or tuning on large numbers of
processors? Based on the results I have been getting, we will not use
H5Part on the Blue Gene (unless I can find a way of speeding it up).

Thanks. (N.B. I am in the process of collecting statistics, which I'll
happily share with you all, but I would like to save myself the trouble
if any of you have done similar studies and published interesting
workarounds, etc.)

