[aspect-devel] [cse.ucdavis.edu #13562] Fwd: Fwd: error when writing checkpoint files

Magali Billen mibillen at ucdavis.edu
Sun Jul 8 01:19:41 PDT 2018


Hi everyone, 

My cluster is really old (GigE…), and is perhaps of a dying breed of “individual PI” clusters.
 So, that problem is not fixable until I write a grant to add nodes to a larger, newer cluster with a more modern set-up.

Bill - is MPI-IO enabled on Peleton? This cluster, or one like it, is what I would be buying nodes to add to.

Wolfgang  -  your responses help to solve another mystery of why VTU works and not checkpointing. 
I started my PRM files from a file that John Naliboff gave me (he was using my cluster with a visiting student),
 and in it the parameter “Number of grouped files” is set to zero (see below).  I had not dug into what that meant, 
but now its clear. 

Maybe the only related question, is whether it is possible to create a similar variable for Checkpointing?
If not, I guess that's just really strong motivation for me to write an IFR proposal quickly ;-) (and a proposal for time
on a national lab machine).

-Magali

# INFORMATION ON OUTPUT TO BE CREATED
subsection Postprocess
  set List of postprocessors = visualization, velocity statistics, temperature statistics

  subsection Visualization
    set List of output variables      = density, viscosity, strain rate
    set Output format                 = vtu
    set Time between graphical output = 0.10e6
    set Number of grouped files       = 0
  end
end


> On Jul 8, 2018, at 5:22 AM, Wolfgang Bangerth <bangerth at colostate.edu> wrote:
> 
> On 07/07/2018 06:45 AM, Magali Billen wrote:
>> I’ll be at CIDER the last two weeks of  July and I’ll try to talk to Rene in person about this issue and try to understand
>> more about what options might exists.  Since this is all handled by other libraries (p4est), there may be no real option.   I
>>  don’t feel like I have the expertise or experience with Aspect to wade into this on my own. Maybe after talking with Rene,
>> we can see about trying to compiling p4est with mpi-io and see what happens.
> 
> p4est can be configured to disable MPI-IO. So that's a problem that can be solved. But deal.II also uses MPI-IO, here:
> 
> https://github.com/dealii/dealii/blob/master/source/base/data_out_base.cc#L7286
> 
> This deal.II function is called from essentially all ASPECT runs:
> 
> https://github.com/geodynamics/aspect/blob/master/source/postprocess/visualization.cc#L594
> 
> The default for the number of grouped files is 16, and I suspect that most people leave it as is -- so basically everyone ends up in the `else` branch in line 598.
> 
> In other words, while I don't know whether people use checkpoint/restart frequently, pretty much everyone I know uses VTU output, and that uses MPI-IO. I can't really reconcile this, but it seems to suggest that MPI-IO must work for most of our users.
> 
> Best
> Wolfgang
> 
> 
> -- 
> ------------------------------------------------------------------------
> Wolfgang Bangerth          email:                 bangerth at colostate.edu
>                           www: http://www.math.colostate.edu/~bangerth/
> 

____________________________________________________________
Professor of Geophysics 
Earth & Planetary Sciences Dept., UC Davis
Davis, CA 95616
2129 Earth & Physical Sciences Bldg.
Office Phone: (530) 752-4169
http://magalibillen.faculty.ucdavis.edu

Currently on Sabbatical at Munich University (LMU)
Department of Geophysics (PST + 9 hr)

Avoid implicit bias - check before you submit: 
http://www.tomforth.co.uk/genderbias/
___________________________________________________________



More information about the Aspect-devel mailing list