[aspect-devel] Aspect hangs after several time steps

Lev Karatun lev.karatun at gmail.com
Sun Feb 21 21:07:28 PST 2016


Hi Timo,

I've simplified the model to the extreme, and the solver still doesn't
converge (the residual in the last step was NaN). The model no longer uses
any of the plugins I used before, so I'm really confused. The cookbooks,
however, run without problems, and my model runs successfully in release mode.
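
A NaN residual like the one described above can sometimes be narrowed down
without trapping floating point exceptions. The following is only a minimal
sketch in standard C++11, not part of ASPECT or deal.II: the helper names
has_non_finite and raises_fp_exception are hypothetical, and the sketch polls
the floating point status flags from <cfenv> instead of raising SIGFPE, so it
assumes nothing about compiler- or libc-specific trapping support.

```cpp
#include <cfenv>
#include <cmath>
#include <vector>

// Hypothetical helper: true if any entry of v is NaN or +/-infinity.
bool has_non_finite(const std::vector<double> &v)
{
  for (const double x : v)
    if (!std::isfinite(x))
      return true;
  return false;
}

// Hypothetical helper: clears the floating point status flags, runs f(),
// and reports whether an invalid operation, a division by zero, or an
// overflow was flagged while f() ran. Nothing is trapped; no signal is sent.
template <typename Function>
bool raises_fp_exception(Function &&f)
{
  std::feclearexcept(FE_ALL_EXCEPT);
  f();
  return std::fetestexcept(FE_INVALID | FE_DIVBYZERO | FE_OVERFLOW) != 0;
}
```

Calling has_non_finite on a residual or solution vector after each step, or
wrapping a suspect computation in raises_fp_exception, gives a rough location
for the first bad value. Expressions the compiler folds at compile time never
set the flags, so this only sees operations that actually execute at run time.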

I'm trying to run the models on a lab computer, on a single core. I've attached
the prm file that I'm using; could you have a look, please?
Thanks in advance!

Best regards,
Lev Karatun.

2016-02-12 9:10 GMT-05:00 Timo Heister <heister at clemson.edu>:

> Lev,
>
> your system setup (compiler, operating system, standard libraries)
> doesn't allow for the debugging trick to detect floating point
> exceptions. You could try a recent version of clang (or gcc if you
> were using clang) and see. Sorry, I don't know a sure way to get this
> enabled (I have no problems with it on Ubuntu 14.04 using gcc or
> clang).
>
> That said, convergence failures can have other reasons not detectable
> using this technique. It is hard to tell, but you should probably
> simplify your test problem and if it works, increase the complexity
> one by one. Do the included files like convection-box or shell 2d/3d
> work correctly on your machine (is it a cluster?)?
>
> Best,
> Timo
>
> On Fri, Feb 12, 2016 at 5:21 AM, Lev Karatun <lev.karatun at gmail.com>
> wrote:
> > Hi Wolfgang,
> >
> > I fixed the issues that Timo pointed out, but I'm still getting the "did
> > not converge" error. Is there a way for me to debug it the same way Timo
> > did? What am I missing? A compiler with C++11 support? Something else?
> >
> > Best regards,
> > Lev Karatun.
> >
> > 2016-02-03 9:20 GMT-05:00 Wolfgang Bangerth <bangerth at tamu.edu>:
> >>
> >>
> >> Lev,
> >> this is complicated.
> >>
> >> You switched on floating point signals by hand, but your compiler support
> >> library has functions that trip this, so your program gets stopped in a
> >> place that is outside anyone's control (and where it doesn't actually
> >> matter much). This is why floating point exceptions were disabled
> >> automatically when you ran the ASPECT cmake script. In other words, to
> >> make progress, you need to disable floating point exceptions again.
> >>
> >> Of course, without being able to run FPE on your system, you won't be
> >> able to reproduce the issues Timo found on his system. But you don't have
> >> to find these issues -- he already did that for you. Just fix these places
> >> and you should be able to make progress.
> >>
> >> Best
> >>  W.
> >>
> >>
> >> On 02/01/2016 11:00 PM, Lev Karatun wrote:
> >>>
> >>> Hi Wolfgang,
> >>>
> >>> I've tried this; the last stack frame is:
> >>>
> >>>     #13 0x00000000012e50d5 in main (argc=2, argv=0x7fffffffe128) at
> >>>     /home/lev/aspect/aspect_debug_new/source/main.cc:513
> >>>
> >>>
> >>> Line 513 of main.cc is:
> >>>
> >>>     aspect::Simulator<3>::declare_parameters(prm);
> >>>
> >>>
> >>> It looks like it stops while trying to read the "Start time" line:
> >>>
> >>>     #10 0x00007ffff537a569 in dealii::ParameterHandler::declare_entry
> >>>     (this=0x7fffffffdc90, entry="Start time", default_value="0",
> >>>     pattern=..., documentation=
> >>>          "The start time of the simulation. Units: Years if the 'Use years in output instead of seconds' parameter is set; seconds otherwise.")
> >>>          at /home/lev/distrib/dealii_debug_new/source/base/parameter_handler.cc:1628
> >>>
> >>>
> >>> but it's just "set Start time = 0", I never changed it.
> >>>
> >>> Full backtrace:
> >>>
> >>>     Program received signal SIGFPE, Arithmetic exception.
> >>>     __mpn_lshift () at ../sysdeps/x86_64/lshift.S:26
> >>>     26              movq    -8(%rsi,%rdx,8), %mm7
> >>>     (gdb) bt
> >>>     #0  __mpn_lshift () at ../sysdeps/x86_64/lshift.S:26
> >>>     #1  0x0000003db304a53e in ___printf_fp (fp=0x7fffffff71b0, info=0x7fffffff70b0, args=<value optimized out>)
> >>>          at printf_fp.c:483
> >>>     #2  0x0000003db30458a0 in _IO_vfprintf_internal (s=<value optimized out>, format=<value optimized out>, ap=<value optimized out>)
> >>>          at vfprintf.c:1640
> >>>     #3  0x0000003db306f752 in _IO_vsnprintf (string=0x7fffffff7420 "", maxlen=<value optimized out>, format=0x7fffffff74f0 "%.*g", args=0x7fffffff7310)
> >>>          at vsnprintf.c:120
> >>>     #4  0x0000003dbd87eb4f in std::__convert_from_v (__cloc=<value optimized out>, __out=0x7fffffff7420 "", __size=45, __fmt=0x7fffffff74f0 "%.*g")
> >>>          at /usr/src/debug/gcc-4.4.7-20120601/obj-x86_64-redhat-linux/x86_64-redhat-linux/libstdc++-v3/include/x86_64-redhat-linux/bits/c++locale.h:89
> >>>     #5  0x0000003dbd880f23 in std::num_put<char, std::ostreambuf_iterator<char, std::char_traits<char> > >::_M_insert_float<double> (this=0x3dbdaf22e0, __s=..., __io=..., __fill=32 ' ', __mod=<value optimized out>, __v=-1.7976931348623157e+308)
> >>>          at /usr/src/debug/gcc-4.4.7-20120601/obj-x86_64-redhat-linux/x86_64-redhat-linux/libstdc++-v3/include/bits/locale_facets.tcc:980
> >>>     #6  0x0000003dbd881249 in std::num_put<char, std::ostreambuf_iterator<char, std::char_traits<char> > >::do_put (this=<value optimized out>, __s=..., __io=<value optimized out>, __fill=<value optimized out>, __v=<value optimized out>)
> >>>          at /usr/src/debug/gcc-4.4.7-20120601/obj-x86_64-redhat-linux/x86_64-redhat-linux/libstdc++-v3/include/bits/locale_facets.tcc:1127
> >>>     #7  0x0000003dbd89487f in put (this=0x7fffffff7630, __v=-1.7976931348623157e+308)
> >>>          at /usr/src/debug/gcc-4.4.7-20120601/obj-x86_64-redhat-linux/x86_64-redhat-linux/libstdc++-v3/include/bits/locale_facets.h:2390
> >>>     #8  std::basic_ostream<char, std::char_traits<char> >::_M_insert<double> (this=0x7fffffff7630, __v=-1.7976931348623157e+308)
> >>>          at /usr/src/debug/gcc-4.4.7-20120601/obj-x86_64-redhat-linux/x86_64-redhat-linux/libstdc++-v3/include/bits/ostream.tcc:73
> >>>     #9  0x00007ffff5373468 in dealii::Patterns::Double::description (this=0x1f92620)
> >>>          at /home/lev/distrib/dealii_debug_new/source/base/parameter_handler.cc:290
> >>>     #10 0x00007ffff537a569 in dealii::ParameterHandler::declare_entry (this=0x7fffffffdc90, entry="Start time", default_value="0", pattern=..., documentation=
> >>>          "The start time of the simulation. Units: Years if the 'Use years in output instead of seconds' parameter is set; seconds otherwise.")
> >>>          at /home/lev/distrib/dealii_debug_new/source/base/parameter_handler.cc:1628
> >>>     #11 0x000000000103fef2 in aspect::Parameters<3>::declare_parameters (prm=...)
> >>>          at /home/lev/aspect/aspect_debug_new/source/simulator/parameters.cc:88
> >>>     #12 0x000000000104e2e8 in aspect::Simulator<3>::declare_parameters (prm=...)
> >>>          at /home/lev/aspect/aspect_debug_new/source/simulator/parameters.cc:1283
> >>>     #13 0x00000000012e50d5 in main (argc=2, argv=0x7fffffffe128)
> >>>          at /home/lev/aspect/aspect_debug_new/source/main.cc:513
> >>>
> >>>
> >>>
> >>> Best regards,
> >>> Lev Karatun.
> >>>
> >>> 2016-02-01 21:34 GMT-05:00 Wolfgang Bangerth <bangerth at tamu.edu>:
> >>>
> >>>     On 02/01/2016 06:42 PM, Lev Karatun wrote:
> >>>
> >>>
> >>>         Could you please tell me what I'm doing wrong?..
> >>>         Thanks in advance.
> >>>
> >>>
> >>>     I don't think you *need* any of these packages. You may not get to
> >>>     see the exact location where the error happened in BLAS or some other
> >>>     low-level library, but ultimately you only want to know the place
> >>>     where you were in your own code. In other words, if the debugger
> >>>     stops at the location where the error happens, get a backtrace and go
> >>>     to the last stack frame in your code. Your task is then to find out
> >>>     what is wrong in that line.
> >>>
> >>>     Best
> >>>       W.
> >>>
> >>>     --
> >>>     ------------------------------------------------------------------------
> >>>     Wolfgang Bangerth               email: bangerth at math.tamu.edu
> >>>                                     www:   http://www.math.tamu.edu/~bangerth/
> >>>
> >>>     _______________________________________________
> >>>     Aspect-devel mailing list
> >>>     Aspect-devel at geodynamics.org
> >>>     http://lists.geodynamics.org/cgi-bin/mailman/listinfo/aspect-devel
>
>
>
> --
> Timo Heister
> http://www.math.clemson.edu/~heister/
-------------- next part --------------
A non-text attachment was scrubbed...
Name: 134.prm
Type: application/octet-stream
Size: 8845 bytes
Desc: not available
URL: <http://lists.geodynamics.org/pipermail/aspect-devel/attachments/20160222/719945b7/attachment-0001.obj>

