[CIG-SEISMO] Error executing specfem3d with CUDA / GeForce GT 730
Leandro Gazoni
lgazoni at gmail.com
Thu May 3 15:39:33 PDT 2018
Hi Daniel,
Thanks for your help! about the architecture of the video card I believe
the GT 730 is Kepler.
I compiled Specfem3d using only the command: "./configure --with-cuda"
without any arguments. But I've tried before with ./configure --with-cuda =
cuda5, both generate the makefile and compile without errors.
*The answer to configure follows.*
*## ---- ##*
*## CUDA ##*
*## ---- ##*
*checking for nvcc ... /usr/local/cuda-9.1/bin/nvcc*
*checking for cuda_runtime.h ... yes*
*checking nvcc compilation with cudaMalloc in -lcudart ... yes*
*checking nvcc linking with cudaMalloc in -lcudart ... yes*
*checking linking with cudaMalloc in -lcudart ... yes*
The problem is when running the simple_model example.
*nacib at jobi:~/Downloads/specfem3d/EXAMPLES/meshfem3D_examples/simple_model$
./run_this_example.shrunning example: Qui Mai 3 13:24:35 -03 2018 setting
up example... running mesher... running database generation... running
solver...Error in setConst_hprime_xx: invalid device symbolThe problem is
maybe -arch sm_13 instead of -arch sm_11 in the Makefile, please
doublecheck*
-----
*nacib at jobi:/usr/local/cuda-9.1/samples/bin/x86_64/linux/release$
./deviceQuery*
*./deviceQuery Starting...*
CUDA Device Query (Runtime API) version (CUDART static linking)
Detected 1 CUDA Capable device(s)
Device 0: "GeForce GT 730"
CUDA Driver Version / Runtime Version 9.1 / 9.1
CUDA Capability Major/Minor version number: 2.1
Total amount of global memory: 1982 MBytes (2078605312
bytes)
MapSMtoCores for SM 2.1 is undefined. Default to use 64 Cores/SM
MapSMtoCores for SM 2.1 is undefined. Default to use 64 Cores/SM
( 2) Multiprocessors, ( 64) CUDA Cores/MP: 128 CUDA Cores
GPU Max Clock rate: 1400 MHz (1.40 GHz)
Memory Clock rate: 700 Mhz
Memory Bus Width: 128-bit
L2 Cache Size: 131072 bytes
Maximum Texture Dimension Size (x,y,z) 1D=(65536), 2D=(65536,
65535), 3D=(2048, 2048, 2048)
Maximum Layered 1D Texture Size, (num) layers 1D=(16384), 2048 layers
Maximum Layered 2D Texture Size, (num) layers 2D=(16384, 16384), 2048
layers
Total amount of constant memory: 65536 bytes
Total amount of shared memory per block: 49152 bytes
Total number of registers available per block: 32768
Warp size: 32
Maximum number of threads per multiprocessor: 1536
Maximum number of threads per block: 1024
Max dimension size of a thread block (x,y,z): (1024, 1024, 64)
Max dimension size of a grid size (x,y,z): (65535, 65535, 65535)
Maximum memory pitch: 2147483647 bytes
Texture alignment: 512 bytes
Concurrent copy and kernel execution: Yes with 1 copy engine(s)
Run time limit on kernels: Yes
Integrated GPU sharing Host Memory: No
Support host page-locked memory mapping: Yes
Alignment requirement for Surfaces: Yes
Device has ECC support: Disabled
Device supports Unified Addressing (UVA): Yes
Supports Cooperative Kernel Launch: No
Supports MultiDevice Co-op Kernel Launch: No
Device PCI Domain ID / Bus ID / location ID: 0 / 1 / 0
Compute Mode:
< Default (multiple host threads can use ::cudaSetDevice() with device
simultaneously) >
deviceQuery, CUDA Driver = CUDART, CUDA Driver Version = 9.1, CUDA Runtime
Version = 9.1, NumDevs = 1
Result = PASS
Thank you,
Best regards,
Leandro Gazoni
2018-05-03 19:14 GMT-03:00 Daniel B. Peter <daniel.peter at kaust.edu.sa>:
> hi Leandro,
>
> how did you compile SPECFEM3D?
>
> the GT 730 comes in two chip flavors, not sure which one you will have.
> one of them supports CUDA compute capability 3.5 (Kepler). to try that out,
> use something like:
>
> ./configure --with-cuda=cuda5 CUDA_FLAGS=.. CUDA_LIB=.. CUDA_INC=..
> MPI_INC=..
>
> best wishes,
> daniel
>
>
> > On May 3, 2018, at 11:24 PM, Leandro Gazoni <lgazoni at gmail.com> wrote:
> >
> > Hello Dimitri,
> >
> > Many thanks for the reply! The first thing I imagined would be to run a
> simple problem, so I chose the simple model. I modified it so that it could
> generate a small mesh and could run on a GPU like the GeForce GT 730.
> >
> > I believe the problem is small, but I do not know the reasons for the
> message about the architecture of the board (sm _....). I await and
> appreciate the response from the experts.
> >
> >
> > Thank you,
> > Best regards,
> > Leandro Gazoni
> >
> > 2018-05-03 16:05 GMT-03:00 Dimitri Komatitsch <
> komatitsch at lma.cnrs-mrs.fr>:
> >
> > Hi Leandro,
> >
> > Thanks for your message. I do not see any reason why a GeForce GT 730
> 64bit card could not be used, but I am not an expert, thus let me cc four
> experts.
> >
> > Could it be that the example is big and thus you are running out of
> memory?
> >
> > Thank you,
> > Best regards,
> > Dimitri.
> >
> >
> > On 05/03/2018 04:17 PM, Leandro Gazoni wrote:
> > Hello everyone,
> >
> > My name is Leandro Gazoni, I am a Phd student in computational mechanics
> in Federal University of Rio de Janeiro / Brazil. I have done some studies
> on using SprecFem3d I was able to compile, run and modify some examples
> using MPI.
> >
> > I would like to test the same problems with cuda but I have had problems
> running the example: (/ EXAMPLES / meshfem3D_examples / simple_model) with
> a GPU: GeForce GT 730, 64bit card. Is it possible to run tests with this
> board?
> >
> > I can compile specfem3d without problems with ./configure --with-cuda,
> but when running the example I have the following error:
> >
> > born @ jobi: ~ / Downloads / specfem3d / EXAMPLES / meshfem3D_examples /
> simple_model $ ./run_this_example.sh
> > running example: Thu May 3 05:06:59 -03 2018
> >
> > setting up example ...
> >
> > running mesher ...
> >
> > running database generation ...
> >
> > running solver ...
> >
> > Error in setConst_hprime_xx: invalid device symbol
> > The problem is maybe -arch sm_13 instead of -arch sm_11 in the Makefile,
> please doublecheck
> >
> >
> > Regards,
> > Leandro Gazoni
> >
> > ==============
> > ParFile
> > ==============
> > # number of MPI processors
> > NPROC = 1
> >
> > # time step parameters
> > NSTEP = 10000
> > DT = 0.01
> >
> > GPU_MODE = .true.
> >
> >
> > ==============
> > Mesh Parfile
> > ==============
> > # number of elements at the surface along edges of the mesh at the
> surface
> > # (must be 8 * multiple of NPROC below if mesh is not regular and
> contains mesh doublings)
> > # (must be multiple of NPROC below if mesh is regular)
> > NEX_XI = 32
> > NEX_ETA = 32
> >
> > # number of MPI processors along xi and eta (can be different)
> > NPROC_XI = 1
> > NPROC_ETA = 1
> >
> > # number of regions
> > NREGIONS = 4
> > # define the different regions of the model as :
> > #NEX_XI_BEGIN #NEX_XI_END #NEX_ETA_BEGIN #NEX_ETA_END #NZ_BEGIN
> #NZ_END #material_id
> > 1 32 1 32 1 4
> 1
> > 1 32 1 32 5 5
> 2
> > 1 32 1 32 6
> 15 3
> > 14 25 7 19 7
> 10 4
> >
> >
> >
> >
> >
> > _______________________________________________
> > CIG-SEISMO mailing list
> > CIG-SEISMO at geodynamics.org
> > http://lists.geodynamics.org/cgi-bin/mailman/listinfo/cig-seismo
> >
> >
> > --
> > Dimitri Komatitsch, CNRS Research Director (DR CNRS)
> > Laboratory of Mechanics and Acoustics, Marseille, France
> > http://komatitsch.free.fr
> >
>
>
> ________________________________
> This message and its contents including attachments are intended solely
> for the original recipient. If you are not the intended recipient or have
> received this message in error, please notify me immediately and delete
> this message from your computer system. Any unauthorized use or
> distribution is prohibited. Please consider the environment before printing
> this email.
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.geodynamics.org/pipermail/cig-seismo/attachments/20180503/9c3332bc/attachment-0001.html>
More information about the CIG-SEISMO
mailing list