Commit Graph

23 Commits

Author SHA1 Message Date
Ruipeng Li
ad5d7e009f
Gpu mixedInt (#380)
This PR adds GPU support for mixedInt. 

Co-authored-by: Rob Falgout <rfalgout@llnl.gov>
2021-06-10 11:10:13 -07:00
Ruipeng Li
3bc7d267ef
Gpu default (#336)
This PR changes AMG defaults regarding GPUs at various places, adds regression tests on GPUs, and simplifies CUDA boxloop implementations.

Co-authored-by: Sarah Virginia Osborn <osborn9@llnl.gov>
Co-authored-by: PaulMullowney <pmullown@nrel.gov>
Co-authored-by: Daniel Osei-Kuffuor <oseikuffuor1@llnl.gov>
Co-authored-by: Ruipeng Li <li50@euler.llnl.gov>
Co-authored-by: Ruipeng Li <coe0141@redwood.cm.cluster>
2021-05-24 17:16:35 -07:00
Rob Falgout
6f9260b67c
Add saved-file extension to runtest (#271)
This pull request adds a -save <ext> feature to the runtest.sh script to allow testing against different saved files on different platforms such as GPU machines. See Issue #255. A few additional things were done:

- All of the checks against the saved files were moved out of the individual tests and into runtest.sh.
- The output-file sanity checks that are in many of the tests were modified so they no longer depend on the saved files. Several issues were also uncovered and fixed.
2021-02-08 15:11:45 -08:00
Ruipeng Li
2186a8fb34
triangular solve on GPUs; runcheck (#256)
This PR fixes the triangular solve on GPUs and runcheck.sh.

Co-authored-by: Daniel Osei-Kuffuor <oseikuffuor1@llnl.gov>
2021-01-15 20:46:59 -08:00
Ruipeng Li
b49727f16b
Cuda triangular smoothers (#240)
* This commit adds CUDA-based smoothers for AMG built on the triangular parts of sparse matrices. These include a Gauss-Seidel smoother (relax_type==3), which uses CUSPARSE triangular solvers to invert L. Symmetric Gauss-Seidel is implemented in relax_type==6, also via CUSPARSE. Finally, two new smoothers are added. The first is a two-stage approximation to Gauss-Seidel using a parallel MatVec and L (relax_type==11). The second (relax_type==12) is a less effective version of 11 that uses A_diag instead of L for the smoothing. CPU implementations of these new smoothers are also provided. For the two-stage algorithms, L and U are NOT explicitly created, which seems faster and saves memory. In the two-stage preconditioner, multiplying by invdiag rather than dividing by the diagonal reduces register pressure and yields full occupancy.
Co-authored-by: Paul Mullowney <pmullown@nrel.gov>
Co-authored-by: PaulMullowney <60452402+PaulMullowney@users.noreply.github.com>
2020-12-17 19:37:59 -08:00
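The two-stage idea in the commit message above can be illustrated with a small sketch. It is not the hypre implementation. It assumes that "two-stage" means replacing the triangular solve with (D + L) by a one-term Neumann approximation, (D + L)^-1 ≈ (I - D^-1 L) D^-1, so that each sweep needs only matrix-vector products. The function name `two_stage_gs_sweep` and the dense list-of-lists matrix are hypothetical and chosen only for readability. The sketch also mirrors two points from the message: L is never formed explicitly, and the diagonal is inverted once so that later steps multiply instead of divide.

```python
def two_stage_gs_sweep(A, b, x):
    """One sweep of a two-stage approximation to Gauss-Seidel (illustrative only).

    Assumes A is a dense square matrix (list of rows) with a nonzero diagonal.
    """
    n = len(b)
    # Precompute the inverse diagonal once; later steps multiply by it.
    inv_diag = [1.0 / A[i][i] for i in range(n)]
    # Residual r = b - A x (a fully parallel MatVec).
    r = [b[i] - sum(A[i][j] * x[j] for j in range(n)) for i in range(n)]
    # Stage 1: Jacobi-like correction z = D^-1 r.
    z = [inv_diag[i] * r[i] for i in range(n)]
    # Stage 2: subtract D^-1 L z, reading the strictly lower part of A
    # in place (j < i) instead of building L as a separate matrix.
    corr = [inv_diag[i] * sum(A[i][j] * z[j] for j in range(i)) for i in range(n)]
    return [x[i] + z[i] - corr[i] for i in range(n)]


# Usage: repeated sweeps on a small diagonally dominant system.
A = [[4.0, -1.0, 0.0], [-1.0, 4.0, -1.0], [0.0, -1.0, 4.0]]
b = [2.0, 4.0, 10.0]  # chosen so that the exact solution is [1, 2, 3]
x = [0.0, 0.0, 0.0]
for _ in range(50):
    x = two_stage_gs_sweep(A, b, x)
```

Unlike a true Gauss-Seidel sweep, neither stage has a sequential dependency between rows, which is what makes this form attractive on GPUs.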
Luke
22f4d3f8c6
Cuda 11 API (#163)
This PR adds CUDA-11 support.
2020-11-05 20:57:57 -08:00
Ruipeng Li
aaf5aa564a
Aggressive coarsening and 2-stage MM-ext Interpolations on GPUs (#195)
This PR contains the following changes:
* Aggressive coarsening, i.e., 2nd SoC on GPUs
* 2-stage MM-ext Interpolations (MM-ext, MM-ext+e) on GPUs
* Enhanced abilities of extracting strong FF/FC/CF/CC submatrix with given SoC matrix
* Bug fix in device PMIS
Co-authored-by: Bjorn Sjogreen <sjogreen2@llnl.gov>
Co-authored-by: ulrikeyang <yang11@llnl.gov>
2020-09-23 17:13:23 -07:00
liruipeng
8833bed155 add compile flags in GPU regression test scripts 2020-08-27 18:10:27 -07:00
Ruipeng Li
5d5b75bc02 GPU regression tests 2020-06-05 17:25:45 -07:00
Ruipeng Li
cf4d9b78b5 bug fix in ILU 2020-02-28 00:32:23 -08:00
Ruipeng Li
43ad3d6703 cub allocator 2020-02-27 22:28:44 -08:00
Ruipeng Li
022d476eda bug fix 2020-02-26 14:24:29 -08:00
Ruipeng Li
687232e116 gpu regression test [clean regression tests on ray] 2020-02-09 11:31:51 -08:00
Ruipeng Li
8e8eb4f5cb Merge branch 'master' of https://github.com/hypre-space/hypre into amg-setup
Conflicts:
	src/IJ_mv/IJMatrix_parcsr.c
	src/parcsr_ls/par_nongalerkin.c
	src/seq_mv/csr_matrix.c
	src/utilities/_hypre_utilities.h
	src/utilities/binsearch.c
	src/utilities/gpuErrorCheck.c
	src/utilities/gpuErrorCheck.h
	src/utilities/gpuMem.c
	src/utilities/gpuMem.h
	src/utilities/hypre_cuda_reducer.h
	src/utilities/hypre_nvtx.h
	src/utilities/hypre_reducesum.c
	src/utilities/protos.h
2019-07-22 15:36:07 -07:00
Ruipeng Li
dad575fc60 don't know how to fix a configure warning. comment the test out 2019-07-15 12:06:41 -07:00
Ruipeng Li
f0ed0085a3 regression test 2019-07-13 12:46:46 -07:00
Rob Falgout
48c9f0b972 Changed all of the headers 2019-07-07 19:26:24 -07:00
Ulrike M. Yang
c7a58fe3d9 changed a few configure options for additional tests 2018-10-25 15:26:47 -07:00
Ruipeng Li
faf5cfad57 add more configure & make tests 2018-10-18 08:53:44 -07:00
Ruipeng Li
35acc1b5f2 truncate the jobs in machine-ray.sh. It took 173m on ray
"time ./test.sh machine-ray.sh ../src/
 Running test [machine-ray]
 real    173m12.647s
 user    505m33.655s
 sys     241m20.030s"
2018-10-13 14:29:48 -07:00
Ruipeng Li
ff33a15589 [draft] GPU regression test, machine-ray.sh 2018-09-25 09:32:59 -07:00
Ruipeng Li
d368274932 towards a GPU regression test 2018-09-23 10:07:43 -07:00
Ruipeng Li
c530ac0da8 towards a GPU regression test 2018-09-23 10:01:43 -07:00