11668 Commits

Author SHA1 Message Date
Ruipeng Li
7681f7f180 add cublas to makefile 2022-03-10 08:43:44 -08:00
Ruipeng Li
8ecee0b47d update hypre's spmv 2022-03-09 22:36:16 -08:00
Ruipeng Li
31ca2338d2 saved.lassen 2022-03-09 16:22:22 -08:00
Ruipeng Li
009501d51c bug fix 2022-03-09 16:14:56 -08:00
Ruipeng Li
86dae0be5b a minor change 2022-03-09 14:29:22 -08:00
Ruipeng Li
1dc1261fe8 fix cpu regression 2022-03-09 14:26:40 -08:00
Ruipeng Li
f8fd57ab2a updated saved perf on ray 2022-03-09 13:54:00 -08:00
Ruipeng Li
9b8627ce84 update lassen benchmark saved results 2022-03-09 13:09:36 -08:00
Ruipeng Li
7a8cf68b9a add -repeats 2 for struct benchmark jobs 2022-03-09 13:09:07 -08:00
Ruipeng Li
9dda5af3c4 struct.c driver for reps == 2 2022-03-09 12:31:46 -08:00
Ruipeng Li
9c33e9a263 regression tests 2022-03-09 08:54:42 -08:00
Ruipeng Li
63c9fa65a2 add using hypre's spmv option 2022-03-08 22:11:31 -08:00
Ruipeng Li
7443a2ac6c missed some sync in the last commit 2022-03-07 23:54:03 -08:00
Ruipeng Li
e1b9a56405 add gpu sync for mpi 2022-03-07 23:34:33 -08:00
Ruipeng Li
8ee20f4812 cudamallocasync 2022-03-07 16:54:56 -08:00
Ruipeng Li
df0f6dbba7 configure options: cublas; cudamallocasync 2022-03-07 16:40:32 -08:00
Ruipeng Li
d7728d0bce updated ij driver for 2nd solve 2022-03-07 15:16:00 -08:00
Ruipeng Li
b97fbc13ed sync device at ending timing 2022-03-07 15:13:08 -08:00
Ruipeng Li
c2e4836c1e bug fix 2022-03-05 10:36:53 -08:00
Ruipeng Li
a51bb880a8 bug fix 2022-03-05 09:46:16 -08:00
Ruipeng Li
03546b428f Merge branch 'master' of github.com:hypre-space/hypre into nvcollab 2022-03-04 22:18:02 -08:00
Paul T. Bauman
251cd3d269
Need -O1 instead of -O0 for HIP in debug mode (#588)
This PR changes -O0 in debug mode to -O1 with HIP (at this time).
2022-03-04 12:40:35 -08:00
Ruipeng Li
95e6433fc7
GPU support with single precision (#572)
This PR fixes the GPU support with single precision.
2022-03-04 12:05:32 -08:00
Ruipeng Li
ebd6eb88c3
bug fix; nonsquare rap (#581)
This PR fixes a corner case of the RAP routine for a RAP matrix that is globally square but not locally square.
2022-03-03 21:26:17 -08:00
Paul T. Bauman
04af9a4cd9
HYPRE_Int -> HYPRE_BigInt (#585) 2022-02-18 12:16:35 -08:00
Golam Rabbani
94070dd3a9
Updated CMakeLists.txt for SYCL (#577)
With CMake, enable CUDA stream by default when using SYCL.
2022-02-17 18:21:51 -08:00
Victor A. Paludetto Magri
33a5051398
Add SStruct IO functions (#583)
This PR adds support for native print/read functions of SStructMatrix and SStructVector. Other important changes are:
* Add public functions for reading StructMatrix and StructVector.
* Add a new set of regression tests called "io" to the TEST_sstruct folder.
2022-02-17 18:06:23 -08:00
Ruipeng Li
9888903445 memory tracker 2022-02-16 15:04:20 -08:00
Ruipeng Li
c336122cc1 move to debug region 2022-02-16 14:19:15 -08:00
Victor A. Paludetto Magri
49dbf7b60a
Fix cross-compilation problem (#580)
This PR fixes issue #556.

AC_CHECK_FILE was being used to test for the existence of the .git folder. However, according to the Autoconf manual, it does not work when cross-compiling. This PR implements another strategy for locating the .git folder that also works when cross-compiling.
2022-02-16 07:55:02 -08:00
Ulrike Yang
e5a82e81e6 specified SYCL support 2022-02-10 17:12:32 -08:00
Rob Falgout
ccd135d8da Updating CHANGELOG 2022-02-10 10:18:32 -08:00
Rob Falgout
666f457d2b Bumping release number to 2.24.0 2022-02-10 07:05:43 -08:00
Rob Falgout
4ee737b53c Initial CHANGELOG update for new release 2022-02-10 07:02:26 -08:00
Ruipeng Li
ab72d05bd8
Deviceomp (#519)
This PR fixes the build with Kokkos + OMP offload, supports OMP offload without linking CUDA libraries, and supports OMP offload on Intel GPUs.
2022-02-09 06:40:57 -08:00
Ruipeng Li
8ba048c0b5
Forced regeneration of softlinks in shared library builds. (#574)
This PR (copied from #573) adds -f to softlink generation in all the makefiles.
2022-02-07 16:54:44 -08:00
Ruipeng Li
e40f8219a3 fix for last merged PR 2022-02-07 18:26:53 -06:00
Quan Bui
734a10fcb7
Mgr setup gpu (#400)
Enable GPU setup for MGR solver.
* Added device specific functionality for interpolation
* Made device and host calls to interpolation consistent
* Edited IJ driver to use GPU capable options for MGR
* Updated saved files for new GPU options
* Updated CMakeLists to support new MGR capabilities

Co-authored-by: Ruipeng Li <li50@llnl.gov>
Co-authored-by: Daniel Osei-Kuffuor <oseikuffuor1@llnl.gov>
2022-02-07 15:54:52 -08:00
Ruipeng Li
790e8e7826
fix cuda 11 build (#569)
This PR fixes CUDA 11 build after merging #549, also adds regression tests (build only) with CUDA 11.
2022-02-02 08:40:15 -08:00
Wayne Mitchell
a7bb784a45
SYCL support for AMG solve phase (#549)
This adds matvec, matrix transpose, and vector operations (axpy, inner product, etc.)
with a SYCL backend (via oneMKL and oneDPL) for running on Intel GPUs. Thus, the AMG
solve phase can now execute entirely on Intel GPUs.
2022-01-31 16:15:30 -08:00
Victor A. Paludetto Magri
b159c7dd58
Fortran interfaces (#566)
This PR adds Fortran interfaces for hypre_MGR and hypre_ILU. Additionally:

* Add ArrayArray types in `fortran.h`
* Add MGR and ILU options to the fortran interfaces for Krylov solvers
2022-01-31 15:32:14 -08:00
Denis Barbier
dcbee14539
Add convenient CMake alias (#563)
Usually consumers of HYPRE call find_package(HYPRE) and depend on
HYPRE::HYPRE target.  But they could also want to use HYPRE via
add_subdirectory (for instance via a git submodule) or FetchContent,
in which case they have to depend on HYPRE target.

This alias makes this usage more consistent, all users could then
depend on HYPRE::HYPRE.  See for instance
  https://cmake.org/pipermail/cmake/2018-November/068629.html
2022-01-25 12:06:47 -08:00
Ruipeng Li
84fa589671
Redwood sh update (#561)
This PR adds a minor update to runtest.sh for redwood.
2022-01-19 21:28:46 -08:00
Ruipeng Li
4c3ef2a0b4
Fortran gpu (#470)
This PR adds GPU examples for FORTRAN users, examples ex5f.f and ex12f.f.
2022-01-19 21:24:05 -08:00
Ruipeng Li
ce54070d76
fixed .saved (#560)
This PR updates the files smoother.saved.lassen/ray, which were not updated in #534.
2022-01-19 16:30:15 -08:00
Ruipeng Li
514c72be69
add reading x0 from parcsr file back (#548)
This PR adds build_x0_type == 7 (read from parcsr file) back.
2022-01-12 08:57:14 -08:00
Ruipeng Li
bcccb117ef
ldg only for sm >= 35 (#516)
This PR fixes compile issues with CUDA sm_30. See #511
2022-01-12 08:55:58 -08:00
Ruipeng Li
436e09cba2
Early break in CG Eig (#534)
This PR adds early break in CG for eigenvalue estimations.
2022-01-12 08:53:42 -08:00
Ruipeng Li
e5e295ca1b fix after merge 2021-12-10 09:17:21 -08:00
Ruipeng Li
776b1de6b5 Merge branch 'master' of github.com:hypre-space/hypre into nvcollab 2021-12-10 09:13:18 -08:00