Commit Graph

12019 Commits

Author SHA1 Message Date
Rui-peng Li
c430559482 remove fast hash 2022-02-12 01:09:15 -06:00
Rui-peng Li
c347b3c9e1 remove some changes 2022-02-12 00:53:55 -06:00
Ruipeng Li
7672c101cd try to optimize hash 2022-02-11 22:17:47 -08:00
Ruipeng Li
c85a060d85 naive row nnz bound with spmv 2022-02-11 17:18:32 -08:00
Ulrike Yang
e5a82e81e6 specified SYCL support 2022-02-10 17:12:32 -08:00
Rob Falgout
ccd135d8da Updating CHANGELOG 2022-02-10 10:18:32 -08:00
Rob Falgout
666f457d2b Bumping release number to 2.24.0 2022-02-10 07:05:43 -08:00
Rob Falgout
4ee737b53c Initial CHANGELOG update for new release 2022-02-10 07:02:26 -08:00
Ruipeng Li
9d99e1fb77 put unroll as a macro 2022-02-10 00:14:18 -05:00
Ruipeng Li
efed76505c rename printf0; change unroll to 1 2022-02-09 23:44:49 -05:00
Wayne Mitchell
f8d482efad Add packing on device with oneDPL for par matvec 2022-02-10 01:51:56 +00:00
Ruipeng Li
6180f1261c minor change 2022-02-09 20:36:53 -05:00
Ruipeng Li
bef2862710 silence compiler warnings 2022-02-09 13:27:18 -05:00
Ruipeng Li
ab72d05bd8
Deviceomp (#519)
This PR fixes the build with Kokkos + OMP offload, supports OMP offload without linking CUDA libraries, and supports OMP offload on Intel GPUs.
2022-02-09 06:40:57 -08:00
Rui-peng Li
e0c24fd1f6 turn off printings 2022-02-08 16:34:05 -06:00
Ruipeng Li
04037377ea debug for HIP 2022-02-08 14:48:58 -06:00
Ruipeng Li
8ba048c0b5
Forced regeneration of softlinks in shared library builds. (#574)
This PR (copied from #573) added -f to softlink generation to all the makefiles.
2022-02-07 16:54:44 -08:00
Ruipeng Li
e40f8219a3 fix for last merged PR 2022-02-07 18:26:53 -06:00
Quan Bui
734a10fcb7
Mgr setup gpu (#400)
Enable GPU setup for MGR solver.
* Added device specific functionality for interpolation
* Made device and host calls to interpolation consistent
* Edited IJ driver to use GPU capable options for MGR
* Updated saved files for new GPU options
* Updated CMakeLists to support new MGR capabilities

Co-authored-by: Ruipeng Li <li50@llnl.gov>
Co-authored-by: Daniel Osei-Kuffuor <oseikuffuor1@llnl.gov>
2022-02-07 15:54:52 -08:00
Ruipeng Li
790e8e7826
fix cuda 11 build (#569)
This PR fixes CUDA 11 build after merging #549, also adds regression tests (build only) with CUDA 11.
2022-02-02 08:40:15 -08:00
Wayne Mitchell
a7bb784a45
SYCL support for AMG solve phase (#549)
This adds matvec, matrix transpose, and vector operations (axpy, inner product, etc.)
with sycl backend (via oneMKL and oneDPL) for running on Intel GPUs. Thus, the AMG
solve phase can now execute entirely on Intel GPUs.
2022-01-31 16:15:30 -08:00
Victor A. Paludetto Magri
b159c7dd58
Fortran interfaces (#566)
This PR adds Fortran interfaces for hypre_MGR and hypre_ILU. Additionally:

* Add ArrayArray types in `fortran.h`
* Add MGR and ILU options to the fortran interfaces for Krylov solvers
2022-01-31 15:32:14 -08:00
Ruipeng Li
24165eba5a remove hash type from ij.c 2022-01-30 23:49:00 -08:00
Ruipeng Li
8cde1ac442 remove some shared memory in symbl.h 2022-01-30 23:42:03 -08:00
Ruipeng Li
2fdccfaa60 changed default group settings 2022-01-30 17:32:57 -08:00
Ruipeng Li
a43daab879 add symb1/numer1; new binned symbl 2022-01-29 22:40:22 -08:00
Ruipeng Li
ebe3d2bdf3 renamed a marco 2022-01-28 15:50:39 -08:00
Ruipeng Li
38ebda17b3 minor change in a file 2022-01-28 16:22:26 -06:00
Ruipeng Li
40258a8f6b add a new file 2022-01-28 16:09:19 -06:00
Ruipeng Li
6f597695af fix for hip 2022-01-27 22:26:04 -06:00
li50@llnl.gov
9e115e94f5 Merge branch 'deviceomp' of github.com:hypre-space/hypre into spgemm 2022-01-25 22:37:26 -08:00
Ruipeng Li
c8abcff853 fixed compile without stream; enable stream with hip and sycl 2022-01-25 22:17:25 -08:00
li50@llnl.gov
a0f95095f7 Merge branch 'master' of github.com:hypre-space/hypre into deviceomp 2022-01-25 21:44:46 -08:00
Ruipeng Li
f2763da988 change the use of HYPRE_WARP_SIZE in spgemm 2022-01-25 17:29:53 -08:00
Ruipeng Li
543a1e552d remove hash type from runtime 2022-01-25 14:16:43 -08:00
Denis Barbier
dcbee14539
Add convenient CMake alias (#563)
Usually consumers of HYPRE call find_package(HYPRE) and depend on
HYPRE::HYPRE target.  But they could also want to use HYPRE via
add_directory (for instance via a git submodule)) or FetchContent,
in which case they have to depend on HYPRE target.

This alias makes this usage more consistent, all users could then
depend on HYPRE::HYPRE.  See for instance
  https://cmake.org/pipermail/cmake/2018-November/068629.html
2022-01-25 12:06:47 -08:00
Ruipeng Li
ffa79d350f add device shmem properties in device_data 2022-01-25 09:40:15 -08:00
Ruipeng Li
13ba004e75 minor change 2022-01-24 16:53:44 -08:00
Ruipeng Li
8d155b1fd9 binned alg3 2022-01-24 16:46:15 -08:00
Ruipeng Li
3ddf2dd836 minor change 2022-01-24 11:58:58 -08:00
Ruipeng Li
89e037fdac code reorg 2022-01-24 11:55:37 -08:00
Ruipeng Li
d8783345be removed alg 2 in the paper 2022-01-24 09:12:23 -08:00
Ruipeng Li
10e576bff0 completed binned algs. 2022-01-20 21:42:36 -08:00
Ruipeng Li
84fa589671
Redwood sh update (#561)
This PR adds a minor update in runtest.sh for redwood
2022-01-19 21:28:46 -08:00
Ruipeng Li
4c3ef2a0b4
Fortran gpu (#470)
This PR adds GPU examples for FORTRAN users, examples ex5f.f and ex12f.f.
2022-01-19 21:24:05 -08:00
Ruipeng Li
ce54070d76
fixed .saved (#560)
This PR updates files smoother.saved.lassen/ray that were not done in #534.
2022-01-19 16:30:15 -08:00
Ruipeng Li
6bd12e6445 optimization 2022-01-14 14:20:21 -08:00
Ruipeng Li
86710f73dc get number of blocks for bins 2022-01-13 09:01:50 -08:00
Ruipeng Li
514c72be69
add reading x0 from parcsr file back (#548)
This PR adds build_x0_type == 7 (read from parcsr file) back.
2022-01-12 08:57:14 -08:00
Ruipeng Li
bcccb117ef
ldg only for sm >= 35 (#516)
This PR fixes compile issues with CUDA sm_30. See #511
2022-01-12 08:55:58 -08:00