Rui-peng Li
c430559482
remove fast hash
2022-02-12 01:09:15 -06:00
Rui-peng Li
c347b3c9e1
remove some changes
2022-02-12 00:53:55 -06:00
Ruipeng Li
7672c101cd
try to optimize hash
2022-02-11 22:17:47 -08:00
Ruipeng Li
c85a060d85
naive row nnz bound with spmv
2022-02-11 17:18:32 -08:00
Ulrike Yang
e5a82e81e6
specified SYCL support
2022-02-10 17:12:32 -08:00
Rob Falgout
ccd135d8da
Updating CHANGELOG
2022-02-10 10:18:32 -08:00
Rob Falgout
666f457d2b
Bumping release number to 2.24.0
2022-02-10 07:05:43 -08:00
Rob Falgout
4ee737b53c
Initial CHANGELOG update for new release
2022-02-10 07:02:26 -08:00
Ruipeng Li
9d99e1fb77
put unroll as a macro
2022-02-10 00:14:18 -05:00
Ruipeng Li
efed76505c
rename printf0; change unroll to 1
2022-02-09 23:44:49 -05:00
Wayne Mitchell
f8d482efad
Add packing on device with oneDPL for par matvec
2022-02-10 01:51:56 +00:00
Ruipeng Li
6180f1261c
minor change
2022-02-09 20:36:53 -05:00
Ruipeng Li
bef2862710
silence compiler warnings
2022-02-09 13:27:18 -05:00
Ruipeng Li
ab72d05bd8
Deviceomp ( #519 )
...
This PR fixes the build with Kokkos + OMP offload, supports OMP offload without linking CUDA libraries, and supports OMP offload on Intel GPUs.
2022-02-09 06:40:57 -08:00
Rui-peng Li
e0c24fd1f6
turn off printings
2022-02-08 16:34:05 -06:00
Ruipeng Li
04037377ea
debug for HIP
2022-02-08 14:48:58 -06:00
Ruipeng Li
8ba048c0b5
Forced regeneration of softlinks in shared library builds. ( #574 )
...
This PR (copied from #573 ) added -f to softlink generation to all the makefiles.
2022-02-07 16:54:44 -08:00
Ruipeng Li
e40f8219a3
fix for last merged PR
2022-02-07 18:26:53 -06:00
Quan Bui
734a10fcb7
Mgr setup gpu ( #400 )
...
Enable GPU setup for MGR solver.
* Added device specific functionality for interpolation
* Made device and host calls to interpolation consistent
* Edited IJ driver to use GPU capable options for MGR
* Updated saved files for new GPU options
* Updated CMakeLists to support new MGR capabilities
Co-authored-by: Ruipeng Li <li50@llnl.gov>
Co-authored-by: Daniel Osei-Kuffuor <oseikuffuor1@llnl.gov>
2022-02-07 15:54:52 -08:00
Ruipeng Li
790e8e7826
fix cuda 11 build ( #569 )
...
This PR fixes CUDA 11 build after merging #549 , also adds regression tests (build only) with CUDA 11.
2022-02-02 08:40:15 -08:00
Wayne Mitchell
a7bb784a45
SYCL support for AMG solve phase ( #549 )
...
This adds matvec, matrix transpose, and vector operations (axpy, inner product, etc.)
with sycl backend (via oneMKL and oneDPL) for running on Intel GPUs. Thus, the AMG
solve phase can now execute entirely on Intel GPUs.
2022-01-31 16:15:30 -08:00
Victor A. Paludetto Magri
b159c7dd58
Fortran interfaces ( #566 )
...
This PR adds Fortran interfaces for hypre_MGR and hypre_ILU. Additionally:
* Add ArrayArray types in `fortran.h`
* Add MGR and ILU options to the fortran interfaces for Krylov solvers
2022-01-31 15:32:14 -08:00
Ruipeng Li
24165eba5a
remove hash type from ij.c
2022-01-30 23:49:00 -08:00
Ruipeng Li
8cde1ac442
remove some shared memory in symbl.h
2022-01-30 23:42:03 -08:00
Ruipeng Li
2fdccfaa60
changed default group settings
2022-01-30 17:32:57 -08:00
Ruipeng Li
a43daab879
add symb1/numer1; new binned symbl
2022-01-29 22:40:22 -08:00
Ruipeng Li
ebe3d2bdf3
renamed a marco
2022-01-28 15:50:39 -08:00
Ruipeng Li
38ebda17b3
minor change in a file
2022-01-28 16:22:26 -06:00
Ruipeng Li
40258a8f6b
add a new file
2022-01-28 16:09:19 -06:00
Ruipeng Li
6f597695af
fix for hip
2022-01-27 22:26:04 -06:00
li50@llnl.gov
9e115e94f5
Merge branch 'deviceomp' of github.com:hypre-space/hypre into spgemm
2022-01-25 22:37:26 -08:00
Ruipeng Li
c8abcff853
fixed compile without stream; enable stream with hip and sycl
2022-01-25 22:17:25 -08:00
li50@llnl.gov
a0f95095f7
Merge branch 'master' of github.com:hypre-space/hypre into deviceomp
2022-01-25 21:44:46 -08:00
Ruipeng Li
f2763da988
change the use of HYPRE_WARP_SIZE in spgemm
2022-01-25 17:29:53 -08:00
Ruipeng Li
543a1e552d
remove hash type from runtime
2022-01-25 14:16:43 -08:00
Denis Barbier
dcbee14539
Add convenient CMake alias ( #563 )
...
Usually consumers of HYPRE call find_package(HYPRE) and depend on
HYPRE::HYPRE target. But they could also want to use HYPRE via
add_directory (for instance via a git submodule)) or FetchContent,
in which case they have to depend on HYPRE target.
This alias makes this usage more consistent, all users could then
depend on HYPRE::HYPRE. See for instance
https://cmake.org/pipermail/cmake/2018-November/068629.html
2022-01-25 12:06:47 -08:00
Ruipeng Li
ffa79d350f
add device shmem properties in device_data
2022-01-25 09:40:15 -08:00
Ruipeng Li
13ba004e75
minor change
2022-01-24 16:53:44 -08:00
Ruipeng Li
8d155b1fd9
binned alg3
2022-01-24 16:46:15 -08:00
Ruipeng Li
3ddf2dd836
minor change
2022-01-24 11:58:58 -08:00
Ruipeng Li
89e037fdac
code reorg
2022-01-24 11:55:37 -08:00
Ruipeng Li
d8783345be
removed alg 2 in the paper
2022-01-24 09:12:23 -08:00
Ruipeng Li
10e576bff0
completed binned algs.
2022-01-20 21:42:36 -08:00
Ruipeng Li
84fa589671
Redwood sh update ( #561 )
...
This PR adds a minor update in runtest.sh for redwood
2022-01-19 21:28:46 -08:00
Ruipeng Li
4c3ef2a0b4
Fortran gpu ( #470 )
...
This PR adds GPU examples for FORTRAN users, examples ex5f.f and ex12f.f.
2022-01-19 21:24:05 -08:00
Ruipeng Li
ce54070d76
fixed .saved ( #560 )
...
This PR updates files smoother.saved.lassen/ray that were not done in #534 .
2022-01-19 16:30:15 -08:00
Ruipeng Li
6bd12e6445
optimization
2022-01-14 14:20:21 -08:00
Ruipeng Li
86710f73dc
get number of blocks for bins
2022-01-13 09:01:50 -08:00
Ruipeng Li
514c72be69
add reading x0 from parcsr file back ( #548 )
...
This PR adds build_x0_type == 7 (read from parcsr file) back.
2022-01-12 08:57:14 -08:00
Ruipeng Li
bcccb117ef
ldg only for sm >= 35 ( #516 )
...
This PR fixes compile issues with CUDA sm_30. See #511
2022-01-12 08:55:58 -08:00