Commit Graph

2212 Commits

Author SHA1 Message Date
Ruipeng Li
2fa29169c6 bug fix 2022-06-29 10:06:09 -07:00
Ruipeng Li
172787d7d9 delete files 2022-06-28 22:18:39 -07:00
Ruipeng Li
b03f350bf1 fix after merge 2022-06-28 22:15:45 -07:00
Ruipeng Li
4ed68414e5 Merge branch 'master' of github.com:hypre-space/hypre into interp_trunc 2022-06-28 22:10:57 -07:00
Ruipeng Li
b3573fb7a5 .saved 2022-06-28 21:09:23 -07:00
Wayne Mitchell
4411530e76
hypre_Item (#645)
Introduce hypre_DeviceItem to further unify cuda/hip/sycl implementation.
Unify some wrappers for thread/warp-level kernel routines.
2022-06-28 08:27:36 -07:00
Ruipeng Li
e270c561b0
Spgemm (#639)
This PR includes optimizations for hypre's SpGEMM and ParSpGEMM kernels

Co-authored-by: Wayne Mitchell <mitchell82@llnl.gov>
Co-authored-by: Paul T. Bauman <ptbauman@gmail.com>
Co-authored-by: Sarah Osborn <30503782+osborn9@users.noreply.github.com>
2022-06-24 10:42:16 -07:00
Victor A. Paludetto Magri
8268b9f1e1
hypre_ParCSRMatrixPrintIJ on device (#655)
hypre_ParCSRMatrixPrintIJ works for matrices living on the device w/o the need of UVM support. A explicit copy is to host memory is performed in this function prior to printing the files.
2022-06-22 20:49:57 -04:00
Victor A. Paludetto Magri
850fd47d07
Fix chebyshev smoother for singular problems (#657)
See PR's description for additional info
2022-06-22 20:47:09 -04:00
Ruipeng Li
b58585e0f0
add a func (#646)
This PR adds a function to perform local transposition of ParCSR.
2022-06-21 08:52:21 -07:00
Ruipeng Li
322e6a5e6e astyle 2022-06-17 10:14:00 -07:00
Ruipeng Li
18f85886ff remove debug code 2022-06-17 09:15:08 -07:00
Ruipeng Li
b9b93c45ef save debug code of Pass0 2022-06-17 09:13:58 -07:00
Ruipeng Li
8d54b78730 fix some nvtx region names 2022-06-15 23:18:00 -07:00
Ruipeng Li
3509640354 optimized interp_trunc 2022-06-15 22:55:35 -07:00
Rob Falgout
03e0150ee4 Change release number to 2.25.0 2022-06-13 17:17:54 -07:00
Ruipeng Li
6b34e985eb minor change 2022-06-13 16:20:03 -07:00
Ruipeng Li
dfa39f680b filtering out srun warnings 2022-06-13 11:11:08 -07:00
Ruipeng Li
5dbe0a2a10 saved.tioga 2022-06-12 23:46:01 -07:00
Ruipeng Li
a5b10499cb tune for AMD GPUs 2022-06-12 11:34:22 -07:00
Ruipeng Li
ee88d0b1a4 header 2022-06-11 11:49:56 -07:00
Ruipeng Li
c76cd1fd04 bug fix 2022-06-10 15:08:56 -07:00
Ruipeng Li
84baffc1da add license 2022-06-10 15:08:41 -07:00
Ruipeng Li
cb5814c4f4 minor change 2022-06-10 15:07:57 -07:00
Ruipeng Li
6ce70c135e bug fix 2022-06-10 14:47:46 -07:00
Ruipeng Li
7a7a63f802 Merge branch 'spgemm' of github.com:hypre-space/hypre into spgemm 2022-06-10 12:50:32 -07:00
Ruipeng Li
834bac5d8f update bench perf. on lassen 2022-06-10 12:50:24 -07:00
Ruipeng Li
637af55397 Merge branch 'spgemm' of github.com:hypre-space/hypre into spgemm 2022-06-10 12:14:35 -07:00
Ruipeng Li
c9b9a36354 update bench perf. on ray 2022-06-10 12:14:20 -07:00
Wayne Mitchell
3a735f9ccd Merge branch 'spgemm' of github.com:hypre-space/hypre into spgemm 2022-06-10 19:07:39 +00:00
Wayne Mitchell
951fa56baa Fix copy paste error 2022-06-10 19:06:34 +00:00
Wayne Mitchell
7dc8321f4a Additional fixes for sycl build. Note that CSR matrix memory location must be set correctly before calling hypre_CSRMatrixSetRownnz. 2022-06-10 19:02:35 +00:00
Ruipeng Li
eb77ea2da2 Merge branch 'spgemm' of github.com:hypre-space/hypre into spgemm 2022-06-10 11:05:48 -07:00
Ruipeng Li
2782ac5079 lassen saved files 2022-06-10 11:04:56 -07:00
Ruipeng Li
d34a1fac60 update saved ray 2022-06-10 09:35:15 -07:00
Ruipeng Li
68934f9604 bug fix? 2022-06-09 15:35:23 -07:00
Ruipeng Li
4224e7227a change the default (non-binned) back 2022-06-09 15:34:24 -07:00
Ruipeng Li
78eb2ed9b3 minor changes 2022-06-09 15:16:03 -07:00
Ruipeng Li
f7e652ca63 add more printf 2022-06-09 15:14:53 -07:00
Wayne Mitchell
3cc0138a9b Merge changes from Ruipeng and finish fixing up sycl. Seems to work now. 2022-06-09 20:34:00 +00:00
Wayne Mitchell
8730346c03 More debugging code and astyle. About to sync up with Ruipeng. 2022-06-09 19:39:03 +00:00
Ruipeng Li
67a2588888 Cmake and make files 2022-06-09 08:51:49 -07:00
Ruipeng Li
5b35ee3a41 bug fix 2022-06-09 00:31:47 -07:00
Ruipeng Li
26d53aacce bug fix 2022-06-09 00:11:31 -07:00
Ruipeng Li
f6edbf7631 Merge branch 'spgemm' of github.com:hypre-space/hypre into spgemm 2022-06-08 17:21:42 -07:00
Ruipeng Li
2eb2cc7143 saved ray 2022-06-08 17:21:24 -07:00
Ruipeng Li
d461c7184a update lassen saved 2022-06-08 17:19:42 -07:00
Ruipeng Li
db4fd264db bug fix 2022-06-08 15:33:47 -07:00
Ruipeng Li
9036263552 minor changes 2022-06-08 13:30:38 -07:00
Ruipeng Li
ae31f37135 change binned to be the default 2022-06-08 12:40:43 -07:00
Ruipeng Li
1112186d12 add some comments 2022-06-08 11:42:46 -07:00
Wayne Mitchell
d80c0f3550 Merge branch 'spgemm' of github.com:hypre-space/hypre into spgemm 2022-06-08 18:32:56 +00:00
Wayne Mitchell
d8c6556e7e Fixes for sycl. Still debugging incorrect results. 2022-06-08 18:32:49 +00:00
Ruipeng Li
430241d7d0 fix matrix num cols 2022-06-08 11:29:01 -07:00
Ruipeng Li
0eab14592b code style 2022-06-08 10:53:39 -07:00
Ruipeng Li
a436b670db update saved.ray 2022-06-08 10:30:45 -07:00
Ruipeng Li
f867d600b2 remove commented code 2022-06-07 15:28:33 -07:00
Ruipeng Li
67c545fce0 bug fix matvecT 2022-06-07 15:28:11 -07:00
Ruipeng Li
3f39f5d4fa update printMM 2022-06-07 15:26:40 -07:00
Ruipeng Li
8b02ab88d3 GPU support for rownnz using IntArray 2022-06-07 12:08:43 -07:00
Ruipeng Li
c09ab567e7 remove a function 2022-06-07 10:13:11 -07:00
Ruipeng Li
31d4f891bf turn off timing 2022-06-06 21:46:46 -07:00
Ruipeng Li
f4917850e7 Merge branch 'master' of github.com:hypre-space/hypre into spgemm 2022-06-06 21:45:49 -07:00
Ruipeng Li
57f0ddd75c removed old impl 2022-06-06 21:40:36 -07:00
Ruipeng Li
c12449c44c sycl impl. 2022-06-06 16:42:26 -07:00
Ruipeng Li
0fee4f3c80 astyle 2022-06-06 13:23:32 -07:00
Ruipeng Li
4efa15836d support P100 2022-06-06 13:17:19 -07:00
Victor A. Paludetto Magri
07a8def6f8
Fix compilation warnings (#643)
This PR fixes compilation warnings obtained with gcc-11, clang-12, and clang-14. A list of the warnings is given below:

* -Wundef
* -Wunused
* -Wdouble-promotion
* -Wsometimes-uninitialized
* -Wunused-variable
* -Wunused-but-set-variable
2022-05-31 20:32:49 -04:00
Ruipeng Li
0aa6d52f92 style change 2022-05-27 10:56:32 -07:00
Ruipeng Li
8070877390 code style 2022-05-27 10:21:07 -07:00
Ruipeng Li
8c58483b56 Merge branch 'master' of github.com:hypre-space/hypre into spgemm 2022-05-27 09:44:49 -07:00
Ruipeng Li
270bbe6e01 astyle 2022-05-27 09:42:35 -07:00
Ruipeng Li
96991d3d30 name changes 2022-05-26 20:03:07 -07:00
Ruipeng Li
875180594d change some var names 2022-05-26 19:54:32 -07:00
Victor A. Paludetto Magri
edb91b4a50
Add -auxfromfile option to IJ driver (#633)
Add -auxfromfile option for reading an auxiliary matrix from file, which is then used to build the preconditioner. This is useful, for example, for the case when a filtered version of A is used to build the preconditioner.
2022-05-26 21:23:31 -04:00
Ruipeng Li
17bc61ed48 Merge branch 'spgemm' of github.com:hypre-space/hypre into spgemm 2022-05-26 16:56:43 -07:00
Ruipeng Li
9cd6eb1ff8 bug fix 2022-05-26 16:56:32 -07:00
Ruipeng Li
4ac9d00577
Parspgemm (#641)
Merge two spgemm branches into one
2022-05-26 14:38:15 -07:00
Ruipeng Li
09eb54520c fix after merge 2022-05-26 10:07:11 -07:00
Ruipeng Li
4e78801d2a Merge branch 'master' of github.com:hypre-space/hypre into spgemm 2022-05-25 16:36:44 -07:00
Ruipeng Li
e766e36e76
Add header to remove header-transitivity issue (#636)
Add header for `thrust::remove_if.`
Co-authored-by: Paul T. Bauman <ptbauman@gmail.com>
2022-05-24 14:18:57 -07:00
Ruipeng Li
1c1bf95b10
fix with umpire 2022 (#625)
This PR fixes hypre with umpire 2022
2022-05-24 13:55:08 -07:00
Ruipeng Li
a8d423013b
missing -f in softlink commands (#594)
This PR adds in a few of the Mafefiles the missing -f in the softlink commands.

Co-authored-by: Paul Mullowney <Paul.Mullowney@nrel.gov>
2022-05-24 13:39:33 -07:00
Paul T. Bauman
565bbe0511
Need to read start/end indices as HYPRE_BigInt (#605)
This fixes an issue with the ParCSRMatrixRead when compiled with mixed-int enabled.
2022-05-24 13:32:56 -07:00
Ruipeng Li
aa0446d720
ij help (#634)
Minor fix to `ij -help`
2022-05-24 13:30:23 -07:00
Ruipeng Li
ef3f890d4b
Nvcollab (#591)
This PR contains various GPU optimizations in the collaboration with the NVIDIA team. 

Co-authored-by: Peng Wang <penwang@nvidia.com>
2022-05-24 13:27:32 -07:00
Ruipeng Li
bec8645cf9
script option for runtest.sh (#632)
This PR changes runtest.sh to run an executable with a "script"  and allows valgrind and mpibind on all platforms.
2022-05-19 10:10:03 -07:00
Daniel Osei-Kuffuor
63208e3e34
Hotfix for issues with dsuperlu in regression test. (#631)
Commented out unnecessary memory deallocation check.
2022-05-15 21:42:28 -07:00
Wayne Mitchell
dfdd1cd12f
Sycl par matmat (#611)
Further unification of GPU implementation across cuda/hip/sycl.
Implements the parallel matrix matrix product in sycl.
HYPRE_CUDA_LAUNCH and HYPRE_SYCL_LAUNCH macros have 
been unified under HYPRE_GPU_LAUNCH for kernel launches.
Replace HYPRE_SetSpGemmUseCusparse with HYPRE_SetSpGemmUseVendor.
2022-05-09 15:24:44 -07:00
Daniel Osei-Kuffuor
00d1dfd3f7
Mgr block jacobi (#607)
* Added new capabilities to allow multilevel assignment of solver options
* New (local) block Jacobi option for smoothers and intergrid operators
* Added capabilities to do CPR in MGR
* Updated non-Galerkin strategy for constructing the coarse grid.

Co-authored-by: Quan Bui <mquan.bui@gmail.com>
2022-05-09 08:30:05 -07:00
Ruipeng Li
117963fb19 fixed a debugging check 2022-04-13 17:01:07 -07:00
Ruipeng Li
3819f474f0 A: row nnz = 1, numer 2022-04-13 16:01:42 -07:00
Ruipeng Li
6450927874 A: row nnz = 1, numer (1st version) 2022-04-12 11:18:42 -07:00
Ruipeng Li
df10f0b5ed A: row nnz = 1, symbl 2022-04-12 11:17:56 -07:00
Ruipeng Li
4025b32ad8 update driver 2022-04-07 13:11:32 -07:00
Victor A. Paludetto Magri
8017ce459b
Fix segfault on HYPRE_SStructGraphDestroy (#617)
This PR fixes a segmentation fault on HYPRE_SStructGraphDestroy. The error occurred when the number of graph entries added to the SStructGraph via HYPRE_SStructGraphAddEntries was larger than 1000.
2022-04-06 21:12:41 -07:00
Ruipeng Li
4421ebd83a Parprint; function name change of sync stream 2022-04-06 18:13:49 -07:00
Ruipeng Li
096ce6c1dd driver changes 2022-04-06 17:39:36 -07:00
Ruipeng Li
e21778682b minor changes 2022-04-06 17:39:23 -07:00
Victor A. Paludetto Magri
70d055a994
Fix complex build (#616)
This PR fixes compilation of the "complex" build variant of hypre. It also adds hypre_csqrt for computing the square root of an HYPRE_Complex number. This function/macro works when enable-complex is turned on/off.
2022-04-06 15:02:04 -07:00
Ruipeng Li
224bb78d4f Merge branch 'spgemm' of github.com:hypre-space/hypre into parspgemm 2022-04-05 23:35:19 -07:00
Ruipeng Li
046b278c66 bug fix 2022-04-05 22:26:29 -07:00
Ruipeng Li
db8fbbf119 minor change in timing 2022-04-05 22:26:11 -07:00
Rob Falgout
4c5529810a Updating one missed copyright date in user manual 2022-04-05 16:40:02 -07:00
Victor A. Paludetto Magri
e16167fe46
Fix copyright (#615)
This PR updates Copyright headers from "Copyright 1998-2019 ..." to "Copyright (c) 1998 ..."
2022-04-05 16:19:51 -07:00
Victor A. Paludetto Magri
9415d6aa08
FSAI implementation on CPUs (#610)
Thir PR adds a factorized sparse approximate inverse (FSAI) implementation on hypre, which can be used as a standalone solver, preconditioner to Krylov methods, or complex smoother to BoomerAMG. Particularly, we consider the adaptive algorithm version, where the sparsity pattern of the lower triangular factor G is built dynamically, i.e., during an iterative procedure that tries to find the best nonzero positions for a given row of G. This implementation was performed on top of the IJ interface. It uses the diagonal portion of A for constructing G, i.e., it's a block-Jacobi method in the MPI sense. List of additional changes:

* Add caliper instrumentation to FSAI.
* Add ZeroGuess option to FSAI.
* Performance optimizations.
* Add OpenMP support to FSAI.
* Make internal BLAS/LAPACK functions thread-safe. 
* Update CMake build.
* Add new test cases: beam_tet_dof459_np1, beam_hex_dof459_np2, and beam_tet_dof2475_np4.
* Add documentation for FSAI.

Co-authored-by: Heather Switzer <switzer4@lassen36.coral.llnl.gov>
Co-authored-by: heatherms27 <hmswitzer@email.wm.edu>
Co-authored-by: Sarah Osborn <30503782+osborn9@users.noreply.github.com>
2022-04-05 11:18:39 -07:00
Ruipeng Li
7b8377d3b1 new impl. of partial add 2022-04-05 10:19:44 -07:00
Ruipeng Li
e9c019e40f parcsr matvec T 2022-04-02 14:19:45 -07:00
Ruipeng Li
87b0b6669a update hypre's spmv 2022-04-02 14:16:08 -07:00
Ruipeng Li
d1774a1ded Merge branch 'sycl_par_matmat' of github.com:hypre-space/hypre into parspgemm 2022-04-01 12:45:42 -07:00
Ruipeng Li
d4b5c26ef9 new impl. of partial add 2022-04-01 12:21:19 -07:00
Ruipeng Li
f5950dee92 whitespace changes 2022-04-01 12:20:33 -07:00
Ruipeng Li
dcead56dd2 header 2022-03-31 19:03:42 -07:00
Ruipeng Li
aea2e7303f configure 2022-03-31 19:01:36 -07:00
Ruipeng Li
8ea39950b1 Merge branch 'nvcollab' of github.com:hypre-space/hypre into spgemm 2022-03-31 18:58:36 -07:00
Ruipeng Li
66b624de9e update ij_mm 2022-03-31 18:23:03 -07:00
Wayne Mitchell
bc30046d08 Removing guards for zero size onedpl calls where not needed 2022-04-01 00:50:22 +00:00
Ruipeng Li
1c35a59c1d astyle 2022-03-31 17:48:19 -07:00
Ruipeng Li
6086b07a96 minor change 2022-03-31 17:38:35 -07:00
Ruipeng Li
6e8607fd47 reorg parcsrmm 2022-03-31 17:38:19 -07:00
Wayne Mitchell
996dfaecd5 Revert changes in solvers.jobs 2022-03-31 22:01:35 +00:00
Ruipeng Li
8e34f6400f
fix domp build (#613) 2022-03-31 14:25:57 -07:00
Ruipeng Li
23c7777045 Merge branch 'sycl_par_matmat_domp' of github.com:hypre-space/hypre into parspgemm 2022-03-30 09:19:52 -07:00
ulrikeyang
303457abae
fixed MM-multipass interpolation for case of no C-points (#606)
* fixed MM-multipass interpolation for case of no C-points

* fixed the issue of isolated groups of fine points and added a regression test.

* corresponding changes to the device code

Co-authored-by: Ruipeng Li <li50@llnl.gov>
2022-03-29 15:14:29 -07:00
Ruipeng Li
e5393bc0eb fix domp build 2022-03-28 22:55:18 -07:00
Ruipeng Li
caec03c7ad Merge branch 'sycl_par_matmat' of github.com:hypre-space/hypre into parspgemm 2022-03-25 18:59:19 -07:00
Ruipeng Li
23b8527c1d fix after merge 2022-03-25 18:53:02 -07:00
Ruipeng Li
68be11b70f fix after merge; change CUDAKernel to GPUKernel 2022-03-25 18:52:40 -07:00
Wayne Mitchell
ce30e2faa4 Backward compatibility for UseCusparse 2022-03-26 01:09:17 +00:00
Ruipeng Li
ac65ca58fe changes to mm 2022-03-25 15:53:16 -07:00
Wayne Mitchell
5f7608b875 Change UseCusparse to UseVendor 2022-03-25 22:07:05 +00:00
Wayne Mitchell
112691e2d1 Remove debugging code 2022-03-25 21:07:58 +00:00
Wayne Mitchell
b860e28c72 Clean up 2022-03-25 20:57:41 +00:00
Wayne Mitchell
bb2cb43232 Merge branch 'master' into sycl_par_matmat 2022-03-25 20:27:17 +00:00
Wayne Mitchell
b5de7f7dfd MGR bug fix and no longer throw error for zero len kernel launches for sycl 2022-03-25 19:41:45 +00:00
Wayne Mitchell
6b5a13cedf Bug fix! Still need to clean up some further debugging code. 2022-03-25 00:56:39 +00:00
Wayne Mitchell
511c0ec77e Remove some debugging code. Temporary changes to solvers.jobs 2022-03-23 18:49:46 +00:00
Ruipeng Li
5fe37b2286
hypre_ParPrintf (#604)
This PR adds hypre_ParPrintf. Prints to standard out, only from the first processor in the communicator. Calls from other processes are ignored.
2022-03-21 09:10:01 -07:00
Wayne Mitchell
f2fa2e9577 Lots of ugly debugging code in here, but I have also fixed a couple esoteric things. Saving with debugging code in just in case I need to go back and use it. 2022-03-18 23:49:23 +00:00
Ruipeng Li
5513fd5115 Merge branch 'ParPrintf' of github.com:hypre-space/hypre into parspgemm 2022-03-18 15:20:52 -07:00
Ruipeng Li
7300f4b937 Merge branch 'master' of github.com:hypre-space/hypre into parspgemm 2022-03-18 15:17:45 -07:00
Ruipeng Li
04239e769a remove hypre_vprintf 2022-03-18 13:32:22 -07:00
Ruipeng Li
96985f04e2 ParPrintf; hypre_vprintf 2022-03-18 12:10:47 -07:00
Ruipeng Li
adfd07c509
Fix build on FreeBSD/powerpc*. (#603)
lr collides with lr from machine/frame.h header (link register):
Co-authored-by: Piotr Kubaj <pkubaj@FreeBSD.org>
2022-03-18 10:16:20 -07:00
Rob Falgout
fa43ea82e3
Bug fix in prefix sum for OpenMP IJ interface (#602)
This fixes a bug found in issue #522 for the prefix sum openmp code in IJ.
2022-03-16 10:56:09 -07:00
Ruipeng Li
e5f6655ba0 initial support for pattern only matrices (spgemm only) 2022-03-16 09:32:12 -07:00
Ruipeng Li
d079a90fb8 Merge branch 'master' of github.com:hypre-space/hypre into nvcollab 2022-03-15 08:32:25 -07:00
Ruipeng Li
92faac9748
fix memory location (#600)
This PR fixes a number of memory location issues in memory copy and memset. It also adds more strict checking in memory.c in the debug mode.
2022-03-14 11:19:28 -07:00
Victor A. Paludetto Magri
6fd043c9c2
(S)Struct IO on GPUs (#599)
This PR extends the (semi)-struct matrix/vector IO functions added on #583 with GPU support. Additionally:

* Fix regression tests on Lassen.
* Read data values into host memory
* Update Umatrix read algorithm when the ParCSRMatrix is expected to live on the device
* Reset deallocated pointers at hypre_IJMatrixDestroyParCSR to NULL
* Clone rownnz info if present on a CSRMatrix
* Reduce memory transfer and remove unused variables
* Fix bug with -print option
* Build rownnz info also when the ParCSRMatrix is in assembled state
* Remove a few instances of "return ierr"
* Refactor (s)struct IO - code works with cuda and without UM
* Add executables to gitignore
2022-03-13 20:14:23 -07:00
Ruipeng Li
c9a6c67229 Merge branch 'master' of github.com:hypre-space/hypre into nvcollab 2022-03-11 18:34:45 -08:00
Ruipeng Li
f7787ab0ae
fixes coarsening.jobs.14 (#598)
a temporary "fix"
2022-03-11 18:25:03 -08:00
Ruipeng Li
793b22aaf0 Merge branch 'nvcollab' of github.com:hypre-space/hypre into nvcollab 2022-03-11 08:34:20 -08:00
Ruipeng Li
97f3254d71 Merge branch 'master' of github.com:hypre-space/hypre into nvcollab 2022-03-11 08:33:31 -08:00
Ruipeng Li
8c344aee9a
Invalid assumption on exclusive_scan (#575)
This PR fixes a number of initialization problems with exclusive_scan on GPUs due to invalid assumptions of this function.
2022-03-11 08:32:26 -08:00
Rui-peng Li
700b0328bf bug fix 2022-03-10 22:19:22 -06:00
Ruipeng Li
9498625db4 Merge branch 'nvcollab' of github.com:hypre-space/hypre into nvcollab 2022-03-10 18:31:13 -08:00
Ruipeng Li
50c88ab95d minor changes 2022-03-10 18:30:52 -08:00
Ruipeng Li
c143265c61 regression on ray 2022-03-10 17:41:49 -08:00
Ruipeng Li
14a336c28b accidentally deleted io.sh 2022-03-10 13:30:31 -08:00
Ruipeng Li
2d06b53c4e benchmark ij on lassen 2022-03-10 13:26:15 -08:00
Ruipeng Li
815f2d57e0 add cublas/cusparse precision macros 2022-03-10 12:12:32 -08:00
Ruipeng Li
90cbe64fee saved.lassen 2022-03-10 08:45:27 -08:00
Ruipeng Li
7681f7f180 add cublas to makefile 2022-03-10 08:43:44 -08:00
Ruipeng Li
8ecee0b47d update hypre's spmv 2022-03-09 22:36:16 -08:00
Ruipeng Li
31ca2338d2 saved.lassen 2022-03-09 16:22:22 -08:00
Ruipeng Li
009501d51c bug fix 2022-03-09 16:14:56 -08:00
Ruipeng Li
86dae0be5b a minor change 2022-03-09 14:29:22 -08:00
Ruipeng Li
1dc1261fe8 fix cpu regression 2022-03-09 14:26:40 -08:00
Ruipeng Li
f8fd57ab2a updated saved perf on ray 2022-03-09 13:54:00 -08:00
Ruipeng Li
9b8627ce84 update lassen banchmark saved results 2022-03-09 13:09:36 -08:00
Ruipeng Li
7a8cf68b9a add -repeats 2 for struct benchmark jobs 2022-03-09 13:09:07 -08:00
Ruipeng Li
9dda5af3c4 struct.c driver for reps == 2 2022-03-09 12:31:46 -08:00
Ruipeng Li
63c9fa65a2 add using hypre's spmv option 2022-03-08 22:11:31 -08:00
Wayne Mitchell
5e90f43cbc astyle 2022-03-08 23:51:41 +00:00
Wayne Mitchell
34c16787b7 Addition of MatMat and TMatMat routines and clean up 2022-03-08 23:50:48 +00:00
Wayne Mitchell
297ff5d5a7 Par matmat is verified correcct for a small example 2022-03-08 21:08:47 +00:00
Ruipeng Li
7443a2ac6c missed some sync in the last commit 2022-03-07 23:54:03 -08:00
Ruipeng Li
e1b9a56405 add gpu sync for mpi 2022-03-07 23:34:33 -08:00
Ruipeng Li
8ee20f4812 cudamallocasync 2022-03-07 16:54:56 -08:00
Ruipeng Li
df0f6dbba7 configure options: cublas; cudamallocasync 2022-03-07 16:40:32 -08:00
Ruipeng Li
d7728d0bce updated ij driver for 2nd solve 2022-03-07 15:16:00 -08:00
Ruipeng Li
b97fbc13ed sync device at ending timing 2022-03-07 15:13:08 -08:00
Ruipeng Li
c2e4836c1e bug fix 2022-03-05 10:36:53 -08:00
Ruipeng Li
a51bb880a8 bug fix 2022-03-05 09:46:16 -08:00
Ruipeng Li
03546b428f Merge branch 'master' of github.com:hypre-space/hypre into nvcollab 2022-03-04 22:18:02 -08:00
Paul T. Bauman
251cd3d269
Need -O1 instead of -O0 for HIP in debug mode (#588)
This PR changes -O0 in debug mode to -O1 with HIP (at this time).
2022-03-04 12:40:35 -08:00
Ruipeng Li
95e6433fc7
GPU support with single precision (#572)
This PR fixes the GPU support with single precision.
2022-03-04 12:05:32 -08:00
Ruipeng Li
ebd6eb88c3
bug fix; nonsquare rap (#581)
This PR fixes a corner case of the RAP routine for RAP matrix that is globally square but not locally.
2022-03-03 21:26:17 -08:00
Wayne Mitchell
d388a2766e Lots of reorganization. This now has all functionality for par matmat and compiles, but needs debugging. 2022-03-01 01:59:27 +00:00
Wayne Mitchell
8112dd736f Further cleanup and reorg of device_utils.c/h and addition of more functionality needed for par matmat 2022-02-19 00:40:27 +00:00
Paul T. Bauman
04af9a4cd9
HYPRE_Int -> HYPRE_BigInt (#585) 2022-02-18 12:16:35 -08:00
Wayne Mitchell
cda5b10a69 Single processor device rap works 2022-02-18 18:58:28 +00:00
Wayne Mitchell
47ae1c8a22 Start major reorganization of device_utils.h 2022-02-18 17:44:22 +00:00
Golam Rabbani
94070dd3a9
Updated CMakeLists.txt for SYCL (#577)
With CMake, enable CUDA stream by default when using SYCL.
2022-02-17 18:21:51 -08:00
Victor A. Paludetto Magri
33a5051398
Add SStruct IO functions (#583)
This PR adds support for native print/read functions of SStructMatrix and SStructVector. Other important changes are:
* Add public functions for reading StructMatrix and StructVector.
* Add a new set of regression tests called "io" to the TEST_sstruct folder.
2022-02-17 18:06:23 -08:00
Wayne Mitchell
1e289479f1 astyle 2022-02-17 00:56:49 +00:00
Ruipeng Li
9888903445 memory tracker 2022-02-16 15:04:20 -08:00
Ruipeng Li
c336122cc1 move to debug region 2022-02-16 14:19:15 -08:00
Victor A. Paludetto Magri
49dbf7b60a
Fix cross-compilation problem (#580)
This PR fixes issue #556.

AC_CHECK_FILE was being used to test the existence of the .git folder. However, according to Autoconf manual, it does not work when cross-compiling. This PR implements another strategy for looking for the .git folder which works also when doing cross-compilation.
2022-02-16 07:55:02 -08:00
Wayne Mitchell
4136c63269 Switch to HYPRE_GPU_LAUNCH 2022-02-15 20:01:08 +00:00