hypre

CFD/hypre

Author	SHA1	Message	Date
ulrikeyang	0a0ddd0723	Fix s (#312 ) * Rewrote strength matrix generation for nodal > 0, since it was faulty. This also also fixed a bug that occurred when using hopscotch. * Added new option HYPRE_BoomerAMGSetKeepSameSign, which keeps values in S if they are of the same sign as the diagonal for nodal > 0. This option does not affect nodal = 0. * fixed a bug accidentally introduced in standard interpolation * removed option to set SCommPkgSwitch, which allowed to reduce communication for strength matrix, but was not working properly. This included removing occurrences of col_offd_S_to_A etc * included capability to deal with systems of PDEs in MM interpolation operators, except for 2s-MM-ext * updated saved files * added regression tests and modified elasticity regression tests	2021-04-12 12:20:28 -07:00
Ruipeng Li	25d0498a3e	Roctx (#319 ) This PR (by @pbauman #317) adds rocTX support, the ROCm analog to NVTX. Co-authored-by: Paul T. Bauman <ptbauman@gmail.com>	2021-04-12 08:36:09 -07:00
Ruipeng Li	b3a4a76a5f	Roc sparse (#316 ) This PR (by @pbauman #304) adds the first pass of rocSPARSE support. Co-authored-by: Paul T. Bauman <ptbauman@gmail.com>	2021-03-25 20:11:53 -07:00
Ruipeng Li	366b80f89b	2 stage gs (#314 ) This PR (by @PaulMullowney #282) includes updates/changes to the 2 stage GS preconditioner. Co-authored-by: Paul Mullowney <pmullown@nrel.gov>	2021-03-24 15:14:00 -07:00
Ruipeng Li	8002200aa0	Revert "2 stage gs (#308 )" (#313 ) This reverts commit `f7fb2853f2`.	2021-03-24 09:37:07 -07:00
Ruipeng Li	f7fb2853f2	2 stage gs (#308 ) This PR (by @PaulMullowney #282) includes updates/changes to the 2 stage GS preconditioner. Co-authored-by: Paul Mullowney <pmullown@nrel.gov>	2021-03-24 09:36:01 -07:00
Sarah Virginia Osborn	fcd2b2b4bb	Add missing utilities source file to CMakeList	2021-03-18 08:38:12 -07:00
Ruipeng Li	ce2a438215	removing device bindings in HYPRE_Init (#296 ) This PR removes GPU device binding in HYPRE_Init() which is now moved into test drivers. It also fixes the issue with Umpire when using non-default device (device id \neq 0)	2021-03-17 17:22:17 -07:00
Ruipeng Li	8223465ead	More hip support (#302 ) This PR (by @pbauman #301) add more Hip support that was missed in #297. Co-authored-by: Paul T. Bauman <ptbauman@gmail.com>	2021-03-12 20:30:18 -08:00
Ruipeng Li	67a0bdeab9	Hip support (#298 ) This PR (by @pbauman #297) is a primary step in adding HIP support. Co-authored-by: Paul T. Bauman <ptbauman@gmail.com>	2021-03-10 09:17:15 -08:00
Ruipeng Li	4630047ad3	HYPRE_USING_GPU (#295 ) This PR (by @pbauman #292 ) takes instances of #if defined(HYPRE_USING_CUDA) \|\| defined(HYPRE_USING_DEVICE_OPENMP) and changes them to #if defined(HYPRE_USING_GPU). Co-authored-by: Paul T. Bauman <ptbauman@gmail.com>	2021-03-05 16:35:03 -08:00
Sarah Virginia Osborn	9d45b1934a	Remove unnecessary including of superlu_ddefs in ij test	2021-03-05 09:05:19 -08:00
Ruipeng Li	2e97ffcc9e	bug fix recvbufsize (#294 ) This PR fixes a problem of communication between GPUs for structured solvers.	2021-03-04 11:47:42 -08:00
Rob Falgout	c9848a5ed8	Fixed a small mistake in configure/configure.in	2021-03-04 05:51:34 -08:00
Ruipeng Li	b5070c14e2	configure fix for last merged PR	2021-03-04 01:02:38 -08:00
Ruipeng Li	8c00f18a95	Hip build (#293 ) This PR adds options to the build system for enabling ROCm and HIP support. Co-authored-by: Paul T. Bauman <ptbauman@gmail.com>	2021-03-02 10:02:12 -08:00
Rob Falgout	7859aeee34	Changed the rhs in several TEST_sstruct/periodic runs (#291 ) The previous -rhsone option on tests 80-95 resulted in CG solving the problem exactly in 4 or 5 iterations. This required the code to be bitwise accurate to pass the tests, which is not reasonable here. The new cosine-based rhs produces more typical convergence for CG on a Laplacian problem.	2021-03-01 06:13:53 -08:00
Rob Falgout	5122196348	Adding filter to check-license test to ignore runtests-* files	2021-02-18 07:00:23 -08:00
Ruipeng Li	68f510c11b	Test jobs for enable-mixedint (#280 ) This PR separates the jobs that break with --enable-mixedint (CGC, ParaSail, Euclid) from regular job scripts, so regression tests can selectively run jobs with --enable-mixedint. The new runtests<-option> files contain a list of tests for the runtest.sh script that can be run by passing <-option> to the 'run.sh' autotest script. This should enable more flexibility for building regression test suites. We should also revisit the notes pointed out by @rfalgout at some point. Co-authored-by: Rob Falgout <rfalgout@llnl.gov>	2021-02-17 20:24:58 -08:00
Rob Falgout	277b357858	Removing 'memory.h' include since it is not ISO standard (#279 ) The 'memory.h' header file is not standard ISO C, but it is standard C++. Apparently, some C compilers may accidentally find a C++ version of this file and cause compile problems. See issue #274 .	2021-02-16 08:20:02 -08:00
Ruipeng Li	c4ca3e1050	fixed wrong pointer checking; error handler in debug mode (#275 ) This PR fixed a pointer comparison error in GPU matvec y=Ax, (if both pointers of x and y are NULL, should not throw errors).	2021-02-10 23:11:32 -08:00
Rob Falgout	33da39f68a	Fixed a typo in the sstruct.in.README file comment	2021-02-10 17:55:01 -08:00
Ruipeng Li	51e5a4c6de	SStruct interface without UVM (#170 ) In this PR, we removed the dependency of UVM when building with CUDA for the SStruct solvers, and added a "memory tracker" to help debugging memory leak or misuse.	2021-02-09 11:21:39 -08:00
Ruipeng Li	3438132e1a	GPU examples (#268 ) This PR adds GPU-support to hypre's examples. A new `Makefile_gpu` is provided in `src/examples` as an sample makefile, so one can compile the examples with make -f Makefile_gpu if hypre has been built with GPUs.	2021-02-09 11:19:05 -08:00
Rob Falgout	6eb66f8695	Fixed a small mistake in the sludist.sh test	2021-02-09 06:44:39 -08:00
Rob Falgout	be18e595ae	Remove the global partition code from hypre (#273 ) This PR removes the global partition code from hypre.	2021-02-08 15:16:29 -08:00
Rob Falgout	6f9260b67c	Add saved-file extension to runtest (#271 ) This pull request adds a -save <ext> feature to the runtest.sh script to allow testing against different saved files on different platforms such as GPU machines. See Issue #255. A few additional things were done: - All of the checks against the saved files were moved out of the individual tests and into runtest.sh. - The output-file sanity checks that are in many of the tests were modified so they no longer depend on the saved files. Several issues were also uncovered and fixed.	2021-02-08 15:11:45 -08:00
Ramesh Pankajakshan	414fa671be	Umpire (#243 ) This PR contains the support of UMPIRE pooling allocators for host and GPU memory. Configure hypre with --with-umpire, device and uvm allocations and deallocations are done with umpire, whereas host pool is not enabled by default. This PR also includes some other minor changes: Adding .gitignore to the repo Removing all malloc/calloc/realloc/free and regression testing on finding them No longer compile ij.c with C++ compiler. It goes back to a C code now. Introducing HYPRE_USING_GPU, which is equivalent to HYPRE_USING_CUDA \|\| HYPRE_USING_DEVICE_OPENMP Adding a few user-level interfaces: HYPRE_SetMemoryLocation, HYPRE_SetExecutionPolicy, HYPRE_SetGPUMemoryPoolSize and HYPRE_CSRMatrixSetSpGemmUseCusparse Co-authored-by: li50@llnl.gov <liruipengblue@gmail.com> Co-authored-by: Rob Falgout <rfalgout@llnl.gov> Co-authored-by: Ruipeng Li <li50@llnl.gov>	2021-02-03 12:31:25 -08:00
Ruipeng Li	8462f60dc7	Hypre warp bitshift (#267 ) This PR adds HYPRE_WARP_BITSHIFT macro, which will allow us to hide instances of '>> 5' for forthcoming HIP changes that will need a bit shift of 6 rather than 5. This PR was copied from #265 by @pbauman Co-authored-by: Paul T. Bauman <ptbauman@gmail.com>	2021-01-27 11:22:09 -08:00
Ruipeng Li	2186a8fb34	triangular solve on GPUs; runcheck (#256 ) This PR fixes triangular solve on GPUs, and runcheck.sh Co-authored-by: Daniel Osei-Kuffuor <oseikuffuor1@llnl.gov>	2021-01-15 20:46:59 -08:00
Daniel Osei-Kuffuor	bd76daf124	Updated saved files to reflect change in NSH solve on ILU Schur system -- See PR#251. (#254 )	2021-01-11 20:22:36 -08:00
Daniel Osei-Kuffuor	6a1caf8998	Modification to fix error code warning for coarse level solver (#251 ) Set tolerance for ILU Schur solver to zero for NSH. Schur solver convergence will be controlled by the max. number of iterations.	2021-01-08 16:57:22 -08:00
Ruipeng Li	a6c852be52	fixed syntax error with --enable-gpu-aware-mpi (#250 ) This PR fixed compile errors with `--enable-gpu-aware-mpi', and added more comments regarding syncing CUDA stream when doing GPU-GPU MPI.	2021-01-04 16:05:27 -08:00
Ruipeng Li	950f9f2505	fix GPU SpMV for zero matrices (#246 ) This PR fixes the issue with zero-sized matrix SpMV on GPUs.	2020-12-22 21:47:36 -08:00
Ruipeng Li	b49727f16b	Cuda triangular smoothers (#240 ) * This commit has CUDA based smoothers for AMG based on the triangular parts of sparse matrices. This includes an Gauss-Seidel (relax_type==3), which uses CUSPARSE triangular solvers to invert L. Symmetric Gauss Seidel is implemented in relax_type==6 also via CUSPARSE. Finally, 2 new smoothers are added. THe first is a 2 stage approximation to Gauss Seidel using a parallel MatVec and L (relax_type==11). The second (relax_type==12) is a less effective version of 11. It uses A_diag instead of L for the smoothing. CPU implementations of these new smoothers are also provided. For the two stage algorithms, L and U are NOT explicitly created. This seems faster and saves memory. In the two stage preconditioner, multiply by invdiag rather than divide by diagonal reduces register pressure and yields full occupancy. Co-authored-by: Paul Mullowney <pmullown@nrel.gov> Co-authored-by: PaulMullowney <60452402+PaulMullowney@users.noreply.github.com>	2020-12-17 19:37:59 -08:00
Sarah Osborn	763ea8a5a8	cmake: Optionally accept path to BLAS/LAPACK libraries (#230 ) Allow a user to specify the path to BLAS/LAPACK libraries within cmake, and bypass the cmake find_package logic.	2020-12-17 12:21:30 -06:00
Ruipeng Li	804609b6c4	Reorg relax (#237 ) This PR refactors the relaxation routines on CPUs and modularize the various Jacobi and Gauss-Seidel (G-S) methods in two "core" kernels.	2020-12-07 09:05:36 -08:00
Daniel Osei-Kuffuor	9c24f006a6	ILU updates (#239 ) * Fix bug in communication for Schur offD element map and update comments on communication pattern options. * Fix bug to avoid double deletion of solution array.	2020-12-04 11:13:21 -08:00
Ruipeng Li	eae9be29be	bug fix cycle_param (#236 ) This PR fixes the improper use of relax_type in relax-18 with CF relax, see #235	2020-11-23 13:09:59 -08:00
Daniel Osei-Kuffuor	56012897e1	Ilu dev 2019 (#160 ) This merge introduces new features to the parallel ILU solvers in hypre. In particular we have GPU support for BJ-ILU(0) and GMRES-ILU(0). In addition, this merge includes a new option for GMRES-ILU(0) using MILU(0) to build restriction/interpolation operators used to construct the Schur complement matrix by a Galerkin product. This option is also available on the GPU. Key commits include: * ILU updates with bug fixes for compiling the cuda version * Update local RCM ordering option to support nonsymmetric matrices * Update regression tests to test new features * Reference manual updates, Code cleanup and bug fixes Co-authored-by: Tianshi Xu <xu16@ray59.coralea.llnl.gov> Co-authored-by: Tianshi Xu <xu16@lassen708.coral.llnl.gov> Co-authored-by: Xu <xu16@bellsofireland.llnl.gov> Co-authored-by: Kote Hitenze <hitenze@jotenshis-MacBook-Pro.local> Co-authored-by: Tianshi Xu <xuxx1180@umn.edu> Co-authored-by: Ruipeng Li <li50@llnl.gov>	2020-11-22 22:16:56 -06:00
Rob Falgout	2bc4228eca	Changed sludist.saved file to correct new superlu-dist autotest errors	2020-11-13 06:20:27 -08:00
David M. Rogers	796ab0af48	Use basename when checking compiler in configure (#225 ) Use `basename` to strip off the path when (for example) `CC=/path/to/compiler/gcc` before testing for specific compilers in the `configure` script.	2020-11-09 06:23:03 -08:00
Luke	22f4d3f8c6	Cuda 11 API (#163 ) This PR adds CUDA-11 support.	2020-11-05 20:57:57 -08:00
Meisam	641f7a4e31	Minor spelling fix (#222 ) This PR fixes a typo in doc.	2020-11-02 08:51:42 -08:00
Ruipeng Li	2e1ccee243	Euclid fix (#218 ) This PR fixes the integer overflow problem in Euclid.	2020-11-02 08:50:50 -08:00
Daniel Osei-Kuffuor	5ac2b3a54a	Improve portability for update-release script (for LINUX, UNIX and macOS). (#227 ) Modified update-release.sh script to improve portability (for LINUX, UNIX and macOS). * The use of the 'date' command has been modified to use GNUs 'date' command (if installed). * In addition, single quotes for sed commands have been replaced by double quotes to allow the use of single quotes around internal variables. This appears to be more portable than the use of '\x27'. Note that this means shell meta- characters need to be escaped if they need to be treated as string literals. Other said lines are also modified accordingly for consistency.	2020-10-29 08:39:34 -07:00
Ruipeng Li	9fb1b351c3	MS-Windows OMP pragma (#223 ) This PR fixes OpenMP pragma in Windows when not using MSVC.	2020-10-27 08:45:58 -07:00
Ruipeng Li	636706acd7	Fixing compile issues --with-caliper (#216 ) This PR fixed compile issues --with-caliper and a region mismatch issue with caliper. Co-authored-by: Victor A. P. Magri <paludettomag1@llnl.gov>	2020-10-09 20:29:27 -07:00
Rob Falgout	dd4ddba0f3	Added a filter to runtest for 'lrun warning'	2020-10-08 13:43:18 -07:00
Rob Falgout	ff45ecef32	Set default convergence tolerance to 1.0e-6 (#206 ) This sets the default convergence tolerance to 1.0e-6 uniformly in hypre. Only three solvers had to be updated (AMG, ILU, and MGR), along with the corresponding documentation.	2020-09-28 19:28:43 -07:00

1 2 3 4 5 ...

11546 Commits