hypre

CFD/hypre

Author	SHA1	Message	Date
Victor A. P. Magri	593fe4cba7	Apply Debian patches (#966 ) 1. Fix make checkpar 2. Add missing `finalizeAllTimings 3. Add error code support to checktest.sh --------- Co-authored-by: Drew Parsons <dparsons@debian.org>	2023-09-22 14:41:40 -04:00
Rui Peng Li	36ab29ba09	L1 HSGS (#927 ) This PR provides a convergent l1-hybrid symmetric Gauss-Seidel (HSGS) method.	2023-09-05 12:44:05 -07:00
tisaac	cd8f9c363e	Keep smooth_num_levels in sync with amg_data (#954 ) This solves an out-of-bounds memory error during `hypre_BoomerAMGSetup` when called multiple times without a call to `hypre_BoomerAMGDestroy` interleaved. This pull request makes sure that `smooth_num_levels` is reset to `hypre_ParAMGDataSmoothNumLevels(amg_data)` before the smoothers variable is allocated.	2023-09-05 15:29:30 -04:00
Victor A. P. Magri	0e43aec1d4	Add Binary I/O functions for IJ matrices and vectors (#826 ) This PR adds new Print and Read functions for matrices and vectors to be stored/read in binary format. A detailed list of changes is given below: * Add IJMatrix/ParCSRMatrix routines for binary I/O * Add IJVector/ParVector routines for binary I/O * Add typedefs for unsigned integer types and single-precision floating-point * Change char sizes to HYPRE_MAX_FILE_NAME_LEN * Add options to IJ driver for reading binary matrices/vectors * Add regression tests for IJ input/output	2023-08-27 20:10:55 -04:00
Victor A. P. Magri	b372b31a11	Change sh to bash (#900 ) Change shell scripts from `#!/bin/sh` to `#!/bin/bash`	2023-08-16 20:09:43 -04:00
Victor A. P. Magri	03b9d3d090	Fix Copyright message (#951 ) Fix year in Copyright message of a few source files.	2023-07-20 21:27:37 -04:00
Victor A. P. Magri	9449cb919e	Add MAGMA option to FSAI (#940 ) Allow the use of MAGMA as local linear solver for FSAI. Add `HYPRE_FSAISetLocalSolveType` for choosing the local linear solve type used in FSAI and add `HYPRE_BoomerAMGSetFSAILocalSolveType` for the case when FSAI is used as a smoother to BoomerAMG.	2023-07-20 19:19:20 -04:00
Victor A. P. Magri	31e3bf3b06	Add FSAI support with CUDA and HIP (#739 ) This PR adds CUDA and HIP support to FSAI according to a static pattern generation algorithm. The resulting method can also be used as a preconditioner for BoomerAMG. A detailed list of changes is given below: * Add par_fsai_device.c * Add hypre_FSAIApply * Add function to dump local linear systems in dense format * Implement static FSAI pattern computation via powers of A * Improve filtering of candidate pattern * Improve local linear systems extraction * Add option for a 125pt matrix (27pt squared) * Add options to control sizes of the memory pools with umpire * Add hypre_GpuProfiling calls * Improve candidate pattern truncation times * Add max_nnz_row member and its private and public functions to FSAI * Use max_nnz_row in FSAISetupDevice * Add num_levels member and its private and public functions to FSAI * Add threshold member and its public/private functions to FSAI * Expose FSAI algorithm type to BoomerAMG * Expose options to control FSAI setup * Add cuSOLVER variables and calls * Add batched dense linear solver calls to FSAI * Improve execution time for generating random numbers * Show FSAI parameters when amg_print_level >= 1 * Improve output of FSAIPrintStats * Implement warp calls * Add hypre_mask type and hypre_ballot_sync wrapper function * Add hypre_popc and hypre_ffs wrapper functions * Implement warp_allreduce_max calls * Change: hypreDevice -> hypre_Device Add rocSOLVER calls * Apply astyle * Remove redundant line	2023-07-20 12:34:53 -04:00
Victor A. P. Magri	7df4f63377	Improve ILU documentation (#939 ) Add warnings for Euclid and PILUT redirecting users to hypre-ILU. Rewrite hypre-ILU overview section. Add new sections to hypre-ILU documentation: "User-level functions", "ILU as smoother for BoomerAMG", and "GPU support". Include info about new iterative ILU options. Update BoomerAMG complex smoothers section. Change name "hypre-ILU" to "ILU"	2023-07-17 13:43:57 -04:00
adam-sim-dev	64604ec075	Fix typo in krylov solvers comments (#938 )	2023-06-24 08:47:02 -04:00
Rob Falgout	09d4bd8849	Update mac autotest to use a tolerance when diffing residual norms (#926 )	2023-06-23 10:42:28 -07:00
Rob Falgout	8f6bdc6ab8	Release 2.29.0 (#936 )	2023-06-23 05:01:41 -07:00
Victor A. P. Magri	5e0bf05b42	Fax valgrind issue in AIR (#930 )	2023-06-20 15:52:18 -04:00
Victor A. P. Magri	64ce05b405	Offd num cols fix (#901 ) Modified from #718, this PR squashes out zero columns of the off-diagonal part of a `hypre_ParCSRMatrix`. The issue was in offd there exist empty columns (columns with no nonzeros), which correspond to "useless" entries in col_map_offd. This caused issues in at coarser grids in the communications with large number of ranks. We added a routine to compress the zero columns out and shorten col_map_offd. This should reduce communication cost even at higher levels. Two sources of the empty columns have been located and fixed: - Truncation after building P - P^T(AP): only the transpose multiplication part. --------- Co-authored-by: Noel Chalmers <noel.chalmers@gmail.com> Co-authored-by: Ruipeng Li <li50@llnl.gov> Co-authored-by: Wayne Mitchell <mitchell82@llnl.gov>	2023-06-19 20:18:26 -04:00
Victor A. P. Magri	a39cecc555	Fix compilation error (#929 ) This PR fixes a compilation error that arises in PETSc when hypre is used	2023-06-17 22:21:37 -04:00
Victor A. P. Magri	18b98805ba	Extend ILU support to AMD GPUs (#877 ) This PR adds HIP support to hypre_ILU (setup and solve phases): - Algorithm type 0 (BJ-ILU0) - Algorithm type 10 (GMRES-ILU0) - Iterative triangular solves for backward and forward substitutions. --------- Co-authored-by: Paul Mullowney <Paul.Mullowney@nrel.gov>	2023-06-15 20:10:52 -04:00
Rui Peng Li	72f5f3e136	Cuda versions (#879 ) This PR adds support and regression tests for all the versions from CUDA 9.0 to 12.0.	2023-06-15 06:26:12 -07:00
Rui Peng Li	7d1d9ca95c	ame/ams to use Jacobi on GPUs (#924 ) This PR updates the ams driver to use Jacobi smoothers on GPUs.	2023-06-14 13:25:59 -07:00
Victor A. P. Magri	f02bc17bbd	Fix device build (#925 ) This PR removes a call to hypre_error_w_msg inside two GPU lambdas and fixes the device build.	2023-06-14 11:07:13 -04:00
Rob Falgout	f478498295	New error handling feature to print messages to memory (#920 ) This allows users to direct hypre's error messages to a memory buffer instead of stderr. With this, there are now three basic ways to use hypre when configured --with-print-errors: - Default (mode 0): Errors are printed immediately to stderr (there is no processor information available in this print). - Store errors in memory (mode 1) and call PrintErrorMessages to print them. - Store errors in memory (mode 1) and call GetErrorMessages to manage the error messages however you like.	2023-06-13 20:17:25 -07:00
Rob Falgout	f3e8fb852f	Fix warning messages about set-but-not-used variables (#923 ) Fixed warning messages about set-but-not-used variables and ran astyle	2023-06-13 13:20:56 -07:00
Wayne Mitchell	2794aa3b28	SYCL MGR, AMS, AIR, etc. (#904 ) This PR enables more functionality with sycl: MGR AMS/ADS/AME AIR restriction	2023-06-12 14:03:52 -07:00
Victor A. P. Magri	48d13c6e28	Add MAGMA support (#868 ) * Add MAGMA support to autotools build * Add MAGMA interface files * Call MAGMA init/finalize interfaces * Add MAGMA support to CMake build	2023-06-11 23:17:22 -04:00
Victor A. P. Magri	80bf97e8a2	Check whether CF_marker exists before using it (#918 ) This PR implements a check on whether CF_marker exists before using it.	2023-06-09 11:37:09 -04:00
Victor A. P. Magri	f45de31a3d	Fix exact block inversion with HIP in MGR Fix cublasHandle -> vendorSolverHandle	2023-06-05 18:04:27 -04:00
Victor A. P. Magri	f7850206cd	Extend exact block inversion to HIP via batched solvers (#913 ) This PR adds HIP support to exact block inversion in MGR via rocSOLVER's batched routines.	2023-06-05 14:04:44 -04:00
Victor A. P. Magri	bd1073ad70	MGR non-UVM device support (#906 ) Allow MGR to work without UVM in device runs	2023-06-02 17:40:18 -04:00
Wayne Mitchell	7ff7f2f60d	oneDPL fixes and Sunspot regressions (#905 ) Fixes needed due to recent changes with oneDPL. Move regression testing of sycl build to sunspot.	2023-05-30 10:23:01 -07:00
Victor A. P. Magri	37bd6071fc	Fix SetInitialized/Finalized position (#912 ) This PR fixes the positions of the hypre_SetInitialized and hypre_SetFinalized calls	2023-05-26 17:20:37 -04:00
Victor A. P. Magri	2b2e9d2eee	Update comment about `HYPRE_BoomerAMGSetGridRelaxPoints` (#908 ) The function HYPRE_BoomerAMGSetGridRelaxPoints is particularly useful for AIR. Thus, it should not be phased out.	2023-05-19 14:53:03 -04:00
Rui Peng Li	ec86992c4b	Cuda12 (#871 ) This PR adds the support for CUDA 12.	2023-05-17 20:01:41 -07:00
Victor A. P. Magri	8b39b73a52	Fixes for Rocm 5.4.3 (#902 ) * Use unroll_factor=8 for rocm-5.4.3 * Add SortCSRRocsparse back * Fix Wunused-variable warnings * Set _hypre_memory_tracker to NULL after destroy * Update tioga results after changing default rocm version to 5.2.0	2023-05-11 09:05:26 -04:00
Victor A. P. Magri	412a6b1a48	Update sanity checks (#903 ) This fixes issue #883	2023-05-10 18:45:16 -04:00
Victor A. P. Magri	6ac5d7d0c8	Remove deprecated warning from Init (#899 ) When using HYPRE_Init, the compiler no longer gives a deprecated function warning message.	2023-05-08 10:25:07 -04:00
Victor A. P. Magri	fbfc271cfa	Remove deprecated warning from Init (#899 ) When using HYPRE_Init, the compiler no longer gives a deprecated function warning message.	2023-05-08 10:24:32 -04:00
Victor A. P. Magri	991f2e15eb	Fix memory leak on BoomerAMG complex smoothers (#890 ) The memory leak was happening when: A complex smoother for BoomerAMG was selected. The AMG hierarchy consisted of one level. The BoomerAMG preconditioner was destroyed and recomputed again.	2023-05-04 10:59:02 -04:00
Victor A. P. Magri	dc487086b4	Add HYPRE_Initialized and HYPRE_Finalized (#889 ) * Add hypre_State type to track initialization state of hypre * Add HYPRE_Initialized to determine whether hypre has been initialized * Add HYPRE_Finalized to determine whether hypre has been finalized * Add private implementations for hypre_initialized/finalized * Add HYPRE_Initialize * Update Fortran interface for HYPRE_Initialize * Use HYPRE_Initialize in test drivers and examples * Clean-up mentions of HYPRE_Init * Add HYPRE_DEPRECATED macro to autotools and CMake builds * Add regression test for library initialization/finalization	2023-05-04 10:42:40 -04:00
Victor A. P. Magri	cfb3ae4e32	Update `HYPRE_WARP_FULL_MASK` for HIP builds (#895 ) * Update HYPRE_WARP_FULL_MASK to 64-bit length for HIP * Add hypre_mask type depending on the GPU architecture * Change unsigned -> hypre_uint. Move a few hypre_int to hypre_uint	2023-05-03 06:44:45 -04:00
adam-sim-dev	e351324df8	Fix the comments (#892 ) This PR fixes comments in testij.c	2023-05-02 08:37:43 -07:00
Victor A. P. Magri	82b40f72f4	Fix device OpenMP build (#894 ) * Remove SyncCompute call to fix compilation with device omp * Fix hypre_SeqVectorAxpyzDevice implementation for device omp * Add warning for function not implemented for device omp	2023-05-02 11:21:51 -04:00
Wayne Mitchell	ba1f15b082	HYPRE_USING_GPU and HYPRE_USING_DEVICE_OPENMP (#742 ) Do not enable HYPRE_USING_GPU when using the device openmp backend. This allows cuda/hip/sycl implementation throughout the code to be grouped under the HYPRE_USING_GPU macro instead of always combining the cuda/ hip/sycl macros. In addition some other extraneous macro guards are removed.	2023-04-29 16:04:56 -07:00
Victor A. P. Magri	b08a962924	Fix comparisons between HYPRE_Real and HYPRE_Complex (#880 ) This fixes the machine-tux regression failures from 04/12/23	2023-04-12 14:04:18 -04:00
Victor A. P. Magri	461b6eee60	New features for MGR (#817 ) This PR adds device support to various MGR options: Non-galerkin coarse grid correction options (except for option 4) Block diagonal interpolation (interp_type = 12) Block Jacobi relaxation (level_smooth_type = 0 for global relaxation and interp_type = 12 for F-relaxation) The main code changes are listed below: * Add hypre_ParCSRMatrixExtractBlockDiagDevice * Add hypre_ParCSRMatrixExtractBlockDiagDevice and respective GPU kernels * Add hypre_ParCSRMatrixGenerateFFFCHost and respective backend wrapper * Add device support to hypre_MGRBuildPBlockJacobi * Add hypre_ParCSRMatrixBlockDiagMatrixDevice * Add hypre_ParCSRMatrixExtractBlockDiagDevice * Add MGRBuildPFromWpDevice * Add implementation for batched matrix transpose on the device * hypre_ParCSRMatrixDropSmallEntriesDevice: exit if tolerance is zero * Add hypre_ParCSRMatrixGenerateCCCFDevice * Port MGR's Non-Galerkin option to device * Add L1-Jacobi global smoother to MGR * Add missing comments about MGR's public APIs * Add hypre_MGRComputeNonGalerkinCGDevice * Update style of hypre_MGRCycle * Add sanity checks to hypre_SeqVectorElmdivpyMarked * Add hypre_MGRBlockRelaxSolveDevice * Add GPUProfiling to several places * MGR setup: simplify computation of l1_norms * MGR solve: make use of ParVectorSetZeros to make residual computations faster * Exit hypre_SeqVectorElmdivpyMarked earlier for vectors with zero size * Update caliper region names for MGR * Add wrappers to cublas batched getrf and getri functions * General performance improvements for MGR	2023-04-11 15:34:12 -04:00
Victor A. P. Magri	416a8f8323	Fix memory leak at ParCSRMatrix host multiplication (#873 ) Check whether AT_diag/offd already exist before computing them	2023-04-07 10:24:54 -04:00
Victor A. P. Magri	58d1abbc27	Change variables to unsigned long long int (#874 ) Solves GitHub issue #870	2023-04-05 16:33:24 -04:00
Victor A. P. Magri	f69f8ef469	Fix typo on `hypre_GpuMatDataMatDescr` variable (#864 ) * The Accessor macro is being changed from Decsr to Descr * Run astyle --------- Co-authored-by: Paul Mullowney <Paul.Mullowney@nrel.gov>	2023-03-22 22:52:47 -04:00
Rui Peng Li	2391e4703f	Fix memory tracker with omp (#844 ) This PR fixes hypre memory tracker with OMP threading.	2023-03-22 16:01:24 -07:00
Wayne Mitchell	1eabaf5f33	Report error on hypre calls outside of HYPRE_Init() - HYPRE_Finalize() (#837 ) When hypre handle is not initialized and a call to hypre_handle() is made, report an error and call HYPRE_Init() rather than just calling hypre_HandleCreate() with no error reporting.	2023-03-21 14:31:52 -07:00
Victor A. Paludetto Magri	9e84e35bf4	Add ILU device support independent of UVM (#816 ) This PR has three main goals: 1. Enable ILU to work without UVM on device builds. 2. Add the runtime switch option to ILU. Thus, in device builds, one can choose now whether to execute ILU's setup and solve on the device or host. 3. Improve error handling. e.g, when trying to execute on the device ILU types that are not supported yet. A summary of the code changes is given below: * Enable ILU reordering algorithms to work without UVM memory. * Update hypre_ILUSetupILU0Device with new intArray functions * Fix memory location of S_offd + style updates * Simplify hypre_ParILUCusparseILUExtractEBFC * Remove unused function: hypre_ParILUCusparseExtractDiagonalCSR * Refactor hypre_ILUGetInteriorExteriorPerm * Refactor hypre_ILULocalRCM * Refactor hypre_ILUSortOffdColmap * Refactor hypre_ILUGetLocalPerm * Refactor hypre_ILUGetPermddPQ * Fix memory transfer at hypre_ILUGetInteriorExteriorPerm * Add error messages for ILUK and ILUT for device runs * Add error messages for GMRES-ILUK and GMRES-ILUT for device runs * Add execution policy to ILU setup/solve * Bug fixes for -exec_host/-memory_host on device builds * Add ILU reordering option to IJ driver * Refactor hypre_ILUSetupILUKDevice * Refactor hypre_ILULocalRCM * Remove duplicated code * Bug fix on hypre_ILUGetPermddPQPre * Update lassen results in accordance with CPU runs --------- Co-authored-by: Paul Mullowney <Paul.Mullowney@nrel.gov>	2023-03-18 17:26:40 -04:00
Victor A. Paludetto Magri	deece5580f	Auxiliary space preconditioners fix (#859 ) This PR fixes the data type of cheby_fraction used as input in the SetChebySmoothingOptions functions of AMS and ADS	2023-03-17 14:29:37 -04:00

1 2 3 4 5 ...

12180 Commits