eigen

CFD/eigen

Author	SHA1	Message	Date
Antonio Sánchez	73b2c13bf2	Disable f16c scalar conversions for MSVC.	2022-03-30 18:35:32 +00:00
Essex Edwards	cd3c81c3bc	Add a NNLS solver to unsupported - issue #655	2022-03-23 20:20:44 +00:00
Antonio Sánchez	19a6a827c4	Optimize visitor traversal in case of RowMajor.	2022-03-23 15:27:57 +00:00
Antonio Sánchez	9deaa19121	Work around g++-10 docker issue for geo_orthomethods_4.	2022-03-16 21:46:04 +00:00
Antonio Sánchez	01b5bc48cc	Disable schur non-convergence test.	2022-03-16 17:33:53 +00:00
Erik Schultheis	421cbf0866	Replace Eigen type metaprogramming with corresponding std types and make use of alias templates	2022-03-16 16:43:40 +00:00
Antonio Sánchez	baf9a985ec	Fix swap test for size 1 inputs.	2022-03-10 15:05:58 +00:00
Tobias Schlüter	9883108f3a	Remove copy_bool workaround for gcc 4.3	2022-03-08 17:43:11 +00:00
John Mather	3a9d404d76	Add support for Apple's Accelerate sparse matrix solvers	2022-03-08 00:09:18 +00:00
Antonio Sanchez	28c7c1a629	Log position of first difference for easier debugging.	2022-03-07 19:06:27 +00:00
Antonio Sánchez	b2ee235a4b	Split and reduce SVD test sizes.	2022-03-05 00:15:28 +00:00
Antonio Sánchez	27d8f29be3	Update vectorization_logic tests for all platforms.	2022-03-03 19:54:15 +00:00
Antonio Sánchez	711803c427	Skip denormal test if `Cond` is false.	2022-03-03 04:32:13 +00:00
Antonio Sánchez	9c07e201ff	Modified sqrt/rsqrt for denormal handling.	2022-03-02 17:20:47 +00:00
Antonio Sánchez	f03df0df53	Fix SVD for MSVC.	2022-02-28 19:53:15 +00:00
Antonio Sánchez	2ed4bee78f	Fix frexp packetmath tests for MSVC.	2022-02-24 22:16:37 +00:00
Antonio Sánchez	d58e629130	Disable deprecated warnings for SVD tests on MSVC.	2022-02-24 21:20:49 +00:00
Antonio Sánchez	3d7e2d0e3e	Fix packetmath compilation error.	2022-02-23 23:27:08 +00:00
Antonio Sánchez	8970719771	Fix gcc-5 packetmath_12 bug.	2022-02-23 21:56:25 +00:00
Antonio Sánchez	f0b81fefb7	Disable deprecated warnings in SVD tests.	2022-02-23 18:32:00 +00:00
Rasmus Munk Larsen	8b875dbef1	Changes to fast SQRT/RSQRT	2022-02-23 17:32:21 +00:00
Arthur	cd80e04ab7	Add assert for edge case if Thin U Requested at runtime	2022-02-23 05:35:19 +00:00
Lingzhu Xiang	35727928ad	Fix test macro conflicts with STL headers in C++20	2022-02-23 07:43:30 +08:00
Antonio Sánchez	28e008b99a	Fix sqrt/rsqrt for NEON.	2022-02-15 21:31:51 +00:00
Rasmus Munk Larsen	92d0026b7b	Provide a definition for numeric_limits static data members	2022-02-08 20:34:53 +00:00
Antonio Sánchez	9441d94dcc	Revert "Make fixed-size Matrix and Array trivially copyable after C++20" This reverts commit `47eac21072`	2022-02-05 04:40:29 +00:00
Rasmus Munk Larsen	979fdd58a4	Add generic fast psqrt and prsqrt impls and make them correct for 0, +Inf, NaN, and negative arguments.	2022-02-05 00:20:13 +00:00
Arthur	18b50458b6	Update SVD Module with Options template parameter	2022-02-02 00:15:44 +00:00
Rasmus Munk Larsen	7db0ac977a	Remove extraneous ")".	2022-01-27 02:20:03 +00:00
Rasmus Munk Larsen	09c0085a57	Only test pmsub, pnmadd, and pnmsub on signed types.	2022-01-27 02:09:25 +00:00
Rasmus Munk Larsen	8f2c6f0aa6	Make preciprocal IEEE compliant w.r.t. 1/0 and 1/inf.	2022-01-26 20:38:05 +00:00
Erik Schultheis	d271a7d545	reduce float warnings (comparisons and implicit conversions)	2022-01-26 18:16:19 +00:00
Rasmus Munk Larsen	51311ec651	Remove inline assembly for FMA (AVX) and add remaining extensions as packet ops: pmsub, pnmadd, and pnmsub.	2022-01-26 04:25:41 +00:00
Erik Schultheis	4e629b3c1b	make casts explicit and fixed the type	2022-01-24 18:19:21 +00:00
Rasmus Munk Larsen	ea2c02060c	Add reciprocal packet op and fast specializations for float with SSE, AVX, and AVX512.	2022-01-21 23:49:18 +00:00
Arthur Feeney	4b0926f99b	Prevent heap allocation in diagonal product	2022-01-21 21:15:44 +00:00
Erik Schultheis	970640519b	Cleanup	2022-01-21 01:48:59 +00:00
Lingzhu Xiang	47eac21072	Make fixed-size Matrix and Array trivially copyable after C++20 Making them trivially copyable allows using std::memcpy() without undefined behaviors. Only Matrix and Array with trivially copyable DenseStorage are marked as trivially copyable with an additional type trait. As described in http://www.open-std.org/jtc1/sc22/wg21/docs/papers/2019/p0848r3.html it requires extremely verbose SFINAE to make the special member functions of fixed-size Matrix and Array trivial, unless C++20 concepts are available to simplify the selection of trivial special member functions given template parameters. Therefore only make this feature available to compilers that support C++20 P0848R3. Fix #1855.	2022-01-07 19:04:35 +00:00
Rasmus Munk Larsen	96dc37a03b	Some fixes/cleanups for numeric_limits & fix for related bug in psqrt	2022-01-07 01:10:17 +00:00
Rasmus Munk Larsen	7b5a8b6bc5	Improve plog: 20% speedup for float + handle denormals	2022-01-05 23:40:31 +00:00
Andrew Johnson	a491c7f898	Allow specifying inner & outer stride for CWiseUnaryView - fixes #2398	2022-01-05 19:24:46 +00:00
Rohit Santhanam	27a78e4f96	Some serialization API changes were made in commit...	2022-01-05 16:18:45 +00:00
Lingzhu Xiang	7244a74ab0	Add bounds checking to Eigen serializer	2022-01-03 17:00:24 +08:00
David Tellenbach	22a347b9d2	Remove unused EIGEN_HAS_STATIC_ARRAY_TEMPLATE `ec2fd0f7` removed the EIGEN_HAS_STATIC_ARRAY_TEMPLATE but forgot to remove this last occurrence. This fixes issue #2399.	2021-12-30 15:26:55 +00:00
David Tellenbach	d705eb5f86	Revert "Select AVX2 even if the data size is not a multiple of 8" Tests are failing for AVX and NEON. This reverts commit `eb85b97339`.	2021-12-28 23:57:06 +01:00
David Tellenbach	6e95c0cd9a	Add missing internal namespace The vectorization logic tests miss some namespace internal qualifiers.	2021-12-27 23:50:32 +00:00
Erik Schultheis	f7a056bf04	Small fixes This MR fixes a bunch of smaller issues, making the following changes: * Template parameters in the documentation are documented with `\tparam` instead of `\param` * Superfluous semicolon warnings fixed * Fixed the type of literals used to initialize float variables	2021-12-21 16:46:09 +00:00
Erik Schultheis	c20e908ebc	turn some macros into constexpr functions	2021-12-10 19:27:01 +00:00
Erik Schultheis	495ffff945	removed helper cmake macro and don't use deprecated COMPILE_FLAGS anymore.	2021-12-09 23:09:56 +00:00
Rasmus Munk Larsen	f04fd8b168	Make sure exp(-Inf) is zero for vectorized expressions. This fixes #2385 .	2021-12-08 17:57:23 +00:00
Erik Schultheis	cc11e240ac	Some further cleanup	2021-12-06 18:01:15 +00:00
Erik Schultheis	ec2fd0f7ed	Require recent GCC and MSVC and removed `EIGEN_HAS_CXX14` and some other feature test macros	2021-12-01 00:48:34 +00:00
Rasmus Munk Larsen	085c2fc5d5	Revert "Update SVD Module to allow specifying computation options with a...	2021-11-30 18:45:54 +00:00
Jakub Gałecki	1b8dce564a	bugfix: issue #2375	2021-11-29 22:26:15 +00:00
Francesco Mazzoli	eb85b97339	Select AVX2 even if the data size is not a multiple of 8	2021-11-29 21:13:24 +00:00
Arthur	eef33946b7	Update SVD Module to allow specifying computation options with a template parameter. Resolves #2051	2021-11-29 20:50:46 +00:00
Erik Schultheis	f33a31b823	removed EIGEN_HAS_CXX11_* and redundant EIGEN_COMP_CXXVER checks	2021-11-29 19:18:57 +00:00
Erik Schultheis	ec4efbd696	remove EIGEN_HAS_CXX11	2021-11-24 20:08:49 +00:00
Erik Schultheis	7e586635ba	don't use deprecated MappedSparseMatrix	2021-11-19 15:58:04 +00:00
Erik Schultheis	b0fb5417d3	Fixed Sparse-Sparse Product in case of mixed StorageIndex types	2021-11-18 18:33:31 +00:00
Erik Schultheis	13954c4440	moved pruning code to SparseVector.h	2021-11-15 22:16:01 +00:00
Minh Quan HO	4284c68fbb	nestbyvalue test: fix uninitialized matrix - Doing computation with uninitialized (zero-ed ? but thanks Linux) matrix, or worse NaN on other non-linux systems. - This commit fixes it by initializing to Random().	2021-11-04 14:32:12 +01:00
Xinle Liu	478a1bdda6	Fix total deflation issue in BDCSVD, when & only when M is already diagonal.	2021-11-02 16:53:55 +00:00
Maxiwell S. Garcia	99600bd1a6	test: fix boostmultiprec test to compile with older Boost versions Eigen boostmultiprec test redefines a symbol that is already defined inside Boost Math [1]. Boost has fixed it recently [2], but this patch avoids errors if Boost version was less than 1.77. https://github.com/boostorg/math/blob/boost-1.76.0/include/boost/math/policies/policy.hpp#L18 `6830712302 (diff-c7a8e5911c2e6be4138e1a966d762200f147792ac16ad96fdcc724313d11f839)`	2021-10-25 20:32:33 +00:00
Rasmus Munk Larsen	2d3fec8ff6	Add nan-propagation options to matrix and array plugins.	2021-10-21 19:40:11 +00:00
Antonio Sanchez	fd5f48e465	Fix tuple compilation for VS2017. VS2017 doesn't like deducing alias types, leading to a bunch of compile errors for functions involving the `tuple` alias. Replacing with `TupleImpl` seems to solve this, allowing the test to compile/pass.	2021-10-20 19:18:34 +00:00
Antonio Sanchez	701f5d1c91	Fix gpu special function tests. Some checks used incorrect values, partly from copy-paste errors, partly from the change in behaviour introduced in !398. Modified results to match scipy, simplified tests by updating `VERIFY_IS_CWISE_APPROX` to work for scalars.	2021-10-01 10:20:50 -07:00
Antonio Sanchez	f0f1d7938b	Disable testing of complex compound assignment operators for MSVC. MSVC does not support specializing compound assignments for `std::complex`, since it already specializes them (contrary to the standard). Trying to use one of these on device will currently lead to a duplicate definition error. This is still probably preferable to no error though. If we remove the definitions for MSVC, then it will compile, but the kernel will fail silently. The only proper solution would be to define our own custom `Complex` type.	2021-09-27 15:15:11 -07:00
Kolja Brix	51a0b4e2d2	Reorganize test main file	2021-09-27 18:30:47 +00:00
Antonio Sanchez	de218b471d	Add -arch=<arch> argument for nvcc. Without this flag, when compiling with nvcc, if the compute architecture of a card does not exactly match any of those listed for `-gencode arch=compute_<arch>,code=sm_<arch>`, then the kernel will fail to run with: ``` cudaErrorNoKernelImageForDevice: no kernel image is available for execution on the device. ``` This can happen, for example, when compiling with an older cuda version that does not support a newer architecture (e.g. T4 is `sm_75`, but cuda 9.2 only supports up to `sm_70`). With the `-arch=<arch>` flag, the code will compile and run at the supplied architecture.	2021-09-24 20:48:01 -07:00
Antonio Sanchez	846d34384a	Rename EIGEN_CUDA_FLAGS to EIGEN_CUDA_CXX_FLAGS Also add a missing space for clang.	2021-09-24 20:15:55 -07:00
Antonio Sanchez	7b00e8b186	Clean up CUDA CMake files. - Unify test/CMakeLists.txt and unsupported/test/CMakeLists.txt - Added `EIGEN_CUDA_FLAGS` that are appended to the set of flags passed to the cuda compiler (nvcc or clang). The latter is to support passing custom flags (e.g. `-arch=` to nvcc, or to disable cuda-specific warnings).	2021-09-24 14:43:59 -07:00
Kolja Brix	afa616bc9e	Fix some typos found	2021-09-23 15:22:00 +00:00
sciencewhiz	4b6036e276	fix various typos	2021-09-22 16:15:06 +00:00
Antonio Sanchez	f49217e52b	Fix implicit conversion warnings in tuple_test. Fixes #2329.	2021-09-17 19:40:22 -07:00
Antonio Sanchez	9882aec279	Silence string overflow warning for GCC in initializer_list_construction test. This looks to be a GCC bug. It doesn't seem to reproduce in a smaller example, making it hard to isolate.	2021-09-17 18:33:50 +00:00
Antonio Sanchez	5dac0b53c9	Move Eigen::all,last,lastp1,lastN to Eigen::placeholders::. These names are so common, IMO they should not exist directly in the `Eigen::` namespace. This prevents us from using the `last` or `all` names for any parameters or local variables, otherwise spitting out warnings about shadowing or hiding the global values. Many external projects (and our own examples) also heavily use ``` using namespace Eigen; ``` which means these conflict with external libraries as well, e.g. `std::fill(first,last,value)`. It seems originally these were placed in a separate namespace `Eigen::placeholders`, which has since been deprecated. I propose to un-deprecate this, and restore the original locations. These symbols are also imported into `Eigen::indexing`, which additionally imports `fix` and `seq`. An alternative is to remove the `placeholders` namespace and stick with `indexing`. NOTE: this is an API-breaking change. Fixes #2321.	2021-09-17 10:21:42 -07:00
Rohit Santhanam	44da7a3b9d	Disable specific subtests that fail on HIP due to non-functional device side malloc/free (on HIP).	2021-09-17 16:19:03 +00:00
Rasmus Munk Larsen	6cadab6896	Clean up EIGEN_STATIC_ASSERT to only use standard c++11 static_assert.	2021-09-16 20:43:54 +00:00
Ryan Pavlik	3c87d6b662	Fix typos in copyright dates (cherry picked from commit 3335e0767cb847154e24f5d4fa345318309d1281)	2021-09-15 20:49:43 +00:00
Rohit Santhanam	a751225845	Minor fix for compilation error on HIP.	2021-09-12 14:06:58 +00:00
Antonio Sanchez	2e31570c16	Fix tuple_test after gpu_test_helper update. Duplicating the namespace `tuple_impl` caused a conflict with the `arch/GPU/Tuple.h` definitions for the `tuple_test`. We can't just use `Eigen::internal` either, since there exists a different `Eigen::internal::get`. Renaming the namespace to `test_detail` fixes the issue.	2021-09-11 20:24:42 -07:00
Antonio Sanchez	d06c639667	Fix unused variable warning and unnecessary gpuFree.	2021-09-11 20:02:22 -07:00
Antonio Sanchez	bf66137efc	New GPU test utilities. This introduces new functions: ``` // returns kernel(args...) running on the CPU. Eigen::run_on_cpu(Kernel kernel, Args&&... args); // returns kernel(args...) running on the GPU. Eigen::run_on_gpu(Kernel kernel, Args&&... args); Eigen::run_on_gpu_with_hint(size_t buffer_capacity_hint, Kernel kernel, Args&&... args); // returns kernel(args...) running on the GPU if using // a GPU compiler, or CPU otherwise. Eigen::run(Kernel kernel, Args&&... args); Eigen::run_with_hint(size_t buffer_capacity_hint, Kernel kernel, Args&&... args); ``` Running on the GPU is accomplished by: - Serializing the kernel inputs on the CPU - Transferring the inputs to the GPU - Passing the kernel and serialized inputs to a GPU kernel - Deserializing the inputs on the GPU - Running `kernel(inputs...)` on the GPU - Serializing all output parameters and the return value - Transferring the serialized outputs back to the CPU - Deserializing the outputs and return value on the CPU - Returning the deserialized return value All inputs must be serializable (currently POD types, `Eigen::Matrix` and `Eigen::Array`). The kernel must also be POD (though usually contains no actual data). Tested on CUDA 9.1, 10.2, 11.3, with g++-6, g++-8, g++-10 respectively. This MR depends on !622, !623, !624.	2021-09-10 14:22:50 -07:00
Antonio Sanchez	26e5beb8cb	Device-compatible Tuple implementation. An analogue of `std::tuple` that works on device. Context: I've tried `std::tuple` in various versions of NVCC and clang, and although code seems to compile, it often fails to run - generating "illegal memory access" errors, or "illegal instruction" errors. This replacement does work on device.	2021-09-08 13:34:19 -07:00
Antonio Sanchez	fcd73b4884	Add a simple serialization mechanism. The `Serializer<T>` class implements a binary serialization that can write to (`serialize`) and read from (`deserialize`) a byte buffer. Also added convenience routines for serializing a list of arguments. This will mainly be for testing, specifically to transfer data to and from the GPU.	2021-09-08 09:38:59 -07:00
Antonio Sanchez	3b48a3b964	Remove stray DynamicSparseMatrix references. DynamicSparseMatrix has been removed. These shouldn't be here anymore.	2021-09-02 19:47:26 +00:00
Antonio Sanchez	5db9e5c779	Fix fix<N> when variable templates are not supported. There were some typos that checked `EIGEN_HAS_CXX14` that should have checked `EIGEN_HAS_CXX14_VARIABLE_TEMPLATES`, causing a mismatch in some of the `Eigen::fix<N>` assumptions. Also fixed the `symbolic_index` test when `EIGEN_HAS_CXX14_VARIABLE_TEMPLATES` is 0. Fixes #2308	2021-08-30 08:06:55 -07:00
Antonio Sanchez	eeacbd26c8	Bump CMake files to at least c++11. Removed all configurations that explicitly test or set the c++ standard flags. The only place the standard is now configured is at the top of the main `CMakeLists.txt` file, which can easily be updated (e.g. if we decide to move to c++14+). This can also be set via command-line using ``` > cmake -DCMAKE_CXX_STANDARD=14 ``` Kept the `EIGEN_TEST_CXX11` flag for now - that still controls whether to build/run the `cxx11_*` tests. We will likely end up renaming these tests and removing the `CXX11` subfolder.	2021-08-25 20:07:48 +00:00
Jakub Lichman	dc5b1f7d75	AVX512 and AVX2 support for Packet16i and Packet8i added	2021-08-25 19:38:23 +00:00
Kolja Brix	58e086b8c8	Add random matrix generation via SVD	2021-08-23 16:00:05 +00:00
Antonio Sanchez	0c4ae56e37	Remove unaligned assert tests. Manually constructing an unaligned object declared as aligned invokes UB, so we cannot technically check for alignment from within the constructor. Newer versions of clang optimize away this check. Removing the affected tests.	2021-08-18 18:05:24 +00:00
Antonio Sanchez	fc9d352432	Renamed shift_left/shift_right to shiftLeft/shiftRight. For naming consistency. Also moved to ArrayCwiseUnaryOps, and added test.	2021-08-17 20:04:48 -07:00
Nikolay Tverdokhleb	f1b899eef7	Do not set AnnoyingScalar::dont_throw if not defined EIGEN_TEST_ANNOYING_SCALAR_DONT_THROW. - Because that member is not declared if the macro is defined.	2021-08-11 10:01:21 +00:00
Gauri Deshpande	e6a5a594a7	remove denormal flushing in fp32tobf16 for avx & avx512	2021-08-09 22:15:21 +00:00
Alexander Karatarakis	4ba872bd75	Avoid leading underscore followed by cap in template identifiers	2021-08-04 22:41:52 +00:00
Antonio Sanchez	3d98a6ef5c	Modify scalar pzero, ptrue, pselect, and p<binary> operations to avoid memset. The `memset` function and bitwise manipulation only apply to POD types that do not require initialization, otherwise resulting in UB. We currently violate this in `ptrue` and `pzero`, we assume bitmasks for `pselect`, and bitwise operations are applied byte-by-byte in the generic implementations. This is causing issues for scalar types that do require initialization or that contain non-POD info such as pointers (#2201). We either break them, or force specializations of these functions for custom scalars, even if they are not vectorized. Here we modify these functions for scalars only - instead using only scalar operations: - `pzero`: `Scalar(0)` for all scalars. - `ptrue`: `Scalar(1)` for non-trivial scalars, bitset to one bits for trivial scalars. - `pselect`: ternary select comparing mask to `Scalar(0)` for all scalars - `pand`, `por`, `pxor`, `pnot`: use operators `&`, `|`, `^`, `~` for all integer or non-trivial scalars, otherwise apply bytewise. For non-scalar types, the original implementations are used to maintain compatibility and minimize the number of changes. Fixes #2201.	2021-08-03 08:44:28 -07:00
Antonio Sanchez	7880f10526	Enable equality comparisons on GPU. Since `std::equal_to::operator()` is not a device function, it fails on GPU. On my device, I seem to get a silent crash in the kernel (no reported error, but the kernel does not complete). Replacing this with a portable version enables comparisons on device. Addresses #2292 - would need to be cherry-picked. The 3.3 branch also requires adding `EIGEN_DEVICE_FUNC` in `BooleanRedux.h` to get fully working.	2021-08-03 01:53:31 +00:00
arthurfeeney	a77638387d	Fixes #1387 for compilation error in JacobiSVD with HouseholderQRPreconditioner that occurs when input is a compile-time row vector.	2021-07-20 20:11:22 +00:00
Antonio Sanchez	1e6c6c1576	Replace memset with fill to work for non-trivial scalars. For custom scalars, zero is not necessarily represented by a zeroed-out memory block (e.g. gnu MPFR). We therefore cannot rely on `memset` if we want to fill a matrix or tensor with zeroes. Instead, we should rely on `fill`, which for trivial types does end up getting converted to a `memset` under-the-hood (at least with gcc/clang). Requires adding a `fill(begin, end, v)` to `TensorDevice`. Replaced all potentially bad instances of memset with fill. Fixes #2245.	2021-07-08 18:34:41 +00:00
Kolja Brix	a59cf78c8d	Add Doxygen-style documentation to main.h.	2021-07-07 18:23:59 +00:00
Antonio Sanchez	154f00e9ea	Fix inverse nullptr/asan errors for LU. For empty or single-column matrices, the current `PartialPivLU` currently dereferences a `nullptr` or accesses memory out-of-bounds. Here we adjust the checks to avoid this.	2021-07-01 13:41:04 -07:00
Alexander Karatarakis	60400334a9	Make DenseStorage<> trivially_copyable	2021-06-30 04:27:51 +00:00
Rasmus Munk Larsen	4ad30a73fc	Use internal::ref_selector to avoid holding a reference to a RHS expression.	2021-06-22 14:31:32 +00:00
Rasmus Munk Larsen	13fb5ab92c	Fix more enum arithmetic.	2021-06-15 09:09:31 -07:00
Rasmus Munk Larsen	f64b2954c7	Fix c++20 warnings about using enums in arithmetic expressions.	2021-06-10 17:17:39 -07:00
Antonio Sanchez	dba753a986	Add missing NEON ptranspose implementations. Unified implementation using only `vzip`.	2021-05-25 18:25:35 +00:00
Jakub Lichman	12471fcb5d	predux_half_dowto4 test extended to all applicable packets	2021-05-21 16:42:19 +00:00
Niall Murphy	391094c507	Use derived object type in conservative_resize_like_impl When calling conservativeResize() on a matrix with DontAlign flag, the temporary variable used to perform the resize should have the same Options as the original matrix to ensure that the correct override of swap is called (i.e. PlainObjectBase::swap(DenseBase<OtherDerived> & other). Calling the base class swap (i.e in DenseBase) results in assertions errors or memory corruption.	2021-05-20 23:17:02 +00:00
Jakub Lichman	8877f8d9b2	ptranspose test for non-square kernels added	2021-05-19 08:26:45 +00:00
Guoqiang QI	3e006bfd31	Ensure all generated matrices for inverse_4x4 tests are invertible; this fixes #2248.	2021-05-13 15:03:30 +00:00
Antonio Sanchez	90e9a33e1c	Fix numext::arg return type. The cxx11 path for `numext::arg` incorrectly returned the complex type instead of the real type, leading to compile errors. Fixed this and added tests. Related to !477, which uncovered the issue.	2021-05-07 16:26:57 +00:00
Theo Fletcher	2ced0cc233	Added complex matrix unit tests for SelfAdjointEigenSolve	2021-04-26 19:00:51 +00:00
Jakub Lichman	d87648a6be	Tests added and AVX512 bug fixed for pcmp_lt_or_nan	2021-04-25 20:58:56 +00:00
Jakub Lichman	1115f5462e	Tests for pcmp_lt and pcmp_le added	2021-04-23 19:51:43 +00:00
Antonio Sanchez	d213a0bcea	DenseStorage safely copy/swap. Fixes #2229. For dynamic matrices with fixed-sized storage, only copy/swap elements that have been set. Otherwise, this leads to inefficient copying, and potential UB for non-initialized elements.	2021-04-22 18:45:19 +00:00
Antonio Sanchez	69adf26aa3	Modify googlehash use to account for namespace issues. The namespace declaration for googlehash is a configurable macro that can be disabled. In particular, it is disabled within google, causing compile errors since `dense_hash_map`/`sparse_hash_map` are then in the global namespace instead of in `::google`. Here we play a bit of gymnastics to allow for both `google::_hash_map` and `_hash_map`, while limiting namespace pollution. Symbols within the `::google` namespace are imported into `Eigen::google`. We also remove checks based on `_SPARSE_HASH_MAP_H_`, as this is fragile, and instead require `EIGEN_GOOGLEHASH_SUPPORT` to be defined.	2021-04-12 19:00:39 -07:00
Christoph Hertzberg	d58678069c	Make iterators default constructible and assignable, by making...	2021-04-09 17:03:28 +00:00
Antonio Sanchez	ace7f132ed	Fix clang tidy warnings in AnnoyingScalar. Clang-tidy complains that full specializations in headers can cause ODR violations. Marked these as `inline` to fix. It also complains about renaming arguments in specializations. Set the argument names to match.	2021-04-05 12:49:38 -07:00
Rasmus Munk Larsen	3ddc0974ce	Fix two bugs in commit	2021-04-02 22:06:27 +00:00
David Tellenbach	ae95b74af9	Add CMake infrastructure for smoke testing Necessary CMake changes to implement pre-merge smoke tests running via CI.	2021-03-31 22:09:00 +00:00
Rasmus Munk Larsen	5bbc9cea93	Add an info() method to the SVDBase class to make it possible to tell the user that the computation failed, possibly due to invalid input. Make Jacobi and divide-and-conquer fail fast and return info() == InvalidInput if the matrix contains NaN or +/-Inf.	2021-03-31 21:09:19 +00:00
Antonio Sanchez	78ee3d6261	Fix CUDA constexpr issues for numeric_limits. Some CUDA/HIP constants fail on device with `constexpr` since they internally rely on non-constexpr functions, e.g. ``` #define CUDART_INF_F __int_as_float(0x7f800000) ``` This fails for cuda-clang (though passes with nvcc). These constants are currently used by `device::numeric_limits`. For portability, we need to remove `constexpr` from the affected functions. For C++11 or higher, we should be able to rely on the `std::numeric_limits` versions anyways, since the methods themselves are now `constexpr`, so should be supported on device (clang/hipcc natively, nvcc with `--expt-relaxed-constexpr`).	2021-03-30 18:01:27 +00:00
Chip Kerchner	d59ef212e1	Fixed performance issues for complex VSX and P10 MMA in gebp_kernel (level 3).	2021-03-25 11:08:19 +00:00
Antonio Sanchez	5521c65afb	Eliminate mixingtypes_7 warning. `g_called` is not used in subtest 7, so was generating a `-Wunneeded-internal-declaration` warning. Here we silence it by initializing the static variable.	2021-03-24 11:05:41 -07:00
David Tellenbach	0cc9b5eb40	Split test commainitializer into two subtests	2021-03-18 13:28:51 +01:00
Antonio Sanchez	c3fbc6cec7	Use singleton pattern for static registered tests. The original fails with nvcc+msvc - there's a static order of initialization issue leading to registered tests being cleared. The test then fails on ``` VERIFY(EigenTest::all().size()>0); ``` since `EigenTest` no longer contains any tests. The singleton pattern fixes this.	2021-03-18 00:56:31 +00:00
Antonio Sanchez	8dfe1029a5	Augment NumTraits with min/max_exponent() again. Replace usage of `std::numeric_limits<...>::min/max_exponent` in codebase where possible. Also replaced some other `numeric_limits` usages in affected tests with the `NumTraits` equivalent. The previous MR !443 failed for c++03 due to lack of `constexpr`. Because of this, we need to keep around the `std::numeric_limits` version in enum expressions until the switch to c++11. Fixes #2148	2021-03-16 20:12:46 -07:00
David Tellenbach	df4bc2731c	Revert "Augment NumTraits with min/max_exponent()." This reverts commit `75ce9cd2a7`.	2021-03-17 03:06:08 +01:00
Antonio Sanchez	75ce9cd2a7	Augment NumTraits with min/max_exponent(). Replace usage of `std::numeric_limits<...>::min/max_exponent` in codebase. Also replaced some other `numeric_limits` usages in affected tests with the `NumTraits` equivalent. Fixes #2148	2021-03-17 01:00:41 +00:00
Rasmus Munk Larsen	2e83cbbba9	Add NaN propagation options to minCoeff/maxCoeff visitors.	2021-03-16 17:02:50 +00:00
Antonio Sanchez	f612df2736	Add fmod(half, half). This is to support TensorFlow's `tf.math.floormod` for half.	2021-03-15 13:32:24 -07:00
Antonio Sanchez	d24f9f9b55	Fix NVCC+ICC issues. NVCC does not understand `__forceinline`, so we need to use `inline` when compiling for GPU. ICC specializes `std::complex` operators for `float` and `double` by default, which cannot be used on device and conflict with Eigen's workaround in CUDA/Complex.h. This can be prevented by defining `_OVERRIDE_COMPLEX_SPECIALIZATION_` before including `<complex>`. Added this define to the tests and to `Eigen/Core`, but this will not work if the user includes `<complex>` before `<Eigen/Core>`. ICC also seems to generate a duplicate `Map` symbol in `PlainObjectBase`: ``` error: "Map" has already been declared in the current scope static ConstMapType Map(const Scalar *data) ``` I tracked this down to `friend class Eigen::Map`. Putting the `friend` statements at the bottom of the class seems to resolve this issue. Fixes #2180	2021-03-15 18:42:04 +00:00
Antonio Sanchez	14487ed14e	Add increment/decrement operators to Eigen::half. This is for consistency with bfloat16, and to support initialization with `std::iota`.	2021-03-15 10:52:23 -07:00
Antonio Sanchez	b271110788	Bump up rand histogram threshold. The previous one sometimes fails for MSVC which has a poor random number generator. Fixes #2182	2021-03-10 22:17:03 -08:00
Antonio Sanchez	543e34ab9d	Re-implement move assignments. The original swap approach leads to potential undefined behavior (reading uninitialized memory) and results in unnecessary copying of data for static storage. Here we pass down the move assignment to the underlying storage. Static storage does a one-way copy, dynamic storage does a swap. Modified the tests to no longer read from the moved-from matrix/tensor, since that can lead to UB. Added a test to ensure we do not access uninitialized memory in a move. Fixes: #2119	2021-03-10 16:55:20 +00:00
Antonio Sanchez	2468253c9a	Define EIGEN_CPLUSPLUS and replace most __cplusplus checks. The macro `__cplusplus` is not defined correctly in MSVC unless building with the `/Zc:__cplusplus` flag. Instead, it defines `_MSVC_LANG` to the specified c++ standard version number. Here we introduce `EIGEN_CPLUSPLUS` which will contain the c++ version number both for MSVC and otherwise. This simplifies checks for supported features. Also replaced most instances of standard version checking via `__cplusplus` with the existing `EIGEN_COMP_CXXVER` macro for better clarity. Fixes: #2170	2021-03-05 18:33:18 +00:00
Antonio Sanchez	82d61af3a4	Fix rint SSE/NEON again, using optimization barrier. This is a new version of !423, which failed for MSVC. Defined `EIGEN_OPTIMIZATION_BARRIER(X)` that uses inline assembly to prevent operations involving `X` from crossing that barrier. Should work on most `GNUC` compatible compilers (MSVC doesn't seem to need this). This is a modified version adapted from what was used in `psincos_float` and tested on more platforms (see #1674, https://godbolt.org/z/73ezTG). Modified `rint` to use the barrier to prevent the add/subtract rounding trick from being optimized away. Also fixed an edge case for large inputs that get bumped up a power of two and ends up rounding away more than just the fractional part. If we are over `2^digits` then just return the input. This edge case was missed in the test since the test was comparing approximate equality, which was still satisfied. Adding a strict equality option catches it.	2021-03-05 08:54:12 -08:00
Antonio Sánchez	9a663973b4	Revert "Fix rint for SSE/NEON." This reverts commit `e72dfeb8b9`	2021-03-03 18:51:51 +00:00
Antonio Sanchez	e72dfeb8b9	Fix rint for SSE/NEON. It seems sometimes with aggressive optimizations the combination `psub(padd(a, b), b)` trick to force rounding is compiled away. Here we replace with inline assembly to prevent this (I tried `volatile`, but that leads to additional loads from memory). Also fixed an edge case for large inputs `a` where adding `b` bumps the value up a power of two and ends up rounding away more than just the fractional part. If we are over `2^digits` then just return the input. This edge case was missed in the test since the test was comparing approximate equality, which was still satisfied. Adding a strict equality option catches it.	2021-03-03 09:41:46 -08:00
Christoph Hertzberg	199c5f2b47	geo_alignedbox_5 was failing with AVX enabled, due to storing `Vector4d` in a `std::vector` without using an aligned allocator. Got rid of using `std::vector` and simplified the code. Avoid leading `_`	2021-03-01 03:59:21 +01:00
Antonio Sanchez	1e0c7d4f49	Add print for SSE/NEON, use NEON rounding intrinsics if available. In SSE, by adding/subtracting 2^MantissaBits, we force rounding according to the current rounding mode. For NEON, we use the provided intrinsics for rint/floor/ceil if available (armv8). Related to #1969.	2021-02-27 22:42:07 +00:00
Antonio Sanchez	c65c2b31d4	Make half/bfloat16 constructor take inputs by value, fix powerpc test. Since `numeric_limits<half>::max_exponent` is a static inline constant, it cannot be directly passed by reference. This triggers a linker error in recent versions of `g++-powerpc64le`. Changing `half` to take inputs by value fixes this. Wrapping `max_exponent` with `int(...)` to make an addressable integer also fixes this and may help with other custom `Scalar` types down-the-road. Also eliminated some compile warnings for powerpc.	2021-02-27 21:32:06 +00:00
Christoph Hertzberg	ca528593f4	Fixed/masked more implicit copy constructor warnings (cherry picked from commit 2883e91ce5a99c391fbf28e20160176b70854992)	2021-02-27 18:44:26 +01:00
Antonio Sanchez	29ebd84cb7	Fix NEON sqrt for 32-bit, add prsqrt. With !406, we accidentally broke arm 32-bit NEON builds, since `vsqrt_f32` is only available for 64-bit. Here we add back the `rsqrt` implementation for 32-bit, relying on a `prsqrt` implementation with better handling of edge cases. Note that several of the 32-bit NEON packet tests are currently failing - either due to denormal handling (NEON versions flush to zero, but scalar paths don't) or due to accuracy (e.g. sin/cos).	2021-02-26 14:08:40 -08:00
Rasmus Munk Larsen	fe19714f80	Merge branch 'rmlarsen1/eigen-nan_prop'	2021-02-26 09:21:24 -08:00
Antonio Sanchez	e19829c3b0	Fix floor/ceil for NEON fp16. Forgot to test this. Fixes bug introduced in !416.	2021-02-25 20:39:56 -08:00
Antonio Sanchez	5529db7524	Fix SSE/NEON pfloor/pceil for saturated values. The original will saturate if the input does not fit into an integer type. Here we fix this, returning the input if it doesn't have enough precision to have a fractional part. Also added `pceil` for NEON. Fixes #1969.	2021-02-25 14:39:26 -08:00
Rasmus Munk Larsen	5297b7162a	Make it possible to specify NaN propagation strategy for maxCoeff/minCoeff reductions.	2021-02-25 18:21:21 +00:00
Antonio Sanchez	ecb7b19dfa	Disable new/delete test for HIP	2021-02-25 08:04:05 -08:00
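
The rint fixes listed above (`82d61af3a4`, `1e0c7d4f49`) describe forcing rounding by adding and subtracting 2^MantissaBits and returning large inputs unchanged. A minimal scalar sketch of that idea follows. It is not Eigen's packet implementation: `rint_by_magic` is a hypothetical name, `volatile` stands in for the inline-assembly barrier the commit mentions, and the default round-to-nearest floating-point mode is assumed.

```cpp
#include <cassert>
#include <cmath>
#include <limits>

// Hypothetical helper (not part of Eigen): round a float to an integer value
// in the current rounding mode using the add/subtract trick.
inline float rint_by_magic(float a) {
  // 2^23 for float: at or above this magnitude every float is already an integer.
  const float limit =
      static_cast<float>(1 << (std::numeric_limits<float>::digits - 1));
  const float abs_a = std::fabs(a);
  // Large inputs, infinities, and NaN have no fractional part to remove,
  // so they are returned unchanged.
  if (!(abs_a < limit)) return a;
  // volatile keeps the compiler from simplifying (x + limit) - limit to x.
  volatile float shifted = abs_a + limit;
  const float rounded = shifted - limit;
  assert(rounded <= limit);
  // Restore the sign, which also preserves -0.0f.
  return std::copysign(rounded, a);
}
```

Under the stated assumption, `rint_by_magic(2.5f)` gives `2.0f` and `rint_by_magic(3.5f)` gives `4.0f`, because ties round to even.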


2687 Commits
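
Commit `1e6c6c1576` in the log above replaces `memset` with `fill` because a custom scalar's zero need not be an all-zero byte pattern. The sketch below illustrates the point with a hypothetical `BiasedScalar` type and a hypothetical `set_zero` helper; neither is part of Eigen.

```cpp
#include <algorithm>
#include <cassert>
#include <vector>

// Hypothetical scalar whose zero is NOT represented by all-zero bytes.
// It stands in for custom scalar types such as MPFR-backed numbers.
struct BiasedScalar {
  int stored;  // holds (value + 1), so value 0 is stored as 1
  BiasedScalar(int v = 0) : stored(v + 1) {}
  int value() const { return stored - 1; }
};

// Hypothetical helper: assign Scalar(0) element by element instead of
// zeroing raw bytes, so the scalar's own notion of zero is respected.
template <typename Scalar>
void set_zero(Scalar* begin, Scalar* end) {
  assert(begin <= end);
  std::fill(begin, end, Scalar(0));
}
```

Zeroing the bytes of a `BiasedScalar` would leave `stored == 0`, which this type reads as the value -1, whereas `set_zero` leaves every element with `value() == 0`.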