eigen

CFD/eigen

Author	SHA1	Message	Date
Pedro Caldeira	31ab62d347	Add support for Power10 (AltiVec) MMA instructions for bfloat16.	2022-11-30 23:33:37 +00:00
Antonio Sánchez	dcb042a87d	Fix serialization for non-compressed matrices.	2022-11-30 18:16:47 +00:00
Antonio Sánchez	2260e11eb0	Fix reshape strides when input has non-zero inner stride.	2022-11-29 19:39:29 +00:00
Antonio Sánchez	ab2b26fbc2	Fix sparseLU solver when destination has a non-unit stride.	2022-11-29 19:37:03 +00:00
Charles Schlosser	b7551bff92	Fix a bunch of annoying compiler warnings in tests	2022-11-21 20:07:19 +00:00
Antonio Sánchez	e7b1ad0315	Add serialization for sparse matrix and sparse vector.	2022-11-21 19:43:07 +00:00
Gabriele Buondonno	6431dfdb50	Cross product for vectors of size 2. Fixes #1037	2022-11-15 22:39:42 +00:00
Antonio Sánchez	8588d8c74b	Correct pnegate for floating-point zero.	2022-11-15 18:07:23 +00:00
Charles Schlosser	82b152dbe7	Add signbit function	2022-11-04 00:31:20 +00:00
Rasmus Munk Larsen	14c847dc0e	Refactor special values test for pow, and add a similar test for atan2	2022-10-12 20:12:08 +00:00
Rasmus Munk Larsen	d6bc062591	Remove reference to EIGEN_HAS_CXX11_MATH.	2022-10-10 23:38:28 +00:00
Rasmus Munk Larsen	1e1848fdb1	Add a vectorized implementation of atan2 to Eigen.	2022-09-28 20:46:49 +00:00
Rasmus Munk Larsen	273e0c884e	Revert "Add constexpr, test for C++14 constexpr."	2022-09-16 21:14:29 +00:00
Rasmus Munk Larsen	dceb779ecd	Fix test for pow with mixed integer types. We do not convert the exponent if it is an integer type.	2022-09-12 15:51:27 -07:00
Rasmus Munk Larsen	afc014f1b5	Allow mixed types for pow(), as long as the exponent is exactly representable in the base type.	2022-09-12 21:55:30 +00:00
Antonio Sánchez	b2c82a9347	Remove bad skew_symmetric_matrix3 test.	2022-09-10 07:08:37 +00:00
Antonio Sánchez	fb212c745d	Fix g++-6 constexpr and c++20 constexpr build errors.	2022-09-09 03:41:45 +00:00
Thomas Gloor	ec9c7163a3	Feature/skew symmetric matrix3	2022-09-08 20:44:40 +00:00
Antonio Sánchez	3c37dd2a1d	Tweak bound for pow to account for floating-point types.	2022-09-08 17:40:45 +00:00
Rasmus Munk Larsen	242325eca7	Remove unused variable.	2022-09-07 20:46:44 +00:00
Tobias Schlüter	133498c329	Add constexpr, test for C++14 constexpr.	2022-09-07 03:42:34 +00:00
Antonio Sánchez	69f50e3a67	Adjust overflow threshold bound for pow tests.	2022-09-06 19:53:29 +00:00
Antonio Sanchez	3e44f960ed	Reduce compiler warnings for tests.	2022-09-06 18:20:56 +00:00
Antonio Sánchez	f5364331eb	Fix some cmake issues.	2022-09-02 16:43:14 +00:00
Antonio Sánchez	d816044b6e	Fix mixingtypes tests.	2022-09-02 15:30:13 +00:00
Charles Schlosser	e5af9f87f2	Vectorize pow for integer base / exponent types	2022-08-29 19:23:54 +00:00
Rasmus Munk Larsen	98e51c9e24	Avoid undefined behavior in array_cwise test due to signed integer overflow	2022-08-26 16:19:03 +00:00
Arthur	a7c1cac18b	Fix GeneralizedEigenSolver::info() and Asserts	2022-08-25 22:05:04 +00:00
Charles Schlosser	76a669fb45	add fixed power unary operation	2022-08-16 21:32:36 +00:00
Rasmus Munk Larsen	97e0784dc6	Vectorize the sign operator in Eigen.	2022-08-09 19:54:57 +00:00
Arthur	be20207d10	Fix vectorized Jacobi Rotation	2022-08-08 19:29:56 +00:00
Rasmus Munk Larsen	7a87ed1b6a	Fix code and unit test for a few corner cases in vectorized pow()	2022-08-08 18:48:36 +00:00
Antonio Sánchez	5a1c7807e6	Fix inner iterator for sparse block.	2022-08-03 17:26:12 +00:00
Antonio Sánchez	39d22ef46b	Fix flaky packetmath_1 test.	2022-08-02 17:42:45 +00:00
sjusju	ef4654bae7	Add true determinant to QR and it's variants	2022-07-29 18:24:14 +00:00
Rohit Santhanam	06a458a13d	Enable subtests which use device side malloc since this has been fixed in ROCm 5.2.	2022-06-29 17:09:43 +00:00
Chip Kerchner	84cf3ff18d	Add pload_partial, pstore_partial (and unaligned versions), pgather_partial, pscatter_partial, loadPacketPartial and storePacketPartial.	2022-06-27 19:18:00 +00:00
Arthur	14aae29470	Provide DiagonalMatrix Product and Initializers	2022-06-06 21:43:22 +00:00
Rasmus Munk Larsen	510f6b9f15	Fix integer shortening warnings in visitor tests.	2022-05-27 18:51:37 +00:00
Arthur	705ae70646	Add R-Bidiagonalization step to BDCSVD	2022-05-27 02:00:24 +00:00
Antonio Sánchez	32348091ba	Avoid signed integer overflow in adjoint test.	2022-05-23 14:46:16 +00:00
Guoqiang QI	00b75375e7	Adding PocketFFT support in FFT module since kissfft has some flaw in accuracy and performance	2022-05-11 17:44:22 +00:00
John Mather	9e026e5e28	Removed need to supply the Symmetric flag to UpLo argument for Accelerate LLT and LDLT	2022-04-21 20:02:10 +00:00
Antonio Sánchez	f845a8bb1a	Fix cwise NaN propagation for scalar input.	2022-04-16 05:07:44 +00:00
Antonio Sánchez	07db964bde	Restrict new AVX512 trsm to AVX512VL, rename files for consistency.	2022-04-14 16:58:32 +00:00
Antonio Sánchez	3342fc7e4d	Allow all tests to pass with `EIGEN_TEST_NO_EXPLICIT_VECTORIZATION`	2022-04-12 14:48:22 +00:00
Tobias Schlüter	f3ba220c5d	Remove EIGEN_EMPTY_STRUCT_CTOR	2022-04-08 18:27:26 +00:00
Antonio Sánchez	0c859cf35d	Consider inf/nan in scalar test_isApprox.	2022-04-01 17:00:24 +00:00
Antonio Sánchez	73b2c13bf2	Disable f16c scalar conversions for MSVC.	2022-03-30 18:35:32 +00:00
Essex Edwards	cd3c81c3bc	Add a NNLS solver to unsupported - issue #655	2022-03-23 20:20:44 +00:00
Antonio Sánchez	19a6a827c4	Optimize visitor traversal in case of RowMajor.	2022-03-23 15:27:57 +00:00
Antonio Sánchez	9deaa19121	Work around g++-10 docker issue for geo_orthomethods_4.	2022-03-16 21:46:04 +00:00
Antonio Sánchez	01b5bc48cc	Disable schur non-convergence test.	2022-03-16 17:33:53 +00:00
Erik Schultheis	421cbf0866	Replace Eigen type metaprogramming with corresponding std types and make use of alias templates	2022-03-16 16:43:40 +00:00
Antonio Sánchez	baf9a985ec	Fix swap test for size 1 inputs.	2022-03-10 15:05:58 +00:00
Tobias Schlüter	9883108f3a	Remove copy_bool workaround for gcc 4.3	2022-03-08 17:43:11 +00:00
John Mather	3a9d404d76	Add support for Apple's Accelerate sparse matrix solvers	2022-03-08 00:09:18 +00:00
Antonio Sanchez	28c7c1a629	Log position of first difference for easier debugging.	2022-03-07 19:06:27 +00:00
Antonio Sánchez	b2ee235a4b	Split and reduce SVD test sizes.	2022-03-05 00:15:28 +00:00
Antonio Sánchez	27d8f29be3	Update vectorization_logic tests for all platforms.	2022-03-03 19:54:15 +00:00
Antonio Sánchez	711803c427	Skip denormal test if `Cond` is false.	2022-03-03 04:32:13 +00:00
Antonio Sánchez	9c07e201ff	Modified sqrt/rsqrt for denormal handling.	2022-03-02 17:20:47 +00:00
Antonio Sánchez	f03df0df53	Fix SVD for MSVC.	2022-02-28 19:53:15 +00:00
Antonio Sánchez	2ed4bee78f	Fix frexp packetmath tests for MSVC.	2022-02-24 22:16:37 +00:00
Antonio Sánchez	d58e629130	Disable deprecated warnings for SVD tests on MSVC.	2022-02-24 21:20:49 +00:00
Antonio Sánchez	3d7e2d0e3e	Fix packetmath compilation error.	2022-02-23 23:27:08 +00:00
Antonio Sánchez	8970719771	Fix gcc-5 packetmath_12 bug.	2022-02-23 21:56:25 +00:00
Antonio Sánchez	f0b81fefb7	Disable deprecated warnings in SVD tests.	2022-02-23 18:32:00 +00:00
Rasmus Munk Larsen	8b875dbef1	Changes to fast SQRT/RSQRT	2022-02-23 17:32:21 +00:00
Arthur	cd80e04ab7	Add assert for edge case if Thin U Requested at runtime	2022-02-23 05:35:19 +00:00
Lingzhu Xiang	35727928ad	Fix test macro conflicts with STL headers in C++20	2022-02-23 07:43:30 +08:00
Antonio Sánchez	28e008b99a	Fix sqrt/rsqrt for NEON.	2022-02-15 21:31:51 +00:00
Rasmus Munk Larsen	92d0026b7b	Provide a definition for numeric_limits static data members	2022-02-08 20:34:53 +00:00
Antonio Sánchez	9441d94dcc	Revert "Make fixed-size Matrix and Array trivially copyable after C++20" This reverts commit `47eac21072`	2022-02-05 04:40:29 +00:00
Rasmus Munk Larsen	979fdd58a4	Add generic fast psqrt and prsqrt impls and make them correct for 0, +Inf, NaN, and negative arguments.	2022-02-05 00:20:13 +00:00
Arthur	18b50458b6	Update SVD Module with Options template parameter	2022-02-02 00:15:44 +00:00
Rasmus Munk Larsen	7db0ac977a	Remove extraneous ")".	2022-01-27 02:20:03 +00:00
Rasmus Munk Larsen	09c0085a57	Only test pmsub, pnmadd, and pnmsub on signed types.	2022-01-27 02:09:25 +00:00
Rasmus Munk Larsen	8f2c6f0aa6	Make preciprocal IEEE compliant w.r.t. 1/0 and 1/inf.	2022-01-26 20:38:05 +00:00
Erik Schultheis	d271a7d545	reduce float warnings (comparisons and implicit conversions)	2022-01-26 18:16:19 +00:00
Rasmus Munk Larsen	51311ec651	Remove inline assembly for FMA (AVX) and add remaining extensions as packet ops: pmsub, pnmadd, and pnmsub.	2022-01-26 04:25:41 +00:00
Erik Schultheis	4e629b3c1b	make casts explicit and fixed the type	2022-01-24 18:19:21 +00:00
Rasmus Munk Larsen	ea2c02060c	Add reciprocal packet op and fast specializations for float with SSE, AVX, and AVX512.	2022-01-21 23:49:18 +00:00
Arthur Feeney	4b0926f99b	Prevent heap allocation in diagonal product	2022-01-21 21:15:44 +00:00
Erik Schultheis	970640519b	Cleanup	2022-01-21 01:48:59 +00:00
Lingzhu Xiang	47eac21072	Make fixed-size Matrix and Array trivially copyable after C++20 Making them trivially copyable allows using std::memcpy() without undefined behaviors. Only Matrix and Array with trivially copyable DenseStorage are marked as trivially copyable with an additional type trait. As described in http://www.open-std.org/jtc1/sc22/wg21/docs/papers/2019/p0848r3.html it requires extremely verbose SFINAE to make the special member functions of fixed-size Matrix and Array trivial, unless C++20 concepts are available to simplify the selection of trivial special member functions given template parameters. Therefore only make this feature available to compilers that support C++20 P0848R3. Fix #1855.	2022-01-07 19:04:35 +00:00
Rasmus Munk Larsen	96dc37a03b	Some fixes/cleanups for numeric_limits & fix for related bug in psqrt	2022-01-07 01:10:17 +00:00
Rasmus Munk Larsen	7b5a8b6bc5	Improve plog: 20% speedup for float + handle denormals	2022-01-05 23:40:31 +00:00
Andrew Johnson	a491c7f898	Allow specifying inner & outer stride for CWiseUnaryView - fixes #2398	2022-01-05 19:24:46 +00:00
Rohit Santhanam	27a78e4f96	Some serialization API changes were made in commit...	2022-01-05 16:18:45 +00:00
Lingzhu Xiang	7244a74ab0	Add bounds checking to Eigen serializer	2022-01-03 17:00:24 +08:00
David Tellenbach	22a347b9d2	Remove unused EIGEN_HAS_STATIC_ARRAY_TEMPLATE `ec2fd0f7` removed the EIGEN_HAS_STATIC_ARRAY_TEMPLATE but forgot to remove this last occurrence. This fixes issue #2399.	2021-12-30 15:26:55 +00:00
David Tellenbach	d705eb5f86	Revert "Select AVX2 even if the data size is not a multiple of 8" Tests are failing for AVX and NEON. This reverts commit `eb85b97339`.	2021-12-28 23:57:06 +01:00
David Tellenbach	6e95c0cd9a	Add missing internal namespace The vectorization logic tests miss some namespace internal qualifiers.	2021-12-27 23:50:32 +00:00
Erik Schultheis	f7a056bf04	Small fixes This MR fixes a bunch of smaller issues, making the following changes: * Template parameters in the documentation are documented with `\tparam` instead of `\param` * Superfluous semicolon warnings fixed * Fixed the type of literals used to initialize float variables	2021-12-21 16:46:09 +00:00
Erik Schultheis	c20e908ebc	turn some macros intro constexpr functions	2021-12-10 19:27:01 +00:00
Erik Schultheis	495ffff945	removed helper cmake macro and don't use deprecated COMPILE_FLAGS anymore.	2021-12-09 23:09:56 +00:00
Rasmus Munk Larsen	f04fd8b168	Make sure exp(-Inf) is zero for vectorized expressions. This fixes #2385 .	2021-12-08 17:57:23 +00:00
Erik Schultheis	cc11e240ac	Some further cleanup	2021-12-06 18:01:15 +00:00
Erik Schultheis	ec2fd0f7ed	Require recent GCC and MSCV and removed `EIGEN_HAS_CXX14` and some other feature test macros	2021-12-01 00:48:34 +00:00
Rasmus Munk Larsen	085c2fc5d5	Revert "Update SVD Module to allow specifying computation options with a...	2021-11-30 18:45:54 +00:00
Jakub Gałecki	1b8dce564a	bugfix: issue #2375	2021-11-29 22:26:15 +00:00
Francesco Mazzoli	eb85b97339	Select AVX2 even if the data size is not a multiple of 8	2021-11-29 21:13:24 +00:00
Arthur	eef33946b7	Update SVD Module to allow specifying computation options with a template parameter. Resolves #2051	2021-11-29 20:50:46 +00:00
Erik Schultheis	f33a31b823	removed EIGEN_HAS_CXX11_* and redundant EIGEN_COMP_CXXVER checks	2021-11-29 19:18:57 +00:00
Erik Schultheis	ec4efbd696	remove EIGEN_HAS_CXX11	2021-11-24 20:08:49 +00:00
Erik Schultheis	7e586635ba	don't use deprecated MappedSparseMatrix	2021-11-19 15:58:04 +00:00
Erik Schultheis	b0fb5417d3	Fixed Sparse-Sparse Product in case of mixed StorageIndex types	2021-11-18 18:33:31 +00:00
Erik Schultheis	13954c4440	moved pruning code to SparseVector.h	2021-11-15 22:16:01 +00:00
Minh Quan HO	4284c68fbb	nestbyvalue test: fix uninitialized matrix - Doing computation with uninitialized (zero-ed ? but thanks Linux) matrix, or worse NaN on other non-linux systems. - This commit fixes it by initializing to Random().	2021-11-04 14:32:12 +01:00
Xinle Liu	478a1bdda6	Fix total deflation issue in BDCSVD, when & only when M is already diagonal.	2021-11-02 16:53:55 +00:00
Maxiwell S. Garcia	99600bd1a6	test: fix boostmutiprec test to compile with older Boost versions Eigen boostmultiprec test redefines a symbol that is already defined inside Boot Math [1]. Boost has fixed it recently [2], but this patch avoids errors if Boost version was less than 1.77. https://github.com/boostorg/math/blob/boost-1.76.0/include/boost/math/policies/policy.hpp#L18 `6830712302 (diff-c7a8e5911c2e6be4138e1a966d762200f147792ac16ad96fdcc724313d11f839)`	2021-10-25 20:32:33 +00:00
Rasmus Munk Larsen	2d3fec8ff6	Add nan-propagation options to matrix and array plugins.	2021-10-21 19:40:11 +00:00
Antonio Sanchez	fd5f48e465	Fix tuple compilation for VS2017. VS2017 doesn't like deducing alias types, leading to a bunch of compile errors for functions involving the `tuple` alias. Replacing with `TupleImpl` seems to solve this, allowing the test to compile/pass.	2021-10-20 19:18:34 +00:00
Antonio Sanchez	701f5d1c91	Fix gpu special function tests. Some checks used incorrect values, partly from copy-paste errors, partly from the change in behaviour introduced in !398. Modified results to match scipy, simplified tests by updating `VERIFY_IS_CWISE_APPROX` to work for scalars.	2021-10-01 10:20:50 -07:00
Antonio Sanchez	f0f1d7938b	Disable testing of complex compound assignment operators for MSVC. MSVC does not support specializing compound assignments for `std::complex`, since it already specializes them (contrary to the standard). Trying to use one of these on device will currently lead to a duplicate definition error. This is still probably preferable to no error though. If we remove the definitions for MSVC, then it will compile, but the kernel will fail silently. The only proper solution would be to define our own custom `Complex` type.	2021-09-27 15:15:11 -07:00
Kolja Brix	51a0b4e2d2	Reorganize test main file	2021-09-27 18:30:47 +00:00
Antonio Sanchez	de218b471d	Add -arch=<arch> argument for nvcc. Without this flag, when compiling with nvcc, if the compute architecture of a card does not exactly match any of those listed for `-gencode arch=compute_<arch>,code=sm_<arch>`, then the kernel will fail to run with: ``` cudaErrorNoKernelImageForDevice: no kernel image is available for execution on the device. ``` This can happen, for example, when compiling with an older cuda version that does not support a newer architecture (e.g. T4 is `sm_75`, but cuda 9.2 only supports up to `sm_70`). With the `-arch=<arch>` flag, the code will compile and run at the supplied architecture.	2021-09-24 20:48:01 -07:00
Antonio Sanchez	846d34384a	Rename EIGEN_CUDA_FLAGS to EIGEN_CUDA_CXX_FLAGS Also add a missing space for clang.	2021-09-24 20:15:55 -07:00
Antonio Sanchez	7b00e8b186	Clean up CUDA CMake files. - Unify test/CMakeLists.txt and unsupported/test/CMakeLists.txt - Added `EIGEN_CUDA_FLAGS` that are appended to the set of flags passed to the cuda compiler (nvcc or clang). The latter is to support passing custom flags (e.g. `-arch=` to nvcc, or to disable cuda-specific warnings).	2021-09-24 14:43:59 -07:00
Kolja Brix	afa616bc9e	Fix some typos found	2021-09-23 15:22:00 +00:00
sciencewhiz	4b6036e276	fix various typos	2021-09-22 16:15:06 +00:00
Antonio Sanchez	f49217e52b	Fix implicit conversion warnings in tuple_test. Fixes #2329.	2021-09-17 19:40:22 -07:00
Antonio Sanchez	9882aec279	Silence string overflow warning for GCC in initializer_list_construction test. This looks to be a GCC bug. It doesn't seem to reproduce is a smaller example, making it hard to isolate.	2021-09-17 18:33:50 +00:00
Antonio Sanchez	5dac0b53c9	Move Eigen::all,last,lastp1,lastN to Eigen::placeholders::. These names are so common, IMO they should not exist directly in the `Eigen::` namespace. This prevents us from using the `last` or `all` names for any parameters or local variables, otherwise spitting out warnings about shadowing or hiding the global values. Many external projects (and our own examples) also heavily use ``` using namespace Eigen; ``` which means these conflict with external libraries as well, e.g. `std::fill(first,last,value)`. It seems originally these were placed in a separate namespace `Eigen::placeholders`, which has since been deprecated. I propose to un-deprecate this, and restore the original locations. These symbols are also imported into `Eigen::indexing`, which additionally imports `fix` and `seq`. An alternative is to remove the `placeholders` namespace and stick with `indexing`. NOTE: this is an API-breaking change. Fixes #2321.	2021-09-17 10:21:42 -07:00
Rohit Santhanam	44da7a3b9d	Disable specific subtests that fail on HIP due to non-functional device side malloc/free (on HIP).	2021-09-17 16:19:03 +00:00
Rasmus Munk Larsen	6cadab6896	Clean up EIGEN_STATIC_ASSERT to only use standard c++11 static_assert.	2021-09-16 20:43:54 +00:00
Ryan Pavlik	3c87d6b662	Fix typos in copyright dates (cherry picked from commit 3335e0767cb847154e24f5d4fa345318309d1281)	2021-09-15 20:49:43 +00:00
Rohit Santhanam	a751225845	Minor fix for compilation error on HIP.	2021-09-12 14:06:58 +00:00
Antonio Sanchez	2e31570c16	Fix tuple_test after gpu_test_helper update. Duplicating the namespace `tuple_impl` caused a conflict with the `arch/GPU/Tuple.h` definitions for the `tuple_test`. We can't just use `Eigen::internal` either, since there exists a different `Eigen::internal::get`. Renaming the namespace to `test_detail` fixes the issue.	2021-09-11 20:24:42 -07:00
Antonio Sanchez	d06c639667	Fix unused variable warning and unnecessessary gpuFree.	2021-09-11 20:02:22 -07:00
Antonio Sanchez	bf66137efc	New GPU test utilities. This introduces new functions: ``` // returns kernel(args...) running on the CPU. Eigen::run_on_cpu(Kernel kernel, Args&&... args); // returns kernel(args...) running on the GPU. Eigen::run_on_gpu(Kernel kernel, Args&&... args); Eigen::run_on_gpu_with_hint(size_t buffer_capacity_hint, Kernel kernel, Args&&... args); // returns kernel(args...) running on the GPU if using // a GPU compiler, or CPU otherwise. Eigen::run(Kernel kernel, Args&&... args); Eigen::run_with_hint(size_t buffer_capacity_hint, Kernel kernel, Args&&... args); ``` Running on the GPU is accomplished by: - Serializing the kernel inputs on the CPU - Transferring the inputs to the GPU - Passing the kernel and serialized inputs to a GPU kernel - Deserializing the inputs on the GPU - Running `kernel(inputs...)` on the GPU - Serializing all output parameters and the return value - Transferring the serialized outputs back to the CPU - Deserializing the outputs and return value on the CPU - Returning the deserialized return value All inputs must be serializable (currently POD types, `Eigen::Matrix` and `Eigen::Array`). The kernel must also be POD (though usually contains no actual data). Tested on CUDA 9.1, 10.2, 11.3, with g++-6, g++-8, g++-10 respectively. This MR depends on !622, !623, !624.	2021-09-10 14:22:50 -07:00
Antonio Sanchez	26e5beb8cb	Device-compatible Tuple implementation. An analogue of `std::tuple` that works on device. Context: I've tried `std::tuple` in various versions of NVCC and clang, and although code seems to compile, it often fails to run - generating "illegal memory access" errors, or "illegal instruction" errors. This replacement does work on device.	2021-09-08 13:34:19 -07:00
Antonio Sanchez	fcd73b4884	Add a simple serialization mechanism. The `Serializer<T>` class implements a binary serialization that can write to (`serialize`) and read from (`deserialize`) a byte buffer. Also added convenience routines for serializing a list of arguments. This will mainly be for testing, specifically to transfer data to and from the GPU.	2021-09-08 09:38:59 -07:00
Antonio Sanchez	3b48a3b964	Remove stray DynamicSparseMatrix references. DynamicSparseMatrix has been removed. These shouldn't be here anymore.	2021-09-02 19:47:26 +00:00
Antonio Sanchez	5db9e5c779	Fix fix<N> when variable templates are not supported. There were some typos that checked `EIGEN_HAS_CXX14` that should have checked `EIGEN_HAS_CXX14_VARIABLE_TEMPLATES`, causing a mismatch in some of the `Eigen::fix<N>` assumptions. Also fixed the `symbolic_index` test when `EIGEN_HAS_CXX14_VARIABLE_TEMPLATES` is 0. Fixes #2308	2021-08-30 08:06:55 -07:00
Antonio Sanchez	eeacbd26c8	Bump CMake files to at least c++11. Removed all configurations that explicitly test or set the c++ standard flags. The only place the standard is now configured is at the top of the main `CMakeLists.txt` file, which can easily be updated (e.g. if we decide to move to c++14+). This can also be set via command-line using ``` > cmake -DCMAKE_CXX_STANDARD 14 ``` Kept the `EIGEN_TEST_CXX11` flag for now - that still controls whether to build/run the `cxx11_*` tests. We will likely end up renaming these tests and removing the `CXX11` subfolder.	2021-08-25 20:07:48 +00:00
Jakub Lichman	dc5b1f7d75	AVX512 and AVX2 support for Packet16i and Packet8i added	2021-08-25 19:38:23 +00:00
Kolja Brix	58e086b8c8	Add random matrix generation via SVD	2021-08-23 16:00:05 +00:00
Antonio Sanchez	0c4ae56e37	Remove unaligned assert tests. Manually constructing an unaligned object declared as aligned invokes UB, so we cannot technically check for alignment from within the constructor. Newer versions of clang optimize away this check. Removing the affected tests.	2021-08-18 18:05:24 +00:00
Antonio Sanchez	fc9d352432	Renamed shift_left/shift_right to shiftLeft/shiftRight. For naming consistency. Also moved to ArrayCwiseUnaryOps, and added test.	2021-08-17 20:04:48 -07:00
Nikolay Tverdokhleb	f1b899eef7	Do not set AnnoyingScalar::dont_throw if not defined EIGEN_TEST_ANNOYING_SCALAR_DONT_THROW. - Because that member is not declared if the macro is defined.	2021-08-11 10:01:21 +00:00
Gauri Deshpande	e6a5a594a7	remove denormal flushing in fp32tobf16 for avx & avx512	2021-08-09 22:15:21 +00:00
Alexander Karatarakis	4ba872bd75	Avoid leading underscore followed by cap in template identifiers	2021-08-04 22:41:52 +00:00
Antonio Sanchez	3d98a6ef5c	Modify scalar pzero, ptrue, pselect, and p<binary> operations to avoid memset. The `memset` function and bitwise manipulation only apply to POD types that do not require initialization, otherwise resulting in UB. We currently violate this in `ptrue` and `pzero`, we assume bitmasks for `pselect`, and bitwise operations are applied byte-by-byte in the generic implementations. This is causing issues for scalar types that do require initialization or that contain non-POD info such as pointers (#2201). We either break them, or force specializations of these functions for custom scalars, even if they are not vectorized. Here we modify these functions for scalars only - instead using only scalar operations: - `pzero`: `Scalar(0)` for all scalars. - `ptrue`: `Scalar(1)` for non-trivial scalars, bitset to one bits for trivial scalars. - `pselect`: ternary select comparing mask to `Scalar(0)` for all scalars - `pand`, `por`, `pxor`, `pnot`: use operators `&`, `\|`, `^`, `~` for all integer or non-trivial scalars, otherwise apply bytewise. For non-scalar types, the original implementations are used to maintain compatibility and minimize the number of changes. Fixes #2201.	2021-08-03 08:44:28 -07:00
Antonio Sanchez	7880f10526	Enable equality comparisons on GPU. Since `std::equal_to::operator()` is not a device function, it fails on GPU. On my device, I seem to get a silent crash in the kernel (no reported error, but the kernel does not complete). Replacing this with a portable version enables comparisons on device. Addresses #2292 - would need to be cherry-picked. The 3.3 branch also requires adding `EIGEN_DEVICE_FUNC` in `BooleanRedux.h` to get fully working.	2021-08-03 01:53:31 +00:00
arthurfeeney	a77638387d	Fixes #1387 for compilation error in JacobiSVD with HouseholderQRPreconditioner that occurs when input is a compile-time row vector.	2021-07-20 20:11:22 +00:00
Antonio Sanchez	1e6c6c1576	Replace memset with fill to work for non-trivial scalars. For custom scalars, zero is not necessarily represented by a zeroed-out memory block (e.g. gnu MPFR). We therefore cannot rely on `memset` if we want to fill a matrix or tensor with zeroes. Instead, we should rely on `fill`, which for trivial types does end up getting converted to a `memset` under-the-hood (at least with gcc/clang). Requires adding a `fill(begin, end, v)` to `TensorDevice`. Replaced all potentially bad instances of memset with fill. Fixes #2245.	2021-07-08 18:34:41 +00:00
Kolja Brix	a59cf78c8d	Add Doxygen-style documentation to main.h.	2021-07-07 18:23:59 +00:00
Antonio Sanchez	154f00e9ea	Fix inverse nullptr/asan errors for LU. For empty or single-column matrices, the current `PartialPivLU` currently dereferences a `nullptr` or accesses memory out-of-bounds. Here we adjust the checks to avoid this.	2021-07-01 13:41:04 -07:00

1 2 3 4 5 ...

2735 Commits