Rasmus Munk Larsen
e8a2aa24a2
Fix a couple of issues with unary pow():
2022-09-09 17:21:11 +00:00
Rohit Santhanam
07d0759951
[ROCm] Fix for sparse matrix related breakage on ROCm.
2022-09-09 14:41:00 +00:00
Antonio Sánchez
fb212c745d
Fix g++-6 constexpr and c++20 constexpr build errors.
2022-09-09 03:41:45 +00:00
Thomas Gloor
ec9c7163a3
Feature/skew symmetric matrix3
2022-09-08 20:44:40 +00:00
Antonio Sánchez
311ba66f7c
Fix realloc for non-trivial types.
2022-09-08 19:39:36 +00:00
Rasmus Munk Larsen
f9dfda28ab
Add missing comparison operators for GPU packets.
2022-09-07 21:13:45 +00:00
Tobias Schlüter
133498c329
Add constexpr, test for C++14 constexpr.
2022-09-07 03:42:34 +00:00
Antonio Sanchez
3e44f960ed
Reduce compiler warnings for tests.
2022-09-06 18:20:56 +00:00
Florian Richer
b7e21d4e38
Call check_that_malloc_is_allowed() in aligned_realloc()
2022-09-06 18:00:37 +00:00
Michael Palomas
525f066671
fixed msvc compilation error in GeneralizedEigenSolver.h
2022-09-04 17:50:43 +00:00
Antonio Sánchez
f241a2c18a
Add asserts for index-out-of-bounds in IndexedView.
2022-09-02 17:28:03 +00:00
Antonio Sánchez
30c42222a6
Fix some test build errors in new unary pow.
2022-08-30 17:24:14 +00:00
Rasmus Munk Larsen
bd393e15c3
Vectorize acos, asin, and atan for float.
2022-08-29 19:49:33 +00:00
Charles Schlosser
e5af9f87f2
Vectorize pow for integer base / exponent types
2022-08-29 19:23:54 +00:00
chuckyschluz
8acbf5c11c
re-enable pow for complex types
2022-08-26 17:29:02 -04:00
Rasmus Munk Larsen
7064ed1345
Specialize psign<Packet8i> for AVX2, don't vectorize psign<bool>.
2022-08-26 17:02:37 +00:00
Rasmus Munk Larsen
98e51c9e24
Avoid undefined behavior in array_cwise test due to signed integer overflow
2022-08-26 16:19:03 +00:00
Arthur
a7c1cac18b
Fix GeneralizedEigenSolver::info() and Asserts
2022-08-25 22:05:04 +00:00
Antonio Sanchez
714678fc6c
Add missing ptr in realloc call.
2022-08-24 22:04:04 -07:00
Charles Schlosser
b2a13c9dd1
Sparse Core: Replace malloc/free with conditional_aligned
2022-08-23 21:44:22 +00:00
Rasmus Munk Larsen
6aad0f821b
Fix psign for unsigned integer types, such as bool.
2022-08-22 20:19:35 +00:00
Rasmus Munk Larsen
1a09defce7
Protect new pblend implementation with EIGEN_VECTORIZE_AVX2
2022-08-22 18:28:03 +00:00
Rasmus Munk Larsen
7c67dc67ae
Use proper double word division algorithm for pow<double>. Gives 11-15% speedup.
2022-08-17 18:36:23 +00:00
Matthew Sterrett
7a3b667c43
Add support for AVX512-FP16 for vectorizing half precision math
2022-08-17 18:15:21 +00:00
Charles Schlosser
76a669fb45
add fixed power unary operation
2022-08-16 21:32:36 +00:00
Matthew Sterrett
39fcc89798
Removed unnecessary checks for FP16C
2022-08-16 18:14:41 +00:00
Romain Biessy
2f7cce2dd5
[SYCL] Fix some SYCL tests
2022-08-16 17:37:54 +00:00
Arthur
27367017bd
Disable bad "deprecated warning" edge-case in BDCSVD
2022-08-11 18:43:31 +00:00
Lexi Bromfield
66ea0c09fd
Don't double-define Half functions on aarch64
2022-08-09 20:00:34 +00:00
Rasmus Munk Larsen
97e0784dc6
Vectorize the sign operator in Eigen.
2022-08-09 19:54:57 +00:00
Arthur
be20207d10
Fix vectorized Jacobi Rotation
2022-08-08 19:29:56 +00:00
Rasmus Munk Larsen
7a87ed1b6a
Fix code and unit test for a few corner cases in vectorized pow()
2022-08-08 18:48:36 +00:00
Chip Kerchner
9e0afe0f02
Fix non-VSX PowerPC build
2022-08-08 18:18:17 +00:00
Chip Kerchner
84a9d6fac9
Fix use of Packet2d type for non-VSX.
2022-08-03 20:48:13 +00:00
Chip Kerchner
ce60a7be83
Partial Packet support for GEMM real-only (PowerPC). Also fix compilation warnings & errors for some conditions in new API.
2022-08-03 18:15:19 +00:00
Antonio Sánchez
5a1c7807e6
Fix inner iterator for sparse block.
2022-08-03 17:26:12 +00:00
Antonio Sánchez
7896c7dc6b
Use numext::sqrt in ConjugateGradient.
2022-07-29 20:17:23 +00:00
Ilya Tokar
e618c4a5e9
Improve pblend AVX implementation
2022-07-29 18:45:33 +00:00
sjusju
ef4654bae7
Add true determinant to QR and it's variants
2022-07-29 18:24:14 +00:00
Alexander Richardson
b7668c0371
Avoid including <sstream> with EIGEN_NO_IO
2022-07-29 18:02:51 +00:00
John Mather
7dd3dda3da
Updated AccelerateSupport documentation after PR 966.
2022-07-29 17:42:31 +00:00
Julian Kent
69714ff613
Add Sparse Subset of Matrix Inverse
2022-07-28 18:04:35 +00:00
Antonio Sánchez
34780d8bd1
Include immintrin.h header for enscripten.
2022-07-22 02:27:42 +00:00
Antonio Sánchez
2cf4d18c9c
Disable AVX512 GEMM kernels by default.
2022-07-20 21:22:48 +00:00
Charles Schlosser
a678a3e052
Fix aligned_realloc to call check_that_malloc_is_allowed() if ptr == 0
2022-07-19 20:59:07 +00:00
b-shi
4a56359406
Add option to disable avx512 GEBP kernels
2022-07-18 17:59:09 +00:00
Mathieu Westphal
1092574b26
Fix wrong doxygen group usage
2022-07-12 13:22:46 +02:00
Antonio Sánchez
bb51d9f4fa
Fix ODR violations.
2022-07-09 04:56:36 +00:00
Chip Kerchner
84cf3ff18d
Add pload_partial, pstore_partial (and unaligned versions), pgather_partial, pscatter_partial, loadPacketPartial and storePacketPartial.
2022-06-27 19:18:00 +00:00
Chip Kerchner
c603275dc9
Better performance for Power10 using more load and store vector pairs for GEMV
2022-06-27 18:11:55 +00:00
Antonio Sánchez
bc2ab81634
Eliminate undef warnings when not compiling for AVX512.
2022-06-24 15:10:10 +00:00
Antonio Sánchez
0e083b172e
Use numext::sqrt in Householder.h.
2022-06-21 16:29:59 +00:00
b-shi
37673ca1bc
AVX512 TRSM kernels use alloca if EIGEN_NO_MALLOC requested
2022-06-17 18:05:26 +00:00
Chip Kerchner
4d1c16eab8
Fix tanh and erf to use vectorized version for EIGEN_FAST_MATH in VSX.
2022-06-15 16:06:43 +00:00
Mehdi Goli
7ea823e824
[SYCL-Spec] According to [SYCL-2020 spec](...
2022-06-13 15:52:29 +00:00
Arthur
ba4d7304e2
Document DiagonalBase
2022-06-08 17:46:32 +00:00
Binhao Qin
95463b59bc
Mark index_remap as EIGEN_DEVICE_FUNC in src/Core/Reshaped.h ( Fixes #2493 )
2022-06-07 20:10:47 +00:00
Shi, Brian
28812d2ebb
AVX512 TRSM Kernels respect EIGEN_NO_MALLOC
2022-06-07 11:28:42 -07:00
Arthur
14aae29470
Provide DiagonalMatrix Product and Initializers
2022-06-06 21:43:22 +00:00
aaraujom
8fbb76a043
Fix build issues with MSVC for AVX512
2022-06-03 14:55:40 +00:00
aaraujom
d49ede4dc4
Add AVX512 s/dgemm optimizations for compute kernel (2nd try)
2022-05-28 02:00:21 +00:00
Arthur
705ae70646
Add R-Bidiagonalization step to BDCSVD
2022-05-27 02:00:24 +00:00
Mario Rincon-Nigro
e99163e732
fix: issue 2481: LDLT produce wrong results with AutoDiffScalar
2022-05-25 15:26:10 +00:00
Chip Kerchner
aa8b7e2c37
Add subMappers to Power GEMM packing - simplifies the address calculations (10% faster)
2022-05-23 15:18:29 +00:00
Guoqiang QI
32a3f9ac33
Improve plogical_shift_* implementations and fix typo in SVE/PacketMath.h
2022-05-23 09:33:49 +00:00
Eisuke Kawashima
ac5c83a3f5
unset executable flag
2022-05-22 22:47:43 +09:00
Antonio Sanchez
481a4a8c31
Fix BDCSVD condition for failing with numerical issue.
2022-05-20 08:18:31 -07:00
Antonio Sánchez
028ab12586
Prevent BDCSVD crash caused by index out of bounds.
2022-05-19 22:29:48 +00:00
Antonio Sánchez
9b9496ad98
Revert "Add AVX512 optimizations for matrix multiply"
...
This reverts commit 25db0b4a82
2022-05-13 18:50:33 +00:00
aaraujom
25db0b4a82
Add AVX512 optimizations for matrix multiply
2022-05-12 23:41:19 +00:00
Alex_M
2c055f8633
make diagonal matrix cols() and rows() methods constexpr
2022-05-03 10:13:37 +02:00
Chip Kerchner
c2f15edc43
Add load vector_pairs for RHS of GEMM MMA. Improved predux GEMV.
2022-04-25 16:23:01 +00:00
John Mather
9e026e5e28
Removed need to supply the Symmetric flag to UpLo argument for Accelerate LLT and LDLT
2022-04-21 20:02:10 +00:00
Chip Kerchner
44ba7a0da3
Fix compiler bugs for GCC 10 & 11 for Power GEMM
2022-04-20 15:59:00 +00:00
Chip Kerchner
b02c384ef4
Add fused multiply functions for PowerPC - pmsub, pnmadd and pnmsub
2022-04-18 16:16:32 +00:00
Rohit Santhanam
3de96caeaa
Fix HouseholderSequence.h
2022-04-17 02:46:56 +00:00
Antonio Sánchez
f845a8bb1a
Fix cwise NaN propagation for scalar input.
2022-04-16 05:07:44 +00:00
Charles Schlosser
a4bb513b99
Update HouseholderSequence.h
2022-04-15 16:56:17 +00:00
Shi, Brian
fc1d888415
Remove AVX512VL dependency in trsm
2022-04-14 12:44:24 -07:00
Antonio Sánchez
07db964bde
Restrict new AVX512 trsm to AVX512VL, rename files for consistency.
2022-04-14 16:58:32 +00:00
Charles Schlosser
67eeba6e72
Avoidable heap allocation in applyHouseholderToTheLeft
2022-04-13 18:45:36 +00:00
Antonio Sánchez
efb08e0bb5
Revert "Fix ambiguous DiagonalMatrix constructors."
...
This reverts commit a81bba962a
2022-04-12 03:54:31 +00:00
Chip Kerchner
53eec53d2a
Fix Power GEMV order of operations in predux for MMA.
2022-04-11 21:29:05 +00:00
Antonio Sánchez
a81bba962a
Fix ambiguous DiagonalMatrix constructors.
2022-04-11 19:13:25 +00:00
Tobias Schlüter
f3ba220c5d
Remove EIGEN_EMPTY_STRUCT_CTOR
2022-04-08 18:27:26 +00:00
Antonio Sánchez
5ed7a86ae9
Fix MSVC+CUDA issues.
2022-04-08 18:05:32 +00:00
Antonio Sánchez
734ed1efa6
Fix ODR issues in lapacke_helpers.
2022-04-08 15:31:30 +00:00
Antonio Sánchez
2c45a3846e
Fix some max size expressions.
2022-04-06 22:19:57 +00:00
Erik Schultheis
df87d40e34
constexpr reshape helper
2022-04-05 17:32:17 +00:00
Chip Kerchner
403fa33409
Performance improvements in GEMM for Power
2022-04-05 12:18:53 +00:00
Erik Schultheis
e1df3636b2
More constexpr helpers
2022-04-04 18:38:34 +00:00
Erik Schultheis
64909b82bd
static const class members turned into constexpr
2022-04-04 17:33:33 +00:00
William Talbot
2c0ef43b48
Added Scaling function overload for vector rvalue reference
2022-04-04 16:50:09 +00:00
Antonio Sanchez
ba2cb835aa
Add back std::remove* aliases - third-party libraries rely on these.
2022-04-01 17:02:52 +00:00
Antonio Sánchez
73b2c13bf2
Disable f16c scalar conversions for MSVC.
2022-03-30 18:35:32 +00:00
Tobias Schlüter
e22d58e816
Add is_constant_evaluated, update alignment checks
2022-03-25 04:00:58 +00:00
Erik Schultheis
b9d2900e8f
added a missing typename and fixed a unused typedef warning
2022-03-24 12:07:18 +02:00
b-shi
0611f7fff0
Add missing explicit reinterprets
2022-03-23 21:10:26 +00:00
Essex Edwards
cd3c81c3bc
Add a NNLS solver to unsupported - issue #655
2022-03-23 20:20:44 +00:00
Chip Kerchner
0699fa06fe
Split general_matrix_vector_product interface for Power into two macros - one ColMajor and RowMajor.
2022-03-23 18:09:33 +00:00
Antonio Sánchez
19a6a827c4
Optimize visitor traversal in case of RowMajor.
2022-03-23 15:27:57 +00:00
Romain Biessy
f2a3e03e9b
Fix usages of wrong namespace
2022-03-21 15:07:53 +00:00
Antonio Sánchez
4451823fb4
Fix ODR violation in trsm.
2022-03-20 15:56:53 +00:00
Antonio Sánchez
9a14d91a99
Fix AVX512 builds with MSVC.
2022-03-18 16:04:53 +00:00
Chip Kerchner
7b10795e39
Change EIGEN_ALTIVEC_ENABLE_MMA_DYNAMIC_DISPATCH and EIGEN_ALTIVEC_DISABLE_MMA flags to be like TensorFlow's...
2022-03-17 22:35:27 +00:00
Antonio Sánchez
3ca1228d45
Work around MSVC compiler bug dropping const.
2022-03-17 20:50:26 +00:00
Tobias Schlüter
40eb34bc5d
Fix RowMajorBit <-> RowMajor mixup.
2022-03-17 15:28:12 +00:00
Antonio Sanchez
e34db1239d
Fix missing pound
2022-03-16 12:26:12 -07:00
Antonio Sánchez
591906477b
Fix up PowerPC MMA flags so it builds by default.
2022-03-16 19:16:28 +00:00
b-shi
518fc321cb
AVX512 Optimizations for Triangular Solve
2022-03-16 18:04:50 +00:00
Erik Schultheis
421cbf0866
Replace Eigen type metaprogramming with corresponding std types and make use of alias templates
2022-03-16 16:43:40 +00:00
Arthur
514f90c9ff
Remove workarounds for bad GCC-4 warnings
2022-03-16 00:08:16 +00:00
Rasmus Munk Larsen
9ad5661482
Revert "Fix up PowerPC MMA flags so it builds by default."
2022-03-15 20:51:03 +00:00
Antonio Sánchez
65eeedf964
Fix up PowerPC MMA flags so it builds by default.
2022-03-15 20:22:23 +00:00
Tobias Schlüter
cb1e8228e9
Convert bit calculation to constexpr, avoid casts.
2022-03-13 22:38:36 +09:00
Rohit Santhanam
2a6be5492f
Fix construct_at compilation breakage on ROCm.
2022-03-09 16:47:53 +00:00
Duncan McBain
a3b64625e3
Remove ComputeCpp-specific code from SYCL Vptr
2022-03-08 22:44:18 +00:00
Tobias Schlüter
cd2ba9d03e
Add construct_at, destroy_at wrappers. Use throughout.
2022-03-08 20:43:22 +00:00
AlexanderMueller
dfa5176780
make SparseSolverBase and IterativeSolverBase move constructable
2022-03-08 20:03:53 +01:00
Tobias Schlüter
9883108f3a
Remove copy_bool workaround for gcc 4.3
2022-03-08 17:43:11 +00:00
John Mather
3a9d404d76
Add support for Apple's Accelerate sparse matrix solvers
2022-03-08 00:09:18 +00:00
Antonio Sánchez
0ae94456a0
Remove duplicate IsRowMajor declaration.
2022-03-04 21:22:02 +00:00
Rasmus Munk Larsen
0e6f4e43f1
Fix a few confusing comments in psincos_float.
2022-03-04 20:41:49 +00:00
Sean McBride
f1b9692d63
Removed EIGEN_UNUSED decorations from many functions that are in fact used
2022-03-03 20:19:33 +00:00
Arthur
c9ff739af1
Fix JacobiSVD_LAPACKE bindings
2022-03-03 19:24:07 +00:00
Zhuo Zhang
d0b1aef6f6
Speed lscg by using .noalias
2022-03-03 08:52:09 +00:00
Antonio Sanchez
55c7400db5
Fix enum conversion warnings in BooleanRedux.
2022-03-03 04:44:20 +00:00
Antonio Sánchez
9c07e201ff
Modified sqrt/rsqrt for denormal handling.
2022-03-02 17:20:47 +00:00
Antonio Sánchez
b48922cb5c
Fix SVD for MSVC+CUDA.
2022-03-01 21:35:22 +00:00
Yury Gitman
bf6726a0c6
Fix any/all reduction in the case of row-major layout
2022-03-01 05:27:50 +00:00
Antonio Sánchez
f03df0df53
Fix SVD for MSVC.
2022-02-28 19:53:15 +00:00
Antonio Sánchez
19c39bea29
Fix mixingtypes for g++-11.
2022-02-25 19:28:10 +00:00
Rasmus Munk Larsen
8b875dbef1
Changes to fast SQRT/RSQRT
2022-02-23 17:32:21 +00:00
Ramil Sattarov
f9b7564faa
E2K: initial support of LCC MCST compiler for the Elbrus 2000 CPU architecture
2022-02-23 17:07:34 +00:00
Arthur
cd80e04ab7
Add assert for edge case if Thin U Requested at runtime
2022-02-23 05:35:19 +00:00
Martin Heistermann
550af3938c
Fix for crash bug in SPQRSupport: Initialize pointers to nullptr to avoid free() calls of invalid pointers.
2022-02-18 16:13:28 +00:00
Antonio Sánchez
58a90c7463
Use fixed-sized U/V for fixed-sized inputs.
2022-02-16 18:31:47 +00:00
Antonio Sánchez
c367ed26a8
Make FixedInt constexpr, fix ODR of fix<N>
2022-02-16 17:47:51 +00:00
Antonio Sánchez
766087329e
Re-add svd::compute(Matrix, options) method to avoid breaking external projects.
2022-02-16 00:54:02 +00:00
Antonio Sánchez
a58af20d61
Add descriptions to Matrix typedefs.
2022-02-15 21:53:27 +00:00
Antonio Sánchez
28e008b99a
Fix sqrt/rsqrt for NEON.
2022-02-15 21:31:51 +00:00
Antonio Sanchez
23755030c9
Fix MSVC+NVCC 9.2 pragma error.
2022-02-15 10:51:32 -08:00
Erik Schultheis
7197b577fb
Remove unused macros in AVX packetmath.
...
The following macros are removed:
* EIGEN_DECLARE_CONST_Packet8f
* EIGEN_DECLARE_CONST_Packet4d
* EIGEN_DECLARE_CONST_Packet8f_FROM_INT
* EIGEN_DECLARE_CONST_Packet8i
2022-02-14 10:34:23 +00:00
Chip Kerchner
cb5ca1c901
Cleanup compiler warnings, etc from recent changes in GEMM & GEMV for PowerPC
2022-02-09 18:47:08 +00:00
Matt Keeter
cec0005c74
Return alphas() and betas() by const reference
2022-02-08 23:16:10 +00:00
Rasmus Munk Larsen
92d0026b7b
Provide a definition for numeric_limits static data members
2022-02-08 20:34:53 +00:00
Björn Ingvar Dahlgren
b94bddcde0
Typo in COD's doc: matrixR() -> matrixT()
2022-02-07 18:30:25 +00:00
Antonio Sánchez
94bed2b80c
Fix collision with resolve.h.
2022-02-07 18:17:42 +00:00
Antonio Sánchez
9441d94dcc
Revert "Make fixed-size Matrix and Array trivially copyable after C++20"
...
This reverts commit 47eac21072
2022-02-05 04:40:29 +00:00
Rasmus Munk Larsen
979fdd58a4
Add generic fast psqrt and prsqrt impls and make them correct for 0, +Inf, NaN, and negative arguments.
2022-02-05 00:20:13 +00:00