Commit Graph

  • d194167149 Fix the bug using neon instruction fmla for data type half Lianhuang Li 2022-12-01 17:28:57 +0000
  • 31ab62d347 Add support for Power10 (AltiVec) MMA instructions for bfloat16. Pedro Caldeira 2022-11-30 23:33:37 +0000
  • dcb042a87d Fix serialization for non-compressed matrices. Antonio Sánchez 2022-11-30 18:16:47 +0000
  • 2260e11eb0 Fix reshape strides when input has non-zero inner stride. Antonio Sánchez 2022-11-29 19:39:29 +0000
  • 23524ab6fc Changing BiCGSTAB parameters initialization so that it works with custom types Alexandre Hoffmann 2022-11-29 19:37:46 +0000
  • ab2b26fbc2 Fix sparseLU solver when destination has a non-unit stride. Antonio Sánchez 2022-11-29 19:37:03 +0000
  • 551eebc8ca Add synchronize method to all devices. Antonio Sánchez 2022-11-29 19:35:02 +0000
  • b7551bff92 Fix a bunch of annoying compiler warnings in tests Charles Schlosser 2022-11-21 20:07:19 +0000
  • e7b1ad0315 Add serialization for sparse matrix and sparse vector. Antonio Sánchez 2022-11-21 19:43:07 +0000
  • 044f3f6234 Fix bug in handmade_aligned_realloc Charles Schlosser 2022-11-18 22:35:31 +0000
  • 6728683938 Small cleanup of IDRS.h Chris 2022-11-16 13:51:23 +0000
  • 02805bd56c Fix AVX2 psignbit Charles Schlosser 2022-11-16 13:43:11 +0000
  • 399ce1ed63 Fix duplicate execution code for Power 8 Altivec in pstore_partial. Chip Kerchner 2022-11-16 13:41:42 +0000
  • 6431dfdb50 Cross product for vectors of size 2. Fixes #1037 Gabriele Buondonno 2022-11-15 22:39:42 +0000
  • 8588d8c74b Correct pnegate for floating-point zero. Antonio Sánchez 2022-11-15 18:07:23 +0000
  • 5eacb9e117 Put brackets around unsigned type names. Antonio Sanchez 2022-11-15 09:09:08 -0800
  • 37e40dca85 Fix ambiguity in PPC for vec_splats call. Antonio Sánchez 2022-11-14 18:58:16 +0000
  • 7dc6db75d4 Fix typo in CholmodSupport Antonio Sánchez 2022-11-08 23:49:56 +0000
  • 9b6d624eab fix neon Charles Schlosser 2022-11-08 20:03:01 +0000
  • 7e398e9436 Add missing return keyword in psignbit for NEON. Rasmus Munk Larsen 2022-11-04 16:13:09 +0000
  • 82b152dbe7 Add signbit function Charles Schlosser 2022-11-04 00:31:20 +0000
  • 8f8e36458f Remove recently added sparse assert in SparseMapBase. Antonio Sánchez 2022-11-03 17:29:05 +0000
  • 01a31b81b2 Remove unused parameter name. Antonio Sanchez 2022-11-01 15:51:25 -0700
  • c5b896c5a3 Allow empty matrices to be resized. Antonio Sánchez 2022-10-27 20:33:35 +0000
  • 886aad1361 Disable patan for double on PPC. Antonio Sánchez 2022-10-27 17:56:08 +0000
  • ab407b2b6e Fix handmade_aligned_malloc offset computation. Antonio Sánchez 2022-10-27 17:33:47 +0000
  • adb30efb25 Add assert for invalid outerIndexPtr array in SparseMapBase. Antonio Sánchez 2022-10-26 22:51:33 +0000
  • c27d1abe46 Fix pragma check for disabling fastmath. Antonio Sánchez 2022-10-26 22:50:57 +0000
  • a226371371 Change handmade_aligned_malloc/realloc/free to store a 1 byte offset instead of absolute address Charles Schlosser 2022-10-22 22:51:31 +0000
  • bf48d46338 Explicitly state that indices must be sorted. Antonio Sánchez 2022-10-19 18:15:29 +0000
  • 3bb6a48d8c Fix bug atan2 Rasmus Munk Larsen 2022-10-12 23:49:32 +0000
  • 14c847dc0e Refactor special values test for pow, and add a similar test for atan2 Rasmus Munk Larsen 2022-10-12 20:12:08 +0000
  • 462758e8a3 Don't use generic sign function for sign(complex) unless it is vectorizable Rasmus Munk Larsen 2022-10-12 16:03:29 +0000
  • c0d6a72611 Use pnegate(pzero(x)) as a generic way to generate -0.0. Some compiler do not handle the literal -0.0 properly in fastmath mode. Rasmus Munk Larsen 2022-10-12 01:57:05 +0000
  • 7846c7387c Eigen/Sparse: fix warnings -Wunused-but-set-variable Laurent Rineau 2022-10-11 17:37:04 +0000
  • 3167544873 Handle NaN inputs to atan2. Rasmus Munk Larsen 2022-10-10 19:36:36 -0700
  • 72db3f0fa5 Remove references to M_PI_2 and M_PI_4. Rasmus Munk Larsen 2022-10-11 00:27:16 +0000
  • d6bc062591 Remove reference to EIGEN_HAS_CXX11_MATH. Rasmus Munk Larsen 2022-10-10 23:38:28 +0000
  • 5ceed0d57f Guard GCC-specific pragmas with "#ifdef EIGEN_COMP_GNUC" Rasmus Munk Larsen 2022-10-10 20:38:53 +0000
  • 528b68674c [clang-format] Add a few macros to AttributeMacros Alexander Richardson 2022-10-10 16:44:47 +0000
  • e95c4a837f Simpler range reduction strategy for atan<float>(). Rasmus Munk Larsen 2022-10-04 18:11:00 +0000
  • 80efbfdeda Unconditionally enable CXX11 math. Antonio Sánchez 2022-10-04 17:37:47 +0000
  • e5794873cb Replace assert with eigen_assert. Antonio Sánchez 2022-10-04 17:11:23 +0000
  • 7d6a9925cc Fix 4x4 inverse when compiling with -Ofast. Antonio Sánchez 2022-10-04 16:05:49 +0000
  • 1414a76fa9 Only vectorize atan<double> for Altivec if VSX is available. Rasmus Munk Larsen 2022-10-03 22:06:58 +0000
  • c475228b28 Vectorize atan() for double. Rasmus Munk Larsen 2022-10-01 01:49:30 +0000
  • 1e1848fdb1 Add a vectorized implementation of atan2 to Eigen. Rasmus Munk Larsen 2022-09-28 20:46:49 +0000
  • b3bf8d6a13 Try to reduce size of GEBP kernel for non-ARM targets. Rasmus Munk Larsen 2022-09-28 02:37:18 +0000
  • 13b69fc1b0 Try to reduce compilation time/memory for GEBP kernel using EIGEN_IF_CONSTEXPR Rasmus Munk Larsen 2022-09-23 20:09:42 +0000
  • 3c4637640b Remove unused typedef. Rasmus Munk Larsen 2022-09-23 19:11:31 +0000
  • ed8cda3ce4 Move EIGEN_NEON_GEBP_NR macro to the right place in GeneralBlockPanelKernel.h Rasmus Munk Larsen 2022-09-23 02:24:27 +0000
  • e2ea866515 Add a macro to set the nr trait in the BEBP kernel for NEON. Rasmus Munk Larsen 2022-09-22 23:56:34 +0000
  • 23299632c2 Use 3px8/2px8/1px8/1x8 gebp_kernel on arm64-neon Lianhuang Li 2022-09-21 16:36:40 +0000
  • 7b2901e2aa Add vectorized integer division for int32 with AVX512, AVX or SSE. Rasmus Munk Larsen 2022-09-21 00:27:23 +0000
  • 5ffe7b92e0 [ROCm] fixed gpuGetDevice unused message Chao Chen 2022-09-20 21:38:20 +0000
  • f913a40678 Revert "Add AVX int32_t pdiv" Rasmus Munk Larsen 2022-09-16 22:48:08 +0000
  • 273e0c884e Revert "Add constexpr, test for C++14 constexpr." Rasmus Munk Larsen 2022-09-16 21:14:29 +0000
  • ea84e7ad63 Add AVX int32_t pdiv Charles Schlosser 2022-09-16 17:06:29 +0000
  • dceb779ecd Fix test for pow with mixed integer types. We do not convert the exponent if it is an integer type. Rasmus Munk Larsen 2022-09-12 15:51:27 -0700
  • afc014f1b5 Allow mixed types for pow(), as long as the exponent is exactly representable in the base type. Rasmus Munk Larsen 2022-09-12 21:55:30 +0000
  • b2c82a9347 Remove bad skew_symmetric_matrix3 test. Antonio Sánchez 2022-09-10 07:08:37 +0000
  • e8a2aa24a2 Fix a couple of issues with unary pow(): Rasmus Munk Larsen 2022-09-09 17:21:11 +0000
  • 07d0759951 [ROCm] Fix for sparse matrix related breakage on ROCm. Rohit Santhanam 2022-09-09 14:41:00 +0000
  • fb212c745d Fix g++-6 constexpr and c++20 constexpr build errors. Antonio Sánchez 2022-09-09 03:41:45 +0000
  • ec9c7163a3 Feature/skew symmetric matrix3 Thomas Gloor 2022-09-08 20:44:40 +0000
  • 311ba66f7c Fix realloc for non-trivial types. Antonio Sánchez 2022-09-08 19:39:36 +0000
  • 3c37dd2a1d Tweak bound for pow to account for floating-point types. Antonio Sánchez 2022-09-08 17:40:45 +0000
  • f9dfda28ab Add missing comparison operators for GPU packets. Rasmus Munk Larsen 2022-09-07 14:10:02 -0700
  • 242325eca7 Remove unused variable. Rasmus Munk Larsen 2022-09-07 20:46:44 +0000
  • 133498c329 Add constexpr, test for C++14 constexpr. Tobias Schlüter 2022-03-05 19:02:12 +0900
  • 69f50e3a67 Adjust overflow threshold bound for pow tests. Antonio Sánchez 2022-09-06 19:53:29 +0000
  • 3e44f960ed Reduce compiler warnings for tests. Antonio Sanchez 2022-09-06 11:19:38 -0700
  • b7e21d4e38 Call check_that_malloc_is_allowed() in aligned_realloc() Florian Richer 2022-09-05 09:11:08 +0200
  • 6e83e906c2 fix typo in doc/TutorialSparse.dox Gilles Aouizerate 2022-09-04 21:20:21 +0000
  • 525f066671 fixed msvc compilation error in GeneralizedEigenSolver.h Michael Palomas 2022-09-04 17:50:43 +0000
  • f241a2c18a Add asserts for index-out-of-bounds in IndexedView. Antonio Sánchez 2022-09-02 17:28:03 +0000
  • f5364331eb Fix some cmake issues. Antonio Sánchez 2022-09-02 16:43:14 +0000
  • d816044b6e Fix mixingtypes tests. Antonio Sánchez 2022-09-02 15:30:13 +0000
  • 94cc83faa1 2 typos fix in the 3rd table. Gilles Aouizerate 2022-08-31 19:54:42 +0000
  • 30c42222a6 Fix some test build errors in new unary pow. Antonio Sánchez 2022-08-30 17:24:14 +0000
  • bd393e15c3 Vectorize acos, asin, and atan for float. Rasmus Munk Larsen 2022-08-29 19:49:33 +0000
  • e5af9f87f2 Vectorize pow for integer base / exponent types Charles Schlosser 2022-08-29 19:23:54 +0000
  • 8acbf5c11c re-enable pow for complex types chuckyschluz 2022-08-26 17:29:02 -0400
  • 7064ed1345 Specialize psign<Packet8i> for AVX2, don't vectorize psign<bool>. Rasmus Munk Larsen 2022-08-26 17:02:37 +0000
  • 98e51c9e24 Avoid undefined behavior in array_cwise test due to signed integer overflow Rasmus Munk Larsen 2022-08-26 16:19:03 +0000
  • a7c1cac18b Fix GeneralizedEigenSolver::info() and Asserts Arthur 2022-08-25 22:05:04 +0000
  • 714678fc6c Add missing ptr in realloc call. Antonio Sanchez 2022-08-24 22:04:04 -0700
  • b2a13c9dd1 Sparse Core: Replace malloc/free with conditional_aligned Charles Schlosser 2022-08-23 21:44:22 +0000
  • 6aad0f821b Fix psign for unsigned integer types, such as bool. Rasmus Munk Larsen 2022-08-22 20:19:35 +0000
  • 1a09defce7 Protect new pblend implementation with EIGEN_VECTORIZE_AVX2 Rasmus Munk Larsen 2022-08-22 18:28:03 +0000
  • 7c67dc67ae Use proper double word division algorithm for pow<double>. Gives 11-15% speedup. Rasmus Munk Larsen 2022-08-17 18:36:23 +0000
  • 7a3b667c43 Add support for AVX512-FP16 for vectorizing half precision math Matthew Sterrett 2022-08-17 18:15:21 +0000
  • 76a669fb45 add fixed power unary operation Charles Schlosser 2022-08-16 21:32:36 +0000
  • 39fcc89798 Removed unnecessary checks for FP16C Matthew Sterrett 2022-08-16 18:14:41 +0000
  • 2f7cce2dd5 [SYCL] Fix some SYCL tests Romain Biessy 2022-08-16 17:37:54 +0000
  • 27367017bd Disable bad "deprecated warning" edge-case in BDCSVD Arthur 2022-08-11 18:43:31 +0000
  • b8e93bf589 Eliminate bool bitwise warnings. Antonio Sánchez 2022-08-09 22:42:30 +0000
  • 66ea0c09fd Don't double-define Half functions on aarch64 Lexi Bromfield 2022-08-09 20:00:34 +0000
  • 97e0784dc6 Vectorize the sign operator in Eigen. Rasmus Munk Larsen 2022-08-09 19:54:57 +0000
  • be20207d10 Fix vectorized Jacobi Rotation Arthur 2022-08-08 19:29:56 +0000