Implementation of BMat16 #38

Victorin-Brunel · 2024-07-10T09:25:25Z

Here are the additions regarding BMat16 :

Creation of BMat16
Various constructors (with a vector register of 256 bits, 4 64 bits uint, or a two-dimensional vector)
Comparison operators (==, !=, <, >)
Acces operator to a particular bit at a position (i, j)
Method to set a bit at a position (i, j)
A convertor to a two-dimensional array
Bitwise or operator between two matrices
Naive transposition
Optimized transposition with vector instructions
Optimized matrix multplication
Optimized matrix multplication using BMat8 matrix multiplication
Two naive multiplications (one with the acces operator, the other with the array conversion)
Number of non-zero rows
Vector of rows
Identity matrix of size 0 to 16
Random matrix, and the possibility to specify a size from 1 to 16
Swap of two matrices
Display operator

james-d-mitchell · 2024-08-26T14:54:05Z

Closing and reopening to trigger the ci again

james-d-mitchell

I've tried BMat16's with libsemigroups (basically dropping it in everywhere we use HPCombi::BMat8 in the tests) and found no issues whatsoever. So I'm happy to merge this if you are @hivert ?

hivert · 2025-01-21T21:59:33Z

I've tried BMat16's with libsemigroups (basically dropping it in everywhere we use HPCombi::BMat8 in the tests) and found no issues whatsoever. So I'm happy to merge this if you are @hivert ?

I'd like to look a little at the generated assembly code to check that some think are properly optimized by the (a) compiler (notably some product by power of 2 which could be written as shift). I try to do it either during the week-end or next week. Other than that, I'm Ok to merge.

hivert · 2025-03-18T21:09:04Z

The code looks good. I checked:

product in the BMat16::BMat16, BMat16::operator(), BMat16::set and BMat16::to_array are properly optimized with shift.
the loop in BMat16::mult_transpose is properly unrolled.

This should be fixed easily

The doc of operator< doesn't specify the order;
the is a bitwise | on matrices but not & of bitwise complement.

There are finally some possible improvements which needs to be fixed:

This should be fixed as soon as simde provides the implementation

inline size_t BMat16::nr_rows() const noexcept {
  [...] 
  //// Vectorized version which doesn't work due to the absence of popcnt in
   /// simde
   // xpu16 tmp = _data, zero = simde_mm256_setzero_si256();
   // xpu16 x = (tmp != zero);
   // return simde_mm256_popcnt_epi16(x);
}

This one is low priority (in random which is now that used and probably a little time consuming):

    // TO DO : Instead of nulling all the cols/rows one by one, one could do
    // that at once with the proper mask

james-d-mitchell · 2025-03-19T09:30:41Z

Great! Thanks @hivert, should we then: merge this PR, and add issues for the TODOs that you mention, or would you rather that the TODOs are addressed in this PR first?

james-d-mitchell · 2025-03-19T09:36:28Z

I just rebased onto the most recent versions on main

Victorin-Brunel mentioned this pull request Jul 10, 2024

Implement larger boolean matrices BMat16 #8

Open

james-d-mitchell closed this Aug 26, 2024

james-d-mitchell reopened this Aug 26, 2024

james-d-mitchell force-pushed the main branch from 2ecaad4 to 1591168 Compare August 26, 2024 14:57

james-d-mitchell closed this Aug 26, 2024

james-d-mitchell reopened this Aug 26, 2024

james-d-mitchell mentioned this pull request Oct 10, 2024

Implementation of BMat16 #43

Closed

james-d-mitchell force-pushed the main branch from 1591168 to 76df096 Compare January 14, 2025 15:12

james-d-mitchell approved these changes Jan 14, 2025

View reviewed changes

james-d-mitchell force-pushed the main branch from fbd8def to 84cef85 Compare January 31, 2025 09:52

james-d-mitchell and others added 16 commits March 19, 2025 09:34

Test circleci

7ad21ce

20/06

377542b

21/06

7273cdb

24/06

6f17231

25/06

15152e8

26/06

df1ae78

27/06

b68c82d

08/07

4bff5b6

09/07

b36d94e

10/07

8923c21

11/07

45c2c56

12/07

4be6cda

15/07

78f850c

16/07

2dcd31a

17/07

0d0f8b9

Try fix compile with gcc-9

7b37f71

james-d-mitchell force-pushed the main branch from 84cef85 to 7b37f71 Compare March 19, 2025 09:35

james-d-mitchell mentioned this pull request Mar 20, 2025

Implement operator& for BMat16 #56

Open

james-d-mitchell merged commit 4aa10ca into libsemigroups:main Mar 20, 2025
17 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Implementation of BMat16 #38

Implementation of BMat16 #38

Uh oh!

Victorin-Brunel commented Jul 10, 2024 •

edited

Loading

Uh oh!

james-d-mitchell commented Aug 26, 2024

Uh oh!

james-d-mitchell left a comment

Uh oh!

hivert commented Jan 21, 2025

Uh oh!

hivert commented Mar 18, 2025

Uh oh!

james-d-mitchell commented Mar 19, 2025

Uh oh!

james-d-mitchell commented Mar 19, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Implementation of BMat16 #38

Implementation of BMat16 #38

Uh oh!

Conversation

Victorin-Brunel commented Jul 10, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

james-d-mitchell commented Aug 26, 2024

Uh oh!

james-d-mitchell left a comment

Choose a reason for hiding this comment

Uh oh!

hivert commented Jan 21, 2025

Uh oh!

hivert commented Mar 18, 2025

Uh oh!

james-d-mitchell commented Mar 19, 2025

Uh oh!

james-d-mitchell commented Mar 19, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Victorin-Brunel commented Jul 10, 2024 •

edited

Loading