Simplify SIMD operations #284

lamphamsy · 2019-06-12T15:34:13Z

NB: the PR is split/extracted from the big one #282

The purpose is to eliminate as much as possible branches, particularly in essential operations s.t. modular operations.

Currently, we support vectorized operations of FNT in the cases:

FNT(257) using uint16_t and uint32_t
FNT(65537) using uint32_t

But the support of FNT(257) using uint32_t raises additional branches of checking the cardinal of the field. And its encoding/decoding speeds are obviously smaller than the two other cases: FNT(257) using uin16_t and FNT(65537) using uint32_t.

Hence, for FNT, we support only FNT(257) using uin16_t and FNT(65537) using uint32_t.

slaperche-scality

Great PR, lot of simplifications.

It's really nice 🙂

src/simd_128.h

test/ec_driver.cpp

src/simd_256.h

src/simd_fnt.h

src/simd_radix2_fft.h

slaperche-scality · 2019-06-13T11:26:17Z

test/quadiron_c_utest.cpp

+            }
+        }
+
+        // FIXME: for non-systematic FNT, `quadiron_fnt32_decode` will


IMHO, it's cleaner to have a copy of _data/call quadiron_fnt32_decode on a copy instead of relying on the order of the test.

yes, it's better. A copy of encoded fragments will be used to avoid such cases.

CMakeLists.txt

scripts/test_ec.sh

Some global variables are no longer used as we support only FNT for `w=T/2`, i.e. - FNT(257) using uint16_t - FNT(65537) using uint32_t To clarify reader, we use const reference for VecType variable if it's necessary.

The only support FNT with `w=T/2`, i.e. - FNT(257) using uint16_t - FNT(65537) using uint32_t gives advantages: - Operations are simplified by avoiding the argument cardinal. - Some branches can be avoided.

In butterfly operations, there are three cases depending on the coefficient `r`: - r = 1 - r = q - 1 - 1 < r < q - 1 We use an enum class to clarify such cases.

according changes in modular arithmetics

lamphamsy · 2019-06-14T10:29:04Z

@slaperche-scality : it's updated. Thanks for your next reviews :)

test/ec_driver.cpp

- Reset metadata of decoded data: for non-systematic code, we use first k parities to store decoded data. These metadata should be reset. - Remove an useless initialisation of data_vec for non-systematic codes.

Note that for non-systematic FNT, in encoding and decoding of QuadIron C API, input data will be overwritten by output data: - `quadiron_fnt32_encode` will store first `k` parities in the input data buffers, and the next `m` parities in the usual parity buffers. - `quadiron_fnt32_decode` will overwrite input data pointers (that stores actually encoded fragments) by decoded data. In the test, coded fragments will be stored to use correct fragments. They will be used to check reconstructed data.

lamphamsy added the WIP Work In Progress label Jun 12, 2019

lamphamsy force-pushed the eh/simd_multi_levels branch 4 times, most recently from fcad024 to fae4de8 Compare June 13, 2019 10:24

lamphamsy requested a review from slaperche-scality June 13, 2019 10:24

lamphamsy removed the WIP Work In Progress label Jun 13, 2019

slaperche-scality suggested changes Jun 13, 2019

View reviewed changes

lamphamsy added 5 commits June 13, 2019 17:45

Vectorized essential operations: FNT supports only w=T/2

7bc8049

Some global variables are no longer used as we support only FNT for `w=T/2`, i.e. - FNT(257) using uint16_t - FNT(65537) using uint32_t To clarify reader, we use const reference for VecType variable if it's necessary.

Vectorized modular arithmetics: FNT supports only w=T/2

5c799eb

The only support FNT with `w=T/2`, i.e. - FNT(257) using uint16_t - FNT(65537) using uint32_t gives advantages: - Operations are simplified by avoiding the argument cardinal. - Some branches can be avoided.

[SIMD] More readable in Butterfly operations

3f0f4c5

In butterfly operations, there are three cases depending on the coefficient `r`: - r = 1 - r = q - 1 - 1 < r < q - 1 We use an enum class to clarify such cases.

Update vectorized nf4 and RingModN operations

f0bcce7

according changes in modular arithmetics

Update FecTest: FecFnt supports only w=T/2

9437297

lamphamsy force-pushed the eh/simd_multi_levels branch 2 times, most recently from 388d6c0 to 09181fb Compare June 14, 2019 03:44

lamphamsy mentioned this pull request Jun 17, 2019

Unit tests for vectorised FNT operations #285

Merged

1 task

slaperche-scality reviewed Jun 17, 2019

View reviewed changes

test/ec_driver.cpp Outdated Show resolved Hide resolved

test/ec_driver.cpp Outdated Show resolved Hide resolved

slaperche-scality reviewed Jun 18, 2019

View reviewed changes

test/ec_driver.cpp Outdated Show resolved Hide resolved

lamphamsy added 6 commits June 18, 2019 17:05

Update ecdriver: FecFnt supports only w=T/2

2792ac0

[CI] Update benchmark script: FNT supports only w=T/2

a63a6bb

Remove long codelength in ec_driver's test

7c26d5c

Add link time optimization flag

00e7637

QuadIron C API: make it safer

321c7c9

- Reset metadata of decoded data: for non-systematic code, we use first k parities to store decoded data. These metadata should be reset. - Remove an useless initialisation of data_vec for non-systematic codes.

lamphamsy force-pushed the eh/simd_multi_levels branch from 856fe81 to 7147e6f Compare June 18, 2019 15:06

slaperche-scality approved these changes Jun 18, 2019

View reviewed changes

lamphamsy merged commit 8c90b65 into master Jun 18, 2019

lamphamsy deleted the eh/simd_multi_levels branch June 28, 2019 13:05

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Simplify SIMD operations #284

Simplify SIMD operations #284

Uh oh!

lamphamsy commented Jun 12, 2019 •

edited

Loading

Uh oh!

slaperche-scality left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

slaperche-scality Jun 13, 2019

Uh oh!

lamphamsy Jun 14, 2019

Uh oh!

Uh oh!

Uh oh!

lamphamsy commented Jun 14, 2019

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Simplify SIMD operations #284

Simplify SIMD operations #284

Uh oh!

Conversation

lamphamsy commented Jun 12, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

slaperche-scality left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

slaperche-scality Jun 13, 2019

Choose a reason for hiding this comment

Uh oh!

lamphamsy Jun 14, 2019

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

lamphamsy commented Jun 14, 2019

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

lamphamsy commented Jun 12, 2019 •

edited

Loading