Why use floor divide in shape_div? #1770

XieXiating · 2024-09-02T09:42:12Z

XieXiating
Sep 2, 2024

Q: I am using the composition() function and encountered an unexpected result. My input was:

lhs = (_256,(_32,_4)):(_32,(_1,_8192))
rhs = (200,13):(_1@0,_8@1)

However, the result was:

(200,(4,3)):(_32,(_8,_8192))

This is not what I expected. I was expecting:

(200,(4,4)):(_32,(_8,_8192))

I believe the issue arises because the domain<1> of rhs is 13 instead of 12.

And, I found shape_div() which is used in composition_impl() use floor divided rather than ceil divided.

Can you help clarify this behavior? Thanks

Answered by ccecka

Sep 3, 2024

After a cup of coffee on an actual workday, I realize that this particular composition case should fail and the current behavior is correct. One post-condition of composition is that the result is compatible with the rhs input, so composition can never perform any rounding at all. Because there is no possible output that satisfies all of the post-conditions of composition, it should fail on these inputs (perhaps with better runtime assertions, of course).

There is a related set of known artificial limitations around composition and logical_divide that can be loosened, but this problem is not an example of them.

View full answer

ccecka · 2024-09-02T23:32:42Z

ccecka
Sep 2, 2024

Good catch. At the moment, this is considered a violation of the "divisibility condition" mentioned in the documentation. The static version fails

// Fails with shape_div static assertion, indicating a violation of the divisibility condition
composition(make_layout(make_shape(4_s, 4_s), LayoutRight{}),   // (_4,_4):(_4,_1)
            make_layout(13_s));                                 // _13:_1

That said, I agree that the divisibility condition is actually too tight in cases like these and the above SHOULD work -- this is a known class of bugs. Fortunately, we have not found any applications that need this generalization, but, unfortunately, supporting it is a bit more complex than simply rounding up. In the near future, I plan to release a much more formal treatment of CuTe in a whitepaper along with some non-critical code updates and generalizations like this one.

3 replies

XieXiating Sep 3, 2024
Author

Thanks!

ccecka Sep 3, 2024

After a cup of coffee on an actual workday, I realize that this particular composition case should fail and the current behavior is correct. One post-condition of composition is that the result is compatible with the rhs input, so composition can never perform any rounding at all. Because there is no possible output that satisfies all of the post-conditions of composition, it should fail on these inputs (perhaps with better runtime assertions, of course).

There is a related set of known artificial limitations around composition and logical_divide that can be loosened, but this problem is not an example of them.

Answer selected by XieXiating

XieXiating Sep 6, 2024
Author

Got it. Thanks for the clarification!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Why use floor divide in shape_div? #1770

Uh oh!

{{title}}

Uh oh!

Replies: 1 comment 3 replies

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

Why use floor divide in shape_div? #1770

Uh oh!

XieXiating Sep 2, 2024

Replies: 1 comment · 3 replies

Uh oh!

ccecka Sep 2, 2024

Uh oh!

XieXiating Sep 3, 2024 Author

Uh oh!

ccecka Sep 3, 2024

Uh oh!

XieXiating Sep 6, 2024 Author

XieXiating
Sep 2, 2024

Replies: 1 comment 3 replies

ccecka
Sep 2, 2024

XieXiating Sep 3, 2024
Author

XieXiating Sep 6, 2024
Author