Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

C++: Fix more FPs in cpp/overflow-buffer #18629

Merged
merged 7 commits into from
Jan 30, 2025

Conversation

MathiasVP
Copy link
Contributor

@MathiasVP MathiasVP commented Jan 29, 2025

This PR is a follow-up to #18615. This PR fixes an annoying case that happens when people write code such as:

struct S {
  struct {
    uint32_t x1;
  };
  struct {
    uint32_t x2;
  };

  void test() {
    memset(&x1, 0, sizeof(S) - offsetof(S, x1));
  }
};

The logic in #18615 wasn't enough to capture this since that PR computed the size of the buffer x1 as x1.getDeclaringType().getSize() - x1.getByteOffset() and since x1.getDeclaringType() is the anonymous struct (of size 4 bytes) and the offset is 0 we conclude that the buffer is size 4, but in reality it should be of size 8.

This PR makes use of a helper predicate initially used only in https://github.com/github/codeql/blob/main/cpp/ql/src/Security/CWE/CWE-843/TypeConfusion.ql by moving the implementation to a public member predicate on Field and generalizing the above "buffer size" calculation so that it works on nested unions/structs.

This fixes a large class of FPs internally at Microsoft 🎉

I've not done a change note since this change is covered by the change note in #18615

@github-actions github-actions bot added the C++ label Jan 29, 2025
@MathiasVP MathiasVP marked this pull request as ready for review January 30, 2025 12:06
@Copilot Copilot bot review requested due to automatic review settings January 30, 2025 12:06
@MathiasVP MathiasVP requested a review from a team as a code owner January 30, 2025 12:06
Copy link
Contributor

@Copilot Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Copilot reviewed 6 out of 6 changed files in this pull request and generated no comments.

Tip: Copilot code review supports C#, Go, Java, JavaScript, Markdown, Python, Ruby and TypeScript, with more languages coming soon. Learn more

@MathiasVP MathiasVP added the no-change-note-required This PR does not need a change note label Jan 30, 2025
Copy link
Contributor

@jketema jketema left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Some initial comments.

Comment on lines +85 to +88
// Given an object `S2 s2` the size of the buffer `&s2.s.y`
// is the size of the base object type (i.e., `S2`) minutes the offset
// of `y` relative to the type `S2` (i.e., `4`). So the size of the
// buffer is `12 - 4 = 8`.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

So "size" here includes everything between the start of s2.s.y and the end of the outermost struct? Is that correct? If so why is that a sensible notion of "size"?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Correct. This is sensible when you consider how S2 is laid out in memory:

x y z

So we compute the buffer starting at y by computing the size of the outermost struct (i.e., S2 in this case) which includes x, y, and z. We subtract how many bytes we've already "passed" in order to get to y. So the total size is the size of y + the size of z.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Right, so you're basically interested in not writing beyond the end of the struct?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yep!

* `f.getOffsetInClass(S2) = 0` holds. Likewise, if `f` represents the
* field `a`, then `f.getOffsetInClass(c) = 0` holds.
*/
int getOffsetInClass(Class c) { hasAFieldWithOffset(c, this, result) }
Copy link
Contributor

@jketema jketema Jan 30, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think we do need a change note for the predicate that's being added here.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Oh, that's a good point. I'll add that

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Fixed in 02cf458

@MathiasVP MathiasVP removed the no-change-note-required This PR does not need a change note label Jan 30, 2025
Copy link
Contributor

@jketema jketema left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@MathiasVP MathiasVP merged commit f35fea3 into github:main Jan 30, 2025
11 checks passed
jketema added a commit that referenced this pull request Feb 3, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants