Adds a fuzz test #2233

jesslatimer · 2023-02-23T00:25:06Z

Summary of changes

Adds a fuzz test that targets the parser for each default plugin using atheris to further ensure rdflib is reliable and secure.
Fuzz tests throw thousands of different scenarios at the code, ensuring it can handle invalid and unusual inputs as well as find and test those hard-to-reach/overlooked edge cases sitting deep within your code.

Checklist

Checked that there aren't other open pull requests for
the same change.
Checked that all tests and type checking passes.
Considered granting push permissions to the PR branch,
so maintainers can fix minor issues and keep your PR up to date.

aucampia · 2023-03-26T09:26:03Z

test/fuzz_rdflib.py

+    from xml.sax import SAXParseException
+
+
+def TestOneInput(data):


I know this is the style that atheris examples use, but I would prefer rather using PEP-8 compatible naming.

Suggested change

def TestOneInput(data):

def test_one_input(data):

Also, what is happening will be clearer to reviewers if there are type hints, I am not sure what data will be here, and not sure why you check the length of data and why it must be less than 20.

Done. Let me know if anything needs more clarification.

aucampia · 2023-03-26T09:45:57Z

Thanks for the Pull request, I'm open to the idea of adding Fuzz testing to RDFLib, but I'm not that familiar with Fuzz testing, so some clarification would be helpful on these points:

How long does the fuzz tester take to run?
Does it find anything interesting, and if it does, how will it report it?
Could we integrate it with pytest?

If we do add fuzz testing, it should ideally be integrated with our CI and pipeline, so it keeps working. To do that you would have to do the following:

Add atheris as a dependency in pyproject.toml
Add a task to run atheris to Taskfile.yml

To then run as part of the CI pipeline, the simplest would possibly be to add it as an extra-task similar to this

rdflib/.github/workflows/validate.yaml

Lines 126 to 129 in 4da67f9

    
           matrix: 
        
             include: 
        
               - task: "gha:flake8" 
        
                 python-version: 3.8

This should have a corresponding gha:atheris entry in the taskfile similar to this:

rdflib/Taskfile.yml

Lines 260 to 266 in 4da67f9

    
           gha:flake8: 
        
             desc: GitHub Actions flake8 workflow 
        
             cmds: 
        
               - task: poetry:configure 
        
                 vars: 
        
                   CLI_ARGS: --no-root --only=flake8 
        
               - task: flake8

aucampia · 2023-03-26T09:46:29Z

@RDFLib/core-reviewers if anyone has some input on this, it will be appreciated.

aucampia · 2023-03-26T09:48:02Z

test/fuzz_rdflib.py

+    except (SAXParseException, IndexError, Exception):
+        pass


Having Exception here may be over broad, as that includes the other two errors and potentially errors that should not occur.

I think even IndexError may constitute an error, we should possibly consider what Exceptions make sense, and possibly clean up the exceptions raised by RDFLib to be more sane.

Agreed, I have changed the fuzz test to only except issues with inappropriate inputs (as far as I can tell - let me know if you think they could also be bugs)

jesslatimer · 2023-04-01T20:09:07Z

How long does the fuzz tester take to run?

Fuzz tests run indefinitely or until a bug is found. You can also set a time-out. If you integrate with OSS-Fuzz it will run once a day for a few hours across multiple computers.

Does it find anything interesting?

I am making a few changes to this fuzz test and did see that the IndexError is likely a bug. Would you like me to report any bugs I find? I believe all the errors I'm catching now are the result of inappropriate data being parsed, but please let me know if you would like me to remove any and report the error.

How will it report it?

Locally, when it crashes it will output the stacktrace and a crash file with the data that caused it. Through OSS-Fuzz, it will give you a web interface to view statistics like coverage, crashes, speed and memory usage, which only developers can see. Crash reports are also provided with a stacktrace and a crash file with the data that caused it.

Could we integrate it with pytest?

Theoretically yes, but it does not necessarily make sense to. Pytest runs unit tests serially with a defined start and end time, however, fuzz tests run for an arbitrary amount of time (or until they find a bug), so if the first fuzz test does not find a bug or timeout, the following tests will never run.

If we do add fuzz testing, it should ideally be integrated with our CI and pipeline, so it keeps working.

An easy way to integrate into your CI and pipeline would be through OSS-Fuzz github action. This link explains it best, but briefly, it uses CIFuzz which runs for a configurable amount of time (default is 10min) on each pull request.

ghost · 2023-04-02T14:15:12Z

@RDFLib/core-reviewers if anyone has some input on this, it will be appreciated.

I'm quite well-disposed towards this PR (thank you Jess for the initiative). There are a few RDFLIb functions/methods (both internal and API-exposed) that are likely to be even better-suited to fuzz testing than the RDF parsers. I found the fuzzing test setup for the rfc3986 Python package to be informative is this aspect ... e.g.

def fuzz_iri(data):
  fdp = atheris.FuzzedDataProvider(data)
  iri_ref = rfc3986.IRIReference.from_string(
      fdp.ConsumeUnicodeNoSurrogates(256))

jesslatimer · 2023-04-14T23:48:10Z

I'm quite well-disposed towards this PR (thank you Jess for the initiative). There are a few RDFLIb functions/methods (both internal and API-exposed) that are likely to be even better-suited to fuzz testing than the RDF parsers. I found the fuzzing test setup for the rfc3986 Python package to be informative is this aspect ...

@gjhiggins I am happy to add to this fuzzer and update it here.
Are there any particular functions/methods you would like me to focus on?

jesslatimer · 2023-04-18T18:02:22Z

pre-commit.ci autofix

for more information, see https://pre-commit.ci

jesslatimer · 2023-04-18T18:52:02Z

A nice and easy way to find the functions that will result in the most amount of code covered by the fuzzer is to integrate with OSS-Fuzz and utilise introspector. Introspector tells you the optimal functions to target that result in the maximum amount of code coverage by the fuzzer. Therefore, the most efficient way to develop a fuzzer is to integrate a small fuzzer and go from there.

If this interests you, I have submitted a PR with OSS-Fuzz to get the ball rolling but I still need a developer's primary contact email and a comment on said PR giving their approval.

aucampia · 2023-05-19T12:03:50Z

I think it is quite essential to integrate this with the CI and some Taskfile targets; otherwise we really don't get the benefit from this, as people will forget to run it, and it will also likely stop working otherwise as various things change while the fuzz testing execution does not get updated.

aucampia · 2023-05-21T18:40:31Z

PRs to V6 is closed until further notice. See this for more details:

RDFLib V6 and V7 #2395

aucampia · 2023-06-09T16:27:35Z

PRs to V6 is closed until further notice. See this for more details:

RDFLib V6 and V7 #2395

That issue is now settled and PRs are open again, but this is still the case.

I think it is quite essential to integrate this with the CI and some Taskfile targets; otherwise we really don't get the benefit from this, as people will forget to run it, and it will also likely stop working otherwise as various things change while the fuzz testing execution does not get updated.

jesslatimer mentioned this pull request Feb 23, 2023

Integrating with Google/OSS-fuzz #2234

Open

aucampia reviewed Mar 26, 2023

View reviewed changes

jesslatimer force-pushed the main branch 2 times, most recently from 4e156ab to faca78f Compare April 1, 2023 20:24

jesslatimer and others added 2 commits April 18, 2023 11:08

Adds a fuzz test

6480c2c

[pre-commit.ci] auto fixes from pre-commit.com hooks

2976b77

for more information, see https://pre-commit.ci

jesslatimer force-pushed the main branch from 1115013 to 2976b77 Compare April 18, 2023 18:08

jesslatimer marked this pull request as ready for review April 18, 2023 18:17

aucampia added the on hold Progress on this issue is blocked by something. label May 21, 2023

aucampia added needs more work The PR needs more work before being merged or further reviewed. and removed on hold Progress on this issue is blocked by something. labels Jun 2, 2023

Merge branch 'main' into main

d3832c8

		from xml.sax import SAXParseException


		def TestOneInput(data):

Adds a fuzz test #2233

Are you sure you want to change the base?

Adds a fuzz test #2233

Conversation

jesslatimer commented Feb 23, 2023

Summary of changes

Checklist

Uh oh!

aucampia Mar 26, 2023

Choose a reason for hiding this comment

Uh oh!

aucampia Mar 26, 2023

Choose a reason for hiding this comment

Uh oh!

jesslatimer Apr 1, 2023

Choose a reason for hiding this comment

Uh oh!

aucampia commented Mar 26, 2023

Uh oh!

aucampia commented Mar 26, 2023

Uh oh!

aucampia Mar 26, 2023

Choose a reason for hiding this comment

Uh oh!

jesslatimer Apr 1, 2023

Choose a reason for hiding this comment

Uh oh!

jesslatimer commented Apr 1, 2023

Uh oh!

ghost commented Apr 2, 2023

Uh oh!

jesslatimer commented Apr 14, 2023

Uh oh!

jesslatimer commented Apr 18, 2023

Uh oh!

jesslatimer commented Apr 18, 2023

Uh oh!

aucampia commented May 19, 2023

Uh oh!

aucampia commented May 21, 2023

Uh oh!

aucampia commented Jun 9, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants