Improved parser performance #372

jkronegg · 2025-02-11T15:39:56Z

🤔 What's changed?

It started as a temptative to improve StringUtils (#361), but ended up with many other small improvements, all based on JMH micro-benchmark and IntelliJ profiler.

The following improvements have been done (no public API modified):

GherkinLine:
- constructor/getTableCells(): rewrote trim+symbolCount logic to an integrated operation which trim and intent at the same time
- getTags(): replaced split by String traversal and using compiled Regexp
GherkinDocumentBuilder: compiled regexps and simplified mapping between TokenType and RuleType
GherkinDialect: precomputing list size and removed duplicates
EncodingParser: avoid split on the while file to split only the first lines

The parser is now 1.7x faster (=40%) on the very_long.feature test file, as reflected by JMH micro-benchmark:

 MyClassBenchmark.original  avgt   25  5514.929 ± 943.017  us/op
 MyClassBenchmark.modified  avgt   25  3265.629 ± 287.454  us/op

There is the JMH code to reproduce the results:

package io.cucumber.gherkin;

import io.cucumber.messages.types.Envelope;
import org.openjdk.jmh.annotations.Benchmark;
import org.openjdk.jmh.annotations.BenchmarkMode;
import org.openjdk.jmh.annotations.Mode;
import org.openjdk.jmh.annotations.OutputTimeUnit;

import java.io.IOException;
import java.nio.file.Paths;
import java.util.concurrent.TimeUnit;

public class MyClassBenchmark {
    @Benchmark
    @BenchmarkMode(Mode.AverageTime)
    @OutputTimeUnit(TimeUnit.MICROSECONDS)
    public Stream<Envelope> original() throws IOException {
        return GherkinParser.builder().build().parse(Paths.get("../testdata/good/very_long.feature"));
    }
}

On a real project with 1000 scenarios, 50 parameterTypes and 250 step definitions, the IntelliJ profiler gives for GherkinMessagesFeatureParser.parse:

original version (gherkin 31.0.0): 434 ms
modified version (this PR): 209 ms

That's 2.1x faster... 😁

⚡️ What's your motivation?

Fixes #361

🏷️ What kind of change is this?

🏦 Refactoring/debt/DX (improvement to code design, tooling, etc. without changing behaviour)

♻️ Anything particular you want feedback on?

On this PR, we can run the following test:

@Test
void test_for_profiler_parser() throws IOException {
    for (int i=0; i<1000; i++) GherkinParser.builder().build().parse( Paths.get("../testdata/good/very_long.feature"));
}

Below is the Intellij profile flame graph for this test:

The is still some little room for improvement in:

getLocation (8%): avoid using Long values in cucumber-messages when primitive types can be used, see Codegen generates inefficient Java code for Long and Boolean mandatory parameters messages#283
cucumber-messages (1%): use Java 10+ to avoid recreating immutable lists, see Codegen generates inefficient Java code for List parameters messages#282

I'm not counting the UUID.randomUUID() because it can be easily solved by selecting a faster UUID generator (e.g. IncrementingUuidGenerator) by configuring Cucumber properly.

📋 Checklist:

I agree to respect and uphold the Cucumber Community Code of Conduct
I've changed the behaviour of the code
- I have added/updated tests to cover my changes.
My change requires a change to the documentation.
- I have updated the documentation accordingly.
Users should know about my change
- I have added an entry to the "Unreleased" section of the CHANGELOG, linking to this pull request.

luke-hill · 2025-02-12T12:49:05Z

Given this is close to completion @jkronegg i'll wait til you fix up the last bits of the codegeneration task, before cutting and releasing v32

Quick question, is this really a bug fix or more of a generic change? Just noticed you popped a changelog in fixed?

jkronegg · 2025-02-12T12:53:47Z

From my point of view, performance issues are bugs (given the non-functional requirement "the application must go as fast as possible"😁). But someone else could consider this PR as an improvement given no one asked for that NFR😅.
@luke-hill as you prefer.

# Conflicts: # CHANGELOG.md

luke-hill · 2025-02-12T12:55:42Z

As/when it's reviewed by Rien he can best advise as this is a big Java improvement. I'm in no way able to advise on Java stuff

U117293 added 3 commits February 11, 2025 11:13

fix: corrected misc performance issues for #361

9a1d709

fix: improved encoding detection performance for #361

f895ae1

feat: added release info for #361

17c54fc

mpkorstanje self-requested a review February 11, 2025 16:21

U117293 added 4 commits February 12, 2025 08:11

fix: corrected Parser razor generation for #361

2e7829b

fix: corrected Parser razor generation (2) for #361

a842703

fix: corrected Parser razor generation (3) for #361

ffb825c

fix: corrected Parser razor generation (4) for #361

3ff3ae7

Merge branch 'refs/heads/main' into gerkinline_optimization

999af84

# Conflicts: # CHANGELOG.md

U117293 added 2 commits February 27, 2025 10:18

fix: improved performance to get the default dialect for #361

6e0de0d

fix: variable reusing and minor rewrite for #361

17f35df

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improved parser performance #372

Improved parser performance #372

jkronegg commented Feb 11, 2025

luke-hill commented Feb 12, 2025 •

edited

Loading

jkronegg commented Feb 12, 2025

luke-hill commented Feb 12, 2025

Improved parser performance #372

Are you sure you want to change the base?

Improved parser performance #372

Conversation

jkronegg commented Feb 11, 2025

🤔 What's changed?

⚡️ What's your motivation?

🏷️ What kind of change is this?

♻️ Anything particular you want feedback on?

📋 Checklist:

luke-hill commented Feb 12, 2025 • edited Loading

jkronegg commented Feb 12, 2025

luke-hill commented Feb 12, 2025

luke-hill commented Feb 12, 2025 •

edited

Loading