perf: faster handling of static paths #2148

skirtles-code · 2024-02-20T20:44:48Z

Background

Original issue: #2132. In short, the performance of adding a large number of routes is slow, and is particularly problematic with SSR.

This PR attempts some performance improvements. While the changes are currently unpolished and experimental, I believe these changes are 'realistic', in the sense that they could be tidied up and included without making breaking changes.

These changes are separate from the caching and binary search ideas that have already been suggested. Each strategy has its pros and cons and they each target slightly different use cases.

My priority was just runtime performance, not clean code, maintainability, or bundle size. Balancing those other concerns can come later. I'm currently looking to share and discuss the main ideas, rather than worrying about the minutiae of the implementation.

Overview of the changes

I started coming at this from a slightly different angle, investigating the performance of route resolution rather than addition. I figured that iterating through the array of routes and testing each one against the path might be slow when the number of routes is large. On pages with a lot of links this could happen many times.

After some profiling, I had two main ideas for how to speed this up:

Testing using a RegExp might not be the most efficient approach, especially for routes that are effectively static (i.e. no params).
Route paths often begin with a static section. e.g. The Nuxt i18n module prefixes route paths with the locale, e.g. /en/foo. If we're trying to match /de/bar then we can ignore all the paths beginning with /en without needing to check each one individually.

Roughly speaking, these are the two changes that are implemented here. There is some overlap between them, as they both require some initial analysis of the route path to establish which portions are static. Thankfully, most of that work is already done by tokenizePath(), and that step is already very fast.

While these changes were motivated by investigating route resolution, they also significantly improve the performance of route addition.

The changes in detail

The matcher tree

The idea here is inspired by Eduardo's suggestion here to use a trie. Whether this is actually what he had in mind I'm not sure, but I think it's a similar concept.

Paths often start with a static section. e.g. in /en/products/:id, the /en/products portion is static. This allows us to create a tree structure to store the route, with a node for en and a child node for products.

When we want to match a path, say /en/products/1234, we can walk that tree to find candidate routes. There may still be multiple routes that need to be checked, but we can immediately skip routes in different parts of the tree, such as those that begin with /de, or those beginning /en/users.

This also speeds up the route creation process. Routes no longer need inserting into a massive array. Instead, each node of the tree holds a much smaller array of routes, so the insertion point can be found much quicker.

There are various complications, such as case sensitivity. I'm currently assuming that calling toUpperCase() is sufficient to achieve matching equivalent to the i flag of a RegExp. This is based on something I read here, but it would need further verification to confirm it gives consistent casing.

While this approach does seem to be really fast, it does lead to a lot of extra garbage collection. This can lead to spikes in the performance when the garbage collector kicks in. This is understandable, as the data structures involved are much more complicated. It needs further investigation to understand whether this can be avoided or if it's really a problem worth worrying about.

Further work may be needed on the getRoutes() method. This can no longer simply return the matchers array, as it doesn't exist. I currently have it building a new array each time it's called. It may need some form of caching.

Exact path matching

For routes that are entirely static, i.e. that have no params, it's possible to bypass the tree mentioned above and just store them directly in an object as a quick lookup. Case sensitivity still needs taking into account, but it's still really fast for both adding routes and resolving routes. Keeping these static routes out of the tree can greatly reduce the size of the tree.

Parsing static paths

tokenizePath() parses the route path into tokens. It's already really fast, so I've left that alone.

Those tokens are then passed to tokensToParser(), which does a lot of stuff. For entirely static paths, without any params, I believe this is doing a lot of unnecessary work.

I've introduced staticPathToParser(), which creates a simpler implementation of PathParser. This is only used in cases where the path is entirely static. It skips most of the work. For example, rather than using a RegExp to check the path it uses a === check instead (also taking account of the sensitive and strict options as required). This not only makes the addition of routes faster, it also makes the parser itself more efficient.

Further work

All of these changes assume a static prefix to the path. If an application has a dynamic section at the start of all its paths then these changes won't help at all. There needs to be a reasonable spread within the tree for there to be any benefit. In those problematic cases, the binary search proposal would still be beneficial, as would the caching proposal.

This is nowhere near an exhaustive list of things to do, but:

Investigate the GC problems.
Write a lot more tests to confirm this actually handles all the edge cases correctly.
Establish more concretely how much each of these changes helps performance. I was checking performance all the way through writing them, so I'm confident they do help, but I don't have concrete numbers to share.
Evaluate other trade-offs, such as maintainability and bundle size. The code will need polishing up, including handling all edge cases, before this can be properly evaluated.

There are several places, e.g. pathParserRanker.ts, where I just hacked in whatever changes I needed, without much regard for anything else. I'll worry about that once the bigger picture becomes a bit clearer.

Update 1: I've now written a lot more tests and there were problems in some edge cases. Most of these problems were with end: false, but the handling of the sensitive option wasn't quite correct either. I've just pushed fixes for most of those problems. The remaining (known) problems are just with end: false, and only occur in cases where it has the same score as another route. As noted in #2153, I think that would best be addressed by including end in the score calculation.

Update 2: I've merged the latest main, which bring this branch into line with Vue Router 4.4.3. This includes integrating the binary search changes from #2137. Those changes are beneficial when the matcher tree isn't balanced, e.g. when most routes have the same (or no) static prefix. In that scenario the tree essentially acts like a single array, like the matchers array on main.

netlify · 2024-02-20T20:45:04Z

✅ Deploy Preview for vue-router canceled.

Name	Link
🔨 Latest commit	`454ecf8`
🔍 Latest deploy log	https://app.netlify.com/sites/vue-router/deploys/66cd16a46b0f3000087ae4c1

negezor · 2024-08-04T18:27:15Z

Hello. A lot of work has been done here, I appreciate it, and I am very interested in merging this PR. What issues still need to be resolved for it to be accepted?

I have tested this PR in my production project, which has about 50 routes, including 8 aliases/redirects. I conducted profiling on Slowdown 6x to simulate low-end devices, as otherwise, the AMD Ryzen 7950x3D handles it too well 😅

Current:

This PR:

This PR with this change skirtles-code#1:

This PR has almost completely eliminated the overhead on resolve. In my project, there are a lot of links, and their resolve is very noticeable in the profiler. An initial router.push() taking 20ms is not what we want to see in our applications 😅

skirtles-code · 2024-08-05T13:45:09Z

@negezor Nice to hear that this helps in a real project.

There are lots of problems with the code in this PR, several of which I've noted in the PR description. Not least, it significantly increases the bundle size. We'll likely need to have the pluggable matcher (#2166) implemented first, as that'll allow these changes to be implemented in a separate, treeshakable matcher. Last I heard, Eduardo was still interested in pursuing the pluggable matcher idea, #2132 (comment).

@posva, if there's anything I can do to move that forwards then please let me know.

posva · 2024-08-06T06:34:50Z

@skirtles-code Thanks for the work on this PR, it's great. The change in size is indeed a bit of a deal breaker right now because I started working on an internal refactoring of the matchers (as you noted) that would also allow to have a better performance for statics paths but also introduce other new features like:

Compile time route parsing
Query Params
Custom Param serialization/parsing

I had to put that on pause to advance on other topics like data loaders and pinia colada but I plan on coming back to these after that. Hopefully by September or October.

I don't think you need to do anything right now. The next step for the new-matcher branch was refactoring the existing code to use the new interfaces but that might be too much to ask in its current state. I think that once it's implemented, I will need help to test it more, document it, and maybe even release new matchers!

szwenni · 2024-08-10T10:28:43Z

I actually (even if not recommended) use this branch in production right now
I did not measure it exactly but in combination with nuxt and i18n the whole application is now really really speed up and I did not face any issue till now.
We have around 1500 different routes multiplied by 40 languages. The current matching creation and resolving behavior is not really working with this amount of routes. Especially when the application is under pressure (e.g. google is traversing the site) the load and memory usage is piling up and node has no time anymore to cleanup (this is for the Serverside of nuxt).

# Conflicts: # packages/router/src/matcher/index.ts

codecov · 2024-08-27T00:00:42Z

Codecov Report

Attention: Patch coverage is 89.77556% with 41 lines in your changes missing coverage. Please review.

Project coverage is 96.88%. Comparing base (2f48746) to head (454ecf8).

Files	Patch %	Lines
packages/router/src/matcher/matcherTree.ts	89.36%	30 Missing ⚠️
packages/router/src/matcher/staticPathParser.ts	87.32%	9 Missing ⚠️
packages/router/src/matcher/pathMatcher.ts	93.75%	2 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #2148      +/-   ##
==========================================
- Coverage   97.32%   96.88%   -0.45%     
==========================================
  Files          26       28       +2     
  Lines        6013     6317     +304     
  Branches      819      881      +62     
==========================================
+ Hits         5852     6120     +268     
- Misses        159      195      +36     
  Partials        2        2

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

skirtles-code · 2024-08-27T00:26:07Z

I've just updated this branch to the latest main (4.4.3). That includes integrating the binary search changes from #2137. Those likely won't make much difference to projects with large numbers of static routes, but it should help in other edge cases where the matcher tree isn't effective.

skirtles-code · 2024-09-04T14:48:33Z

I've released a fork of Vue Router that includes these performance improvements:

https://www.npmjs.com/package/@skirtle/vue-router-perf

I'm aware that most people are unwilling to use an unofficial fork in their applications, but the option is now there for those who need it.

It also makes it easier to run performance comparisons.

perf: faster handling of static paths

d9607ab

skirtles-code force-pushed the static-path-perf branch from 510f66f to d9607ab Compare February 20, 2024 20:48

This was referenced Feb 20, 2024

Router creation performance degradation when using many routes in comparison to v3 #2132

Closed

perf: use a binary search for insertMatcher #2137

Merged

skirtles-code added 2 commits March 4, 2024 10:49

Fix problems combining end: false and strict: false

07e86a9

Fix problems with resolving paths for sensitive and end options

987ea09

skirtles-code mentioned this pull request Mar 7, 2024

feat: pluggable matcher #2166

Closed

szwenni mentioned this pull request Aug 10, 2024

Memory Leak when using top-level await before defining computed translations nuxt-modules/i18n#2629

Closed

skirtles-code added 2 commits August 26, 2024 21:12

Merge branch 'main' into static-path-perf

aca96a5

# Conflicts: # packages/router/src/matcher/index.ts

Fix problems caused by merging 'main'

454ecf8

skirtles-code mentioned this pull request Sep 4, 2024

perf: faster handling of static paths skirtles-code/vue-router#2

Merged

skirtles-code mentioned this pull request Sep 5, 2024

Performance degradation when using many routes in comparison to Nuxt 2 nuxt/nuxt#25612

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

perf: faster handling of static paths #2148

perf: faster handling of static paths #2148

Uh oh!

skirtles-code commented Feb 20, 2024 •

edited

Loading

Uh oh!

netlify bot commented Feb 20, 2024 •

edited

Loading

Uh oh!

negezor commented Aug 4, 2024

Uh oh!

skirtles-code commented Aug 5, 2024

Uh oh!

posva commented Aug 6, 2024

Uh oh!

szwenni commented Aug 10, 2024

Uh oh!

codecov bot commented Aug 27, 2024

Uh oh!

skirtles-code commented Aug 27, 2024

Uh oh!

skirtles-code commented Sep 4, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

perf: faster handling of static paths #2148

Are you sure you want to change the base?

perf: faster handling of static paths #2148

Uh oh!

Conversation

skirtles-code commented Feb 20, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Background

Overview of the changes

The changes in detail

The matcher tree

Exact path matching

Parsing static paths

Further work

Uh oh!

netlify bot commented Feb 20, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

✅ Deploy Preview for vue-router canceled.

Uh oh!

negezor commented Aug 4, 2024

Uh oh!

skirtles-code commented Aug 5, 2024

Uh oh!

posva commented Aug 6, 2024

Uh oh!

szwenni commented Aug 10, 2024

Uh oh!

codecov bot commented Aug 27, 2024

Codecov Report

Uh oh!

skirtles-code commented Aug 27, 2024

Uh oh!

skirtles-code commented Sep 4, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

skirtles-code commented Feb 20, 2024 •

edited

Loading

netlify bot commented Feb 20, 2024 •

edited

Loading