perf: performance overhaul #26114

GalacticHypernova · 2024-03-06T17:58:41Z

NOTE:

This is a continuation and a rework of the previous PR that had some unexplainable issues so I decided to rewrite it. I have opened this to rework it more carefully and take care of all the edge cases and random errors. (For more info, please refer to #25771)

NOTE 2:

The 70 docs commits are due to me forgetting to flush the original patch-9 branch when I was done with that one. I usually start these PR's remotely, which means I don't get to name them something unique, and I just got onto the github app to try and re-run the failed tests, which still had the old patch-9 commits. That's my bad. I used the origin's files in every conflict so it ended up just not changing anything (as can be seen in the files changed).

🔗 Linked issue

Related: #25771

❓ Type of change

📖 Documentation (updates to the documentation, readme or JSdoc annotations)
🐞 Bug fix (a non-breaking change that fixes an issue)
👌 Enhancement (improving an existing functionality like performance)
✨ New feature (a non-breaking change that adds functionality)
🧹 Chore (updates to the build process or auxiliary tools and libraries)
⚠️ Breaking change (fix or feature that would cause existing functionality to change)

📚 Description

This is a PR that aims to improve both build time and runtime performance to the max by refactoring everything that comes to mind with performance best practices, some of which include smarter and fewer object/array iterations, reduced indexes, smarter variable usage for constant values, and more.

Some key improvements in this PR:

Refactors of known costly javascript methods:
In some places throughout the app, there are operations that rely on certain javascript methods that are known to be more expensive at certain conditions than manual implementation
(similar to endsWith in perf(vite): rework endsWith to direct index #24746
and startsWith in perf(kit, schema, nuxt): rework startsWith to direct index #24744).
One key change under this category is the usage of Array.prototype.includes.
Array.prototype.includes appears to be slower when checking for existence of a certain input between at the very least 2 elements than directly checking each element with strict equality (like when dealing with strings).
After thorough benchmarking, both browser and node, direct comparisons showed marginally better performance, benchmarks attached below in that same order with the following benchmark:
```
const a = "#text"
console.time("start")
for(let i = 0; i< 1000000; i++) {
  const inc = ['#comment', '#text'].includes(a)
}
console.timeEnd("start")
console.time("stop")
for(let i = 0; i< 1000000; i++) {
  const inc  = a === "#comment" || a === "#text"
}
console.timeEnd("stop")
```
Browser:

Node:
Reduced and reworked iterations
Originally the main change in this PR, many parts in nuxt involve some sort of array/object manipulations, and in some places they are either repeated, redundant, or simply costly in terms of performance. The main examples of such cases are the small side PR's I have submitted like perf(kit): avoid duplicate join operation #24717, perf(nuxt): avoid duplicate iterations over layers #24730, and numerous others. I couldn't get all of it at the same time so I have decided to group the rest here, and also handle the iteration logic as opposed to mere count.
Reduced indexes:
Some places in nuxt include working with indexes, like the length property of array/strings. While that alone isn't expensive, it still makes that repeated index, which especially with bigger inputs (like in HTML parsing or AST traversal) will gradually cost more and more microseconds, and might eventually become slightly noticeable in page performance.
Early returns
Similar to perf: don't manipulate an empty value #25647, there are some places where some iterations are made on potentially empty values, which increase the overhead and therefore decrease performance. This PR adds guards to ensure there won't be iterations over an empty value.
Avoiding amortized operations where possible
Methods like Array.prototype.push have an amortized complexity (O(1) in the case of push), which for the most part would serve most use cases well. But if we want to really fine-tune performance and take care of that amortization, we can remove them in favor of plain complexity if we know the size of the array. In multiple places around the Nuxt source, there are usages of push where the exact array size is known. This PR uses a hybrid approach that refactors said parts with known sizes by using the new Array(size) constructor with direct index modification, which would result in plain O(1) complexity, potentially outperforming the amortized O(1) complexity of push, while keeping push for dynamic arrays, variable sizes, and other parts where the size cannot be accurately obtained without causing increased time complexity (like Object.keys to obtain the size of an object).

📝 Checklist

I have linked an issue or discussion.
I have added tests (if possible).
I have updated the documentation accordingly.

stackblitz · 2024-03-06T17:58:45Z

Run & review this pull request in StackBlitz Codeflow.

GalacticHypernova

Notes to self:

packages/nuxt/src/app/composables/state.ts

packages/nuxt/src/components/module.ts

packages/nuxt/src/core/nitro.ts

…nto patch-9

danielroe · 2024-06-11T13:35:49Z

packages/schema/src/config/experimental.ts

@@ -89,7 +89,12 @@ export default defineUntypedSchema({
      async $resolve (val, get) {
        // TODO: remove in v3.10
        val = val ?? await (get('experimental') as Promise<Record<string, any>>).then((e: Record<string, any>) => e?.inlineSSRStyles)
-        if (val === false || (await get('dev')) || (await get('ssr')) === false || (await get('builder')) === '@nuxt/webpack-builder') {
+        const [dev, ssr, builder] = await Promise.all([


this seems less performant in the most common case (running in dev mode) as previously we short-circuited if any of those values were true.

True, this might be a bit less performant in best case scenario, but it's difficult to make it more performant for the worst case scenario without sacrificing a tiny bit for the rest. Due to it being parallelized, the potential impact in best case is minimized and therefore justifies the potential improvement in worst case.
It's like #24718 , where for more than 1 plugin/middleware it could slow it down a bit due to the additional if check but in other cases it improves it, it's about the average improvement across a wide variety of possible scenarios.
Although I could run a benchmark real quick to see how it behaves, I'll report back with the results.

Alright, the benchmark seems to show that the second approach is much slower. (difference 10 seconds). That was my mistake, I apologize, and thanks for spotting that! Here's the benchmark I used:

async function get(key) { return new Promise((resolve) => { setTimeout(() => { const values = { dev: Math.random() > 0.5, ssr: Math.random() > 0.5, builder: '@nuxt/webpack-builder' }; resolve(values[key]); }, 100); }); } async function snippet1(val) { if (val === false || (await get('dev')) || (await get('ssr')) === false || (await get('builder')) === '@nuxt/webpack-builder') { return true; } return false; } async function snippet2(val) { const [dev, ssr, builder] = await Promise.all([ get('dev'), get('ssr'), get('builder'), ]); if (val === false || dev || ssr === false || builder === '@nuxt/webpack-builder') { return true; } return false; } async function benchmark() { const iterations = 100; let start, end; console.time("e") for (let i = 0; i < iterations; i++) { await snippet1(false); } console.timeEnd("e") console.time("F") for (let i = 0; i < iterations; i++) { await snippet2(false); } console.timeEnd("F") }

Actually, when testing with val = true, I get that the new version is almost twice as fast:

So it's more so about which is more likely to occur. Which approach do you think we should take?
Actually, I have an idea

I pushed an update that combines the best of both worlds. If val is false it will immediately return false before awaiting the Promise.all, as per the results of the benchmark when val is set to false.
Otherwise it awaits promise.all and handles everything else. This actually doesn't impact the best case scenario at all, when dev is true

And it does make sense when I think about it, considering Promise.all parallelizes it, so each promise is handled in a different core, separate from the rest. There is no dependency between each promise so there's nothing to cause slowdown. Of course, as the if conditions progress to the next, the performance difference becomes much bigger (for example, here it is when dev is hardcoded to false)

GalacticHypernova · 2024-06-11T18:54:18Z

Marking as draft to resolve type issues

GalacticHypernova marked this pull request as draft March 6, 2024 17:58

github-actions bot added 3.x performance labels Mar 6, 2024

GalacticHypernova mentioned this pull request Mar 8, 2024

perf: performance overhaul #25771

Draft

9 tasks

GalacticHypernova commented Mar 8, 2024

View reviewed changes

packages/nuxt/src/app/composables/state.ts Outdated Show resolved Hide resolved

GalacticHypernova commented Mar 14, 2024

View reviewed changes

packages/nuxt/src/components/module.ts Show resolved Hide resolved

GalacticHypernova commented Mar 18, 2024

View reviewed changes

packages/nuxt/src/core/nitro.ts Outdated Show resolved Hide resolved

GalacticHypernova and others added 22 commits March 18, 2024 22:19

Merge branch 'main' into patch-9

bb57e0f

Merge branch 'main' into patch-9

aee0182

Merge branch 'main' into patch-9

792d2de

Merge branch 'main' into patch-9

9bc6f88

Merge branch 'main' into patch-9

751208b

perf: start templates.ts

c7f1d27

Merge branch 'main' into patch-9

376b6c1

perf: more templates.ts

1a78232

fix: missing curly bracket

83921bd

[autofix.ci] apply automated fixes

272f764

perf: more templates.ts

1f8d8c1

fix: parenthesis

d3cf3ba

[autofix.ci] apply automated fixes

0eb9f47

perf: more templates.ts

323204f

[autofix.ci] apply automated fixes

501ab79

perf: more templates.ts

a30c85b

[autofix.ci] apply automated fixes

de25ed5

chore: retrying failed test

5defbec

perf: more templates.ts

b30971a

[autofix.ci] apply automated fixes

a46e617

perf: more templates.ts

60cad85

Merge branch 'main' into patch-9

4e3a05d

GalacticHypernova and others added 12 commits June 8, 2024 15:19

Merge branch 'main' into patch-9

1c1991e

perf: avoid an IIFE in favor of promise chaining

2476bf4

[autofix.ci] apply automated fixes

51e3d6e

perf: combine for loops, less if nesting

8bee366

[autofix.ci] apply automated fixes

a494edc

perf: promise.all client bundle and server entry in SSR renderer

6604f08

[autofix.ci] apply automated fixes

43baf1f

perf: another Promise.all

4648f6f

[autofix.ci] apply automated fixes

d2f7c5c

Merge branch 'main' into patch-9

53a9cf8

perf: promise.all

2323645

perf: Promise.all

c9e60c3

GalacticHypernova marked this pull request as ready for review June 9, 2024 04:58

GalacticHypernova and others added 8 commits June 9, 2024 15:54

perf: remove unnecessary iteration

c6ea7c9

[autofix.ci] apply automated fixes

b46a191

perf: promise.all

7c633bd

[autofix.ci] apply automated fixes

e16a56b

perf: more promise.all

532b612

Merge branch 'patch-9' of https://github.com/GalacticHypernova/nuxt i…

6ccef7a

…nto patch-9

fix: remove parenthesis

e59bde2

Merge branch 'main' into patch-9

ae68f80

danielroe reviewed Jun 11, 2024

View reviewed changes

GalacticHypernova and others added 6 commits June 11, 2024 16:58

chore: revert experimental Promise.all

aacc9ab

[autofix.ci] apply automated fixes

a292947

perf: refactor experimental Promise.all

5c753fb

Merge branch 'main' into patch-9

ceaf97c

chore: remove unused ts-expect-error

25682b6

chore: remove ts-expect-error

7f228c6

GalacticHypernova marked this pull request as draft June 11, 2024 18:54

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

perf: performance overhaul #26114

perf: performance overhaul #26114

GalacticHypernova commented Mar 6, 2024 •

edited

stackblitz bot commented Mar 6, 2024

GalacticHypernova left a comment

danielroe Jun 11, 2024

GalacticHypernova Jun 11, 2024 •

edited

GalacticHypernova Jun 11, 2024 •

edited

GalacticHypernova Jun 11, 2024 •

edited

GalacticHypernova Jun 11, 2024

GalacticHypernova commented Jun 11, 2024

perf: performance overhaul #26114

Are you sure you want to change the base?

perf: performance overhaul #26114

Conversation

GalacticHypernova commented Mar 6, 2024 • edited

NOTE:

NOTE 2:

🔗 Linked issue

❓ Type of change

📚 Description

📝 Checklist

stackblitz bot commented Mar 6, 2024

GalacticHypernova left a comment

Choose a reason for hiding this comment

danielroe Jun 11, 2024

Choose a reason for hiding this comment

GalacticHypernova Jun 11, 2024 • edited

Choose a reason for hiding this comment

GalacticHypernova Jun 11, 2024 • edited

Choose a reason for hiding this comment

GalacticHypernova Jun 11, 2024 • edited

Choose a reason for hiding this comment

GalacticHypernova Jun 11, 2024

Choose a reason for hiding this comment

GalacticHypernova commented Jun 11, 2024

GalacticHypernova commented Mar 6, 2024 •

edited

GalacticHypernova Jun 11, 2024 •

edited

GalacticHypernova Jun 11, 2024 •

edited

GalacticHypernova Jun 11, 2024 •

edited