refactor: Speed up Base58 encoding/decoding by 400%/200% via preliminary byte packing #29473

paplorinc · 2024-02-24T12:52:07Z

Continuing the base conversion optimizations, started in TryParseHex and IsSpace and Bech32.

To mitigate the quadratic complexity inherent in the sequential byte division of the Base58 encoding process, this update aggregates characters into groups of 7 and 9, fitting them into 64-bit long integers.
This refinement utilizes the inherent efficiency of native division and modulo operations on 64-bit architectures, significantly optimizing computational overhead.
The benchmarks indicate a 4x speedup for Base58CheckEncode and Base58Encode:

make && ./src/bench/bench_bitcoin --filter=Base58Encode --min-time=1000

Before:

|             ns/byte |              byte/s |    err% |     total | benchmark
|--------------------:|--------------------:|--------:|----------:|:----------
|               59.27 |       16,872,442.54 |    0.2% |      1.10 | `Base58Encode`
|               59.09 |       16,923,588.35 |    0.1% |      1.10 | `Base58Encode`
|               59.15 |       16,906,104.83 |    0.3% |      1.11 | `Base58Encode`

After:

|             ns/byte |              byte/s |    err% |     total | benchmark
|--------------------:|--------------------:|--------:|----------:|:----------
|               14.32 |       69,831,031.44 |    0.1% |      1.10 | `Base58Encode`
|               14.32 |       69,811,995.18 |    0.1% |      1.10 | `Base58Encode`
|               14.35 |       69,662,527.88 |    0.5% |      1.10 | `Base58Encode`

make && ./src/bench/bench_bitcoin --filter=Base58CheckEncode --min-time=1000

Before:

|             ns/byte |              byte/s |    err% |     total | benchmark
|--------------------:|--------------------:|--------:|----------:|:----------
|               79.93 |       12,511,576.75 |    0.4% |      1.10 | `Base58CheckEncode`
|               79.49 |       12,580,136.21 |    0.1% |      1.10 | `Base58CheckEncode`
|               79.65 |       12,554,785.16 |    0.1% |      1.10 | `Base58CheckEncode`

After:

|             ns/byte |              byte/s |    err% |     total | benchmark
|--------------------:|--------------------:|--------:|----------:|:----------
|               19.15 |       52,231,968.11 |    0.1% |      1.06 | `Base58CheckEncode`
|               19.13 |       52,269,345.54 |    0.4% |      1.05 | `Base58CheckEncode`
|               19.14 |       52,244,117.61 |    0.3% |      1.06 | `Base58CheckEncode`

The same was applied to decoding, resulting in a 2x speedup:

make && ./src/bench/bench_bitcoin --filter=Base58Decode --min-time=1000

Before:

|             ns/byte |              byte/s |    err% |     total | benchmark
|--------------------:|--------------------:|--------:|----------:|:----------
|               17.50 |       57,141,622.65 |    0.1% |      1.10 | `Base58Decode`
|               17.42 |       57,392,223.96 |    0.0% |      1.10 | `Base58Decode`
|               17.43 |       57,358,655.44 |    0.2% |      1.10 | `Base58Decode`

After:

|             ns/byte |              byte/s |    err% |     total | benchmark
|--------------------:|--------------------:|--------:|----------:|:----------
|                8.79 |      113,767,685.06 |    0.1% |      1.10 | `Base58Decode`
|                8.78 |      113,831,528.33 |    0.0% |      1.10 | `Base58Decode`
|                8.80 |      113,607,470.35 |    0.2% |      1.10 | `Base58Decode`

See https://corecheck.dev/bitcoin/bitcoin/pulls/29458 for all benchmarks.

DrahtBot · 2024-02-24T12:52:09Z

The following sections might be updated with supplementary metadata relevant to reviewers and maintainers.

Code Coverage

For detailed information about the code coverage, see the test coverage report.

Reviews

See the guideline for information on the review process.
A summary of reviews will appear here.

Conflicts

Reviewers, this pull request conflicts with the following ones:

#27260 (Enhanced error messages for invalid network prefix during address parsing. by russeree)

If you consider this pull request important, please also help to review the conflicting pull requests. Ideally, start with the one that should be merged first.

src/base58.cpp

DrahtBot · 2024-02-24T18:12:40Z

🚧 At least one of the CI tasks failed. Make sure to run all tests locally, according to the
documentation.

Possibly this is due to a silent merge conflict (the changes in this pull request being
incompatible with the current code in the target branch). If so, make sure to rebase on the latest
commit of the target branch.

Leave a comment here, if you need help tracking down a confusing failure.

_{Debug: https://github.com/bitcoin/bitcoin/runs/21941361780}

src/base58.cpp

DrahtBot · 2024-02-25T15:24:39Z

🚧 At least one of the CI tasks failed. Make sure to run all tests locally, according to the
documentation.

Possibly this is due to a silent merge conflict (the changes in this pull request being
incompatible with the current code in the target branch). If so, make sure to rebase on the latest
commit of the target branch.

Leave a comment here, if you need help tracking down a confusing failure.

_{Debug: https://github.com/bitcoin/bitcoin/runs/21955441135}

maflcko · 2024-03-15T12:11:44Z

src/test/data/base58_encode_decode.json

@@ -11,6 +11,13 @@
 ["ecac89cad93923c02321", "EJDM8drfXA6uyA"],
 ["10c8511e", "Rt5zm"],
 ["00000000000000000000", "1111111111"],
+["00000000000000000000000000000000000000000000000000000000000000000000000000000000", "1111111111111111111111111111111111111111"],
+["00000000000000000000000000000000000000000000000000000000000000000000000000000001", "1111111111111111111111111111111111111112"],
+["0000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000ec39d04c37e71e5d591881f6", "111111111111111111111111111111111111111111111111111111111111111111111111111111111111115TYzLYH1udmLdzCLM"],


Seems fine to add tests, but I don't think this is used in a critical path, to warrant a speedup?

I've added the random test cases that failed during development to make sure they're covered deterministically.
Also added ones which are testing the boundaries of powers of 58, since those cases happen less frequently in the randomized tests.

Yes, I meant you can open a new pull request with the added tests, if you want.

Instead of collecting the values to and empty b58, we're cloning the input without leading zeros and dividing it in-place, while doing the quadratic division from the most significant, non-zero digit, forward. make && ./src/bench/bench_bitcoin --filter=Base58Encode --min-time=1000 Before: ``` | ns/byte | byte/s | err% | total | benchmark |--------------------:|--------------------:|--------:|----------:|:---------- | 59.34 | 16,851,250.88 | 0.2% | 1.10 | `Base58Encode` | 59.20 | 16,892,479.61 | 0.2% | 1.10 | `Base58Encode` | 58.97 | 16,958,226.76 | 0.2% | 1.10 | `Base58Encode` ``` After: ``` | ns/byte | byte/s | err% | total | benchmark |--------------------:|--------------------:|--------:|----------:|:---------- | 40.80 | 24,508,402.06 | 0.1% | 1.10 | `Base58Encode` | 40.83 | 24,489,762.49 | 0.2% | 1.10 | `Base58Encode` | 39.71 | 25,182,310.62 | 0.2% | 1.10 | `Base58Encode` ``` and make && ./src/bench/bench_bitcoin --filter=Base58CheckEncode --min-time=1000 Before: ``` | ns/byte | byte/s | err% | total | benchmark |--------------------:|--------------------:|--------:|----------:|:---------- | 79.93 | 12,511,576.75 | 0.4% | 1.10 | `Base58CheckEncode` | 79.49 | 12,580,136.21 | 0.1% | 1.10 | `Base58CheckEncode` | 79.65 | 12,554,785.16 | 0.1% | 1.10 | `Base58CheckEncode` ``` After: ``` | ns/byte | byte/s | err% | total | benchmark |--------------------:|--------------------:|--------:|----------:|:---------- | 55.27 | 18,094,561.35 | 0.1% | 1.10 | `Base58CheckEncode` | 55.41 | 18,046,778.32 | 0.1% | 1.10 | `Base58CheckEncode` | 55.32 | 18,075,763.16 | 0.1% | 1.10 | `Base58CheckEncode` ```

To mitigate the quadratic complexity inherent in the sequential byte division of the Base58 encoding process, this update aggregates characters into groups of 7, fitting them into 64-bit long integers. This refinement utilizes the inherent efficiency of native division and modulo operations on 64-bit architectures, significantly optimizing computational overhead. The benchmarks indicate a 4x speedup for `Base58CheckEncode` and `Base58Encode`: make && ./src/bench/bench_bitcoin --filter=Base58Encode --min-time=1000 After: ``` | ns/byte | byte/s | err% | total | benchmark |--------------------:|--------------------:|--------:|----------:|:---------- | 14.32 | 69,831,031.44 | 0.1% | 1.10 | `Base58Encode` | 14.32 | 69,811,995.18 | 0.1% | 1.10 | `Base58Encode` | 14.35 | 69,662,527.88 | 0.5% | 1.10 | `Base58Encode` ``` make && ./src/bench/bench_bitcoin --filter=Base58CheckEncode --min-time=1000 After: ``` | ns/byte | byte/s | err% | total | benchmark |--------------------:|--------------------:|--------:|----------:|:---------- | 19.15 | 52,231,968.11 | 0.1% | 1.06 | `Base58CheckEncode` | 19.13 | 52,269,345.54 | 0.4% | 1.05 | `Base58CheckEncode` | 19.14 | 52,244,117.61 | 0.3% | 1.06 | `Base58CheckEncode` ```

> make && ./src/bench/bench_bitcoin --filter=Base58Decode --min-time=1000 Before: ``` | ns/byte | byte/s | err% | total | benchmark |--------------------:|--------------------:|--------:|----------:|:---------- | 17.50 | 57,141,622.65 | 0.1% | 1.10 | `Base58Decode` | 17.42 | 57,392,223.96 | 0.0% | 1.10 | `Base58Decode` | 17.43 | 57,358,655.44 | 0.2% | 1.10 | `Base58Decode` ``` After: ``` | ns/byte | byte/s | err% | total | benchmark |--------------------:|--------------------:|--------:|----------:|:---------- | 16.37 | 61,093,842.90 | 0.2% | 1.10 | `Base58Decode` | 16.37 | 61,100,514.64 | 0.1% | 1.10 | `Base58Decode` | 16.38 | 61,046,275.93 | 0.1% | 1.10 | `Base58Decode` ```

> make && ./src/bench/bench_bitcoin --filter=Base58Decode --min-time=1000 After: ``` | ns/byte | byte/s | err% | total | benchmark |--------------------:|--------------------:|--------:|----------:|:---------- | 8.79 | 113,767,685.06 | 0.1% | 1.10 | `Base58Decode` | 8.78 | 113,831,528.33 | 0.0% | 1.10 | `Base58Decode` | 8.80 | 113,607,470.35 | 0.2% | 1.10 | `Base58Decode` ```

paplorinc changed the title ~~WIP optimization: Unify size estimations in Base58 encoding/decoding~~ WIP optimization: Enhance Base58 conversion through preliminary Base56 conversion Feb 24, 2024

paplorinc changed the title ~~WIP optimization: Enhance Base58 conversion through preliminary Base56 conversion~~ WIP optimization: Enhance Base58 conversion through intermediary Base56 conversion Feb 24, 2024

paplorinc commented Feb 24, 2024

View reviewed changes

src/base58.cpp Outdated Show resolved Hide resolved

paplorinc changed the title ~~WIP optimization: Enhance Base58 conversion through intermediary Base56 conversion~~ optimization: Enhance Base58 conversion through intermediary Base56 conversion by 300% Feb 24, 2024

paplorinc changed the title ~~optimization: Enhance Base58 conversion through intermediary Base56 conversion by 300%~~ optimization: Speed up Base58 conversion through intermediary Base56 conversion by 300% Feb 24, 2024

paplorinc marked this pull request as ready for review February 24, 2024 18:02

paplorinc force-pushed the paplorinc/optimize-base58-conversion branch from afbc980 to 321456b Compare February 24, 2024 18:12

DrahtBot added the CI failed label Feb 24, 2024

paplorinc force-pushed the paplorinc/optimize-base58-conversion branch 3 times, most recently from 397ae64 to e168e62 Compare February 24, 2024 21:36

optout21 reviewed Feb 25, 2024

View reviewed changes

src/base58.cpp Outdated Show resolved Hide resolved

paplorinc marked this pull request as draft February 25, 2024 06:37

paplorinc force-pushed the paplorinc/optimize-base58-conversion branch 2 times, most recently from cab8a6e to b930cd5 Compare February 25, 2024 07:06

DrahtBot removed the CI failed label Feb 25, 2024

paplorinc force-pushed the paplorinc/optimize-base58-conversion branch 2 times, most recently from ba23707 to c9b351d Compare February 25, 2024 15:24

DrahtBot added the CI failed label Feb 25, 2024

DrahtBot removed the CI failed label Feb 25, 2024

paplorinc requested review from optout21 and Empact February 25, 2024 16:41

paplorinc marked this pull request as ready for review February 25, 2024 16:42

paplorinc force-pushed the paplorinc/optimize-base58-conversion branch from c9b351d to 83caaf7 Compare February 26, 2024 21:25

paplorinc changed the title ~~optimization: Speed up Base58 conversion through intermediary Base56 conversion by 300%~~ optimization: Speed up Base58 conversion through intermediary Base56 conversion by 400% Feb 26, 2024

paplorinc force-pushed the paplorinc/optimize-base58-conversion branch from 83caaf7 to 73845ec Compare February 26, 2024 21:41

Empact mentioned this pull request Feb 26, 2024

build: Enable -Wshadow #29485

Closed

paplorinc force-pushed the paplorinc/optimize-base58-conversion branch 4 times, most recently from 6b684c4 to bc303f3 Compare March 7, 2024 14:20

paplorinc marked this pull request as ready for review March 7, 2024 16:24

DrahtBot removed the CI failed label Mar 7, 2024

paplorinc force-pushed the paplorinc/optimize-base58-conversion branch 2 times, most recently from 3c5435b to 947ea98 Compare March 7, 2024 21:47

DrahtBot mentioned this pull request Mar 8, 2024

kernel: Streamline util library #29015

Open

paplorinc changed the title ~~optimization: Speed up Base58 encoding/decoding by 400%/200% via preliminary byte packing~~ refactor: Speed up Base58 encoding/decoding by 400%/200% via preliminary byte packing Mar 8, 2024

DrahtBot added the Refactoring label Mar 8, 2024

paplorinc force-pushed the paplorinc/optimize-base58-conversion branch from 947ea98 to c2df10b Compare March 8, 2024 19:03

DrahtBot mentioned this pull request Mar 9, 2024

Enhanced error messages for invalid network prefix during address parsing. #27260

Open

paplorinc force-pushed the paplorinc/optimize-base58-conversion branch 2 times, most recently from 5789342 to ffb81c9 Compare March 9, 2024 13:43

paplorinc mentioned this pull request Mar 9, 2024

refactor: Reduce memory copying operations in bech32 encoding/decoding #29607

Open

maflcko reviewed Mar 15, 2024

View reviewed changes

paplorinc mentioned this pull request Apr 11, 2024

test: add a few more base32/64 calculation corner cases #29847

Open

paplorinc mentioned this pull request May 3, 2024

test: Add a few more corner cases to the base58 test suite #30035

Open

paplorinc force-pushed the paplorinc/optimize-base58-conversion branch 2 times, most recently from 7a3f66a to 28c6717 Compare May 8, 2024 11:35

paplorinc marked this pull request as draft May 8, 2024 11:35

paplorinc marked this pull request as ready for review May 8, 2024 20:21

paplorinc added 6 commits May 11, 2024 21:59

Unify size estimations in Base58 encoding/decoding

73577d1

Remove redundant zero-skipping logic in Base58 encode/decode

f8c00ce

paplorinc force-pushed the paplorinc/optimize-base58-conversion branch from 28c6717 to ca21f67 Compare May 11, 2024 19:59

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

refactor: Speed up Base58 encoding/decoding by 400%/200% via preliminary byte packing #29473

refactor: Speed up Base58 encoding/decoding by 400%/200% via preliminary byte packing #29473

paplorinc commented Feb 24, 2024 •

edited

DrahtBot commented Feb 24, 2024 •

edited

DrahtBot commented Feb 24, 2024

DrahtBot commented Feb 25, 2024

maflcko Mar 15, 2024

paplorinc Mar 15, 2024

maflcko Mar 15, 2024

refactor: Speed up Base58 encoding/decoding by 400%/200% via preliminary byte packing #29473

Are you sure you want to change the base?

refactor: Speed up Base58 encoding/decoding by 400%/200% via preliminary byte packing #29473

Conversation

paplorinc commented Feb 24, 2024 • edited

DrahtBot commented Feb 24, 2024 • edited

Code Coverage

Reviews

Conflicts

DrahtBot commented Feb 24, 2024

DrahtBot commented Feb 25, 2024

maflcko Mar 15, 2024

Choose a reason for hiding this comment

paplorinc Mar 15, 2024

Choose a reason for hiding this comment

maflcko Mar 15, 2024

Choose a reason for hiding this comment

paplorinc commented Feb 24, 2024 •

edited

DrahtBot commented Feb 24, 2024 •

edited