
Conversation

@matt-o-how (Contributor) commented Sep 25, 2025

This PR adds a costed and cached sha256tree operator which accounts for cost as if it were not caching, so that further improvements can be made in the future without breaking consensus.

Attached are the benchmarked performance graphs for the base cost of a call, the cost per pair, and the cost per 32-byte chunk.

[Plots: sha256tree-per-pair, sha256tree-per-byte, sha256tree-base]
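As background, a minimal sketch of the "costed as if it isn't caching" idea: shared subtrees are hashed once, but every occurrence is charged the cost an uncached traversal would pay, so the consensus-visible cost never depends on the cache. The `Node` type, `tree_hash`, and both constants below are illustrative, not this PR's implementation, and the `sha2` crate is assumed.

```rust
use sha2::{Digest, Sha256};
use std::collections::HashMap;
use std::rc::Rc;

// Illustrative constants only; not the consensus values this PR sets.
const COST_PER_NODE: u64 = 540;
const COST_PER_BYTES32: u64 = 64; // per 32-byte chunk fed to SHA-256

enum Node {
    Atom(Vec<u8>),
    Pair(Rc<Node>, Rc<Node>),
}

// Hash a CLVM-style tree, memoizing on subtree identity (the Rc pointer,
// so structurally equal but distinct subtrees are not shared -- fine for
// a sketch). On a cache hit the stored hash is reused, but the cost an
// uncached traversal would have accrued is still charged, so caching can
// never change the consensus-visible cost.
fn tree_hash(
    node: &Rc<Node>,
    cache: &mut HashMap<*const Node, ([u8; 32], u64)>,
    total_cost: &mut u64,
) -> [u8; 32] {
    let key = Rc::as_ptr(node);
    if let Some(&(hash, uncached_cost)) = cache.get(&key) {
        *total_cost += uncached_cost; // pay as if we had recursed
        return hash;
    }
    let before = *total_cost;
    let hash: [u8; 32] = match node.as_ref() {
        Node::Atom(bytes) => {
            // 1-byte domain prefix plus the atom, charged per 32-byte chunk
            let chunks = (bytes.len() as u64 + 1).div_ceil(32);
            *total_cost += COST_PER_NODE + chunks * COST_PER_BYTES32;
            let mut h = Sha256::new();
            h.update([1u8]);
            h.update(bytes);
            h.finalize().into()
        }
        Node::Pair(left, right) => {
            *total_cost += COST_PER_NODE;
            let l = tree_hash(left, cache, total_cost);
            let r = tree_hash(right, cache, total_cost);
            // prefix + two 32-byte child hashes = 65 bytes = 3 chunks
            *total_cost += 3 * COST_PER_BYTES32;
            let mut h = Sha256::new();
            h.update([2u8]);
            h.update(l);
            h.update(r);
            h.finalize().into()
        }
    };
    // Record what this subtree cost so later cache hits can replay it.
    cache.insert(key, (hash, *total_cost - before));
    hash
}

fn main() {
    let leaf = Rc::new(Node::Atom(b"hello".to_vec()));
    // The same subtree appears twice: it is hashed once, charged twice.
    let root = Rc::new(Node::Pair(leaf.clone(), leaf));
    let (mut cache, mut cost) = (HashMap::new(), 0u64);
    let h = tree_hash(&root, &mut cache, &mut cost);
    println!("hash = {:02x?}, cost = {cost}", h);
}
```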

@coveralls-official (bot) commented Sep 25, 2025

Pull Request Test Coverage Report for Build 20377346286

Warning: This coverage report may be inaccurate.

This pull request's base commit is no longer the HEAD commit of its target branch. This means it includes changes from outside the original pull request, including, potentially, unrelated coverage changes.

Details

  • 86 of 449 (19.15%) changed or added relevant lines in 6 files are covered.
  • No unchanged relevant lines lost coverage.
  • Overall coverage decreased (-3.1%) to 87.111%

Changes Missing Coverage:

File                                   Covered Lines   Changed/Added Lines   %
src/chia_dialect.rs                    0               1                     0.0%
src/test_ops.rs                        0               1                     0.0%
src/treehash.rs                        81              109                   74.31%
tools/src/bin/sha256tree-benching.rs   0               333                   0.0%

Totals
Change from base Build 18973962912: -3.1%
Covered Lines: 6380
Relevant Lines: 7324

💛 - Coveralls

@matt-o-how marked this pull request as ready for review October 27, 2025 15:49
@arvidn (Contributor) left a comment

There is a tool that can be extended to establish a reasonable cost for new operators as well; we need some kind of benchmark to set the cost.

@arvidn (Contributor) left a comment

Do you feel confident that the cost benchmarks are good? Specifically the cost per byte, the cost per pair, and the cost per atom?

Contributor left a comment

My understanding is that all benchmarking for establishing the cost model for sha256tree is done by tools/src/bin/sha256tree-benching.rs. It seems it would make sense to leave this file unchanged now.

@matt-o-how (Contributor, Author) replied

I've removed all changes except the comments that were added to the flags.

// native
let start = Instant::now();
let red = run_program(a, &dialect, call, a.nil(), 11_000_000_000).unwrap();
let cost = red.0;
Contributor left a comment

This cost looks like a circular dependency: this program is meant to establish the cost, not report what it happens to be set to right now. It seems easy to misunderstand. I don't know what it tells us here, but I do see how it can be confusing. I think it would be best to remove it, but at least it should have a comment explaining what it's for.

The main risk I see is that it is accidentally used to make a bad cost look good, because the bad cost matches this reported value.

@matt-o-how (Contributor, Author) replied

I understand your concern, but I believe it's useful to keep it in the file to track how close the currently set values are to the calculated expected values. I've reformatted the output to be far more explicit about what that output means.
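For context, a cost model can be anchored purely in wall-clock measurements rather than the interpreter's reported cost, which sidesteps the circularity raised above. A minimal best-of-k timing harness along those lines (illustrative names and a stand-in workload, not the PR's code) might look like this:

```rust
use std::hint::black_box;
use std::time::Instant;

// Time `f` over `iters` calls, repeated `reps` times; return the best
// average in nanoseconds. Deriving the model from wall-clock time keeps
// the benchmark independent of whatever cost the interpreter reports.
fn bench_ns<F: FnMut()>(mut f: F, iters: u32, reps: u32) -> f64 {
    let mut best = f64::INFINITY;
    for _ in 0..reps {
        let start = Instant::now();
        for _ in 0..iters {
            f();
        }
        best = best.min(start.elapsed().as_nanos() as f64 / f64::from(iters));
    }
    best
}

fn main() {
    // Stand-in workload; the real tool would run the CLVM program here.
    let data = vec![1u8; 1024];
    let ns = bench_ns(
        || {
            black_box(data.iter().map(|&b| u64::from(b)).sum::<u64>());
        },
        1_000,
        10,
    );
    println!("{ns:.1} ns per iteration");
}
```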

// a new list entry is 2 nodes (a cons and a nil) and a 3 chunk hash operation and a 1 chunk hash operation
// this equation lets us figure out a theoretical cost just for a node
let duration = (duration - (500.0 + i as f64) * (4.0 * bytes32_native_time)) / 2.0;
let cost = (cost as f64 - (500.0 + i as f64) * (4.0 * bytes32_native_cost)) / 2.0;
Contributor left a comment

Similarly, this cost must not be used to establish the cost model or the cost constants for the tree. I think it's risky to mix it in.
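The deduction quoted above can be restated as a standalone helper for clarity (a hypothetical function, not part of the PR):

```rust
// Mirrors the deduction above: each list entry contributes 2 nodes (a cons
// and a nil) plus 4 hashed 32-byte chunks (one 3-chunk and one 1-chunk
// operation). Subtract the known chunk cost, then halve to isolate the
// per-node remainder.
fn per_node(measured: f64, entries: f64, cost_per_chunk: f64) -> f64 {
    (measured - entries * 4.0 * cost_per_chunk) / 2.0
}
```

Applied to both `duration` and `cost` with `entries = 500.0 + i as f64`, this reproduces the two lines quoted above.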

Comment on lines +235 to +238
writeln!(output_native_time, "{}\t{}", i, duration).unwrap();
writeln!(output_native_cost, "{}\t{}", i, cost).unwrap();
samples_time_native.push((i as f64, duration));
samples_cost_native.push((i as f64, cost as f64));
Contributor left a comment

Suggested change:
- writeln!(output_native_time, "{}\t{}", i, duration).unwrap();
- writeln!(output_native_cost, "{}\t{}", i, cost).unwrap();
- samples_time_native.push((i as f64, duration));
- samples_cost_native.push((i as f64, cost as f64));
+ writeln!(output_native_time, "{}\t{}", (500 + i), duration).unwrap();
+ writeln!(output_native_cost, "{}\t{}", (500 + i), cost).unwrap();
+ samples_time_native.push(((500 + i) as f64, duration));
+ samples_cost_native.push(((500 + i) as f64, cost as f64));

You add 500 items before this loop, so I think you need to offset all `i` by 500. A simpler way may be to change the loop to:

for i in 500..1500 {

And remove the offsetting.

@matt-o-how (Contributor, Author) replied

Done


// this function is for comparing the cost per 32byte chunk of hashing between the native and clvm implementation
#[allow(clippy::type_complexity)]
fn time_per_byte_for_atom(
Contributor left a comment

Suggested change:
- fn time_per_byte_for_atom(
+ fn time_per_32bytes_for_atom(

This is measuring blocks of 32 bytes, right?

@matt-o-how (Contributor, Author) replied

Done

Finally, the `COST_PER_NODE` was the trickiest to pin down, as it is unique to this operator.
The trick to costing it was to compare against the "in-language" implementation and deduct the costs of the known hash operations using the previously costed `COST_PER_BYTES32`.

The calculations for this can be seen in the file `sha256tree-benching.rs`.
Contributor left a comment

Would you mind also including the relevant plots of the measurements, to demonstrate that this model matches reality? Specifically, that CPU time grows in proportion to CLVM cost.

@matt-o-how (Contributor, Author) replied

Plots added
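With the sample vectors the tool collects (`samples_time_native`, `samples_cost_native`, and so on), the proportionality the plots are meant to show can also be checked numerically with an ordinary least-squares fit. A minimal sketch, assuming only the `(x, y)` pairs:

```rust
// Ordinary least-squares fit of y = a + b*x over (x, y) samples. A near-zero
// intercept and a slope that stays stable across machines is what "CPU time
// grows proportionally with CLVM cost" looks like numerically.
// Illustrative helper, not code from the PR.
fn linear_fit(samples: &[(f64, f64)]) -> (f64, f64) {
    let n = samples.len() as f64;
    let sx: f64 = samples.iter().map(|&(x, _)| x).sum();
    let sy: f64 = samples.iter().map(|&(_, y)| y).sum();
    let sxx: f64 = samples.iter().map(|&(x, _)| x * x).sum();
    let sxy: f64 = samples.iter().map(|&(x, y)| x * y).sum();
    let slope = (n * sxy - sx * sy) / (n * sxx - sx * sx);
    let intercept = (sy - slope * sx) / n;
    (intercept, slope)
}
```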

Comment on lines +55 to +56
Native time per node (ns): 115.0891
CLVM time per node (ns): 397.1927
Contributor left a comment

The two different ways you measure time-per-node seem to disagree quite a lot.

benchmark               balanced tree   list
CLVM time per node      517.8038        397.1927
Native time per node    203.8718        115.0891

I see a similar pattern on RPi 5.

benchmark               balanced tree   list
CLVM time per node      1034.3594       855.1965
Native time per node    320.4549        181.4918

I think this suggests there's something fishy about the assumptions in those measurements.
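For reference, the two shapes behind the table's columns, sketched with hypothetical builders (not the benchmark's code): a list of n atoms carries n cons cells plus a nil terminator, while a balanced tree over n atoms has only n - 1 cons cells, so a single per-node figure averages over a different atom/pair mix in each case; that difference is one possible contributor to the gap.

```rust
use std::rc::Rc;

enum Node {
    Atom(Vec<u8>),
    Pair(Rc<Node>, Rc<Node>),
}

// A list of n atoms: n cons cells + n atoms + 1 nil terminator.
fn list(n: usize) -> Rc<Node> {
    let mut tail = Rc::new(Node::Atom(vec![])); // nil
    for _ in 0..n {
        tail = Rc::new(Node::Pair(Rc::new(Node::Atom(vec![0u8; 32])), tail));
    }
    tail
}

// A balanced tree over n atoms: n - 1 cons cells + n atoms.
fn balanced(n: usize) -> Rc<Node> {
    if n <= 1 {
        return Rc::new(Node::Atom(vec![0u8; 32]));
    }
    Rc::new(Node::Pair(balanced(n / 2), balanced(n - n / 2)))
}

// Count (atoms, pairs) to make the differing node mix explicit.
fn count(node: &Node) -> (usize, usize) {
    match node {
        Node::Atom(_) => (1, 0),
        Node::Pair(l, r) => {
            let (la, lp) = count(l);
            let (ra, rp) = count(r);
            (la + ra, lp + rp + 1)
        }
    }
}

fn main() {
    let n = 1000;
    let (a, p) = count(&list(n));
    println!("list({n}): {a} atoms, {p} pairs");
    let (a, p) = count(&balanced(n));
    println!("balanced({n}): {a} atoms, {p} pairs");
}
```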

@arvidn mentioned this pull request Dec 22, 2025