bench 31132 #178

willcl-ark · 2026-01-13T10:31:15Z

.

github-actions · 2026-01-13T17:09:14Z

Benchmark Results

Comparison to nightly master:

No nightly history available for comparison

View detailed results
View nightly trend chart

github-actions · 2026-01-14T01:33:16Z

Benchmark Results

Comparison to nightly master:

450 MB: 48 min (nightly: 60 min, 2026-01-12) → +20.0% faster
32000 MB: 38 min (nightly: 45 min, 2026-01-12) → +15.1% faster

View detailed results
View nightly trend chart

github-actions · 2026-01-14T19:27:22Z

Benchmark Results

Comparison to nightly master:

450 MB: 49 min (nightly: 61 min, 2026-01-13) → +20.2% faster
32000 MB: 38 min (nightly: 46 min, 2026-01-13) → +16.5% faster

View detailed results
View nightly trend chart

Adds build configuration, benchmarking CI workflows, Python dependencies, plotting tools, and documentation for benchcoin. Co-authored-by: David Gumberg <davidzgumberg@gmail.com> Co-authored-by: Lőrinc <pap.lorinc@gmail.com>

- Fix empty chart: use get_chart_data() instead of to_dict() so JS filters can match config strings ("450", "32000") instead of objects - Capture machine specs on self-hosted runner during build job and pass via --machine-specs flag to nightly append, instead of detecting on the ubuntu-latest publish runner

Add a `Reset()` method to `CCoinsViewCache` that clears `cacheCoins`, `cachedCoinsUsage`, and `hashBlock` without flushing to the `base` view. This allows efficiently reusing a cache instance across multiple blocks. Introduce `m_connect_block_view` as a persistent cache layer for `ConnectBlock`, avoiding repeated memory allocations. On block validation failure, `Reset()` discards uncommitted changes without affecting the main cache. Co-authored-by: l0rinc <pap.lorinc@gmail.com>

Introduce a helper to look up a Coin through a stack of CCoinsViewCache layers without populating parent caches. This is useful for ephemeral views (e.g. during ConnectBlock) that want to avoid polluting CoinsTip() when validating invalid blocks. Co-authored-by: l0rinc <pap.lorinc@gmail.com>

Introduce CoinsViewCacheAsync, a CCoinsViewCache subclass that reads coins without mutating the underlying cache via FetchCoin(). Add GetCoinFromBase() which is called for cache misses in FetchCoin. In CoinsViewCacheAsync this method is overridden and calls PeekCoin(). This prevents the main cache from caching inputs pulled from disk for a block that has not yet been fully validated. Once Flush() is called on m_connect_block_view, these inputs will be added as spent to coinsCache in the main cache via BatchWrite(). This is the foundation for async input fetching, where worker threads must not mutate shared state. Co-authored-by: l0rinc <pap.lorinc@gmail.com>

Refactor TestCoinsView() to accept the cache as a parameter instead of creating it internally. This prepares for adding CoinsViewCacheAsync fuzz targets that need to pass in a different cache type. This is a non-functional change. Co-authored-by: l0rinc <pap.lorinc@gmail.com>

Add StartFetching() to populate a queue of all transaction inputs in a block, then fetch them all via ProcessInput() before entering ConnectBlock. GetCoinFromBase() now checks this queue first. StartFetching() returns a FetchControl struct which is bound to the lifetime of the block. When FetchControl goes out of scope and is destroyed, it will clear the fetched inputs so the prevout referencing the block are not accessed. Introduce InputToFetch struct to track each input's outpoint and fetched coin. GetCoinFromBase() scans the queue sequentially, matching ConnectBlock's access pattern where inputs are processed in block order. ProcessInput() fetches coins one at a time using PeekCoin(), preparing for parallel execution in later commits. Also add fuzz targets for CoinsViewCacheAsync and add StartFetching() to unit tests. Co-authored-by: sedited <seb.kung@gmail.com> Co-authored-by: l0rinc <pap.lorinc@gmail.com>

Add a benchmark measuring CoinsViewCacheAsync performance when fetching inputs for a block. Creates a realistic scenario by adding all inputs from block 413567 to a chainstate with an in memory leveldb backend. Measures the time to access all inputs through the cache. Co-authored-by: l0rinc <pap.lorinc@gmail.com>

Skip fetching inputs that spend outputs created earlier in the same block, since these coins won't exist in the cache or database yet. Store the first 8 bytes of each transaction's txid in a sorted vector for O(log n) binary search lookups. Using truncated txids is a performance optimization; in the rare case of a collision, the input simply won't be prefetched and will fall back to normal fetching on the main thread. This adds a performance regression due to the extra sorting and filtering. Since the benchmark uses an in-memory leveldb, there is no real disk I/O that is avoided. > bench: add CoinsViewCacheAsync benchmark | ns/op | op/s | err% | ins/op | cyc/op | IPC | bra/op | miss% | total | benchmark |--------------------:|--------------------:|--------:|----------------:|----------------:|-------:|---------------:|--------:|----------:|:---------- | 1,664,383.00 | 600.82 | 2.4% | 31,957,257.00 | 4,017,069.00 | 7.955 | 5,318,396.00 | 0.6% | 0.02 | CoinsViewCacheAsyncBenchmark > coins: filter inputs created in same block before fetching | ns/op | op/s | err% | ins/op | cyc/op | IPC | bra/op | miss% | total | benchmark |--------------------:|--------------------:|--------:|----------------:|----------------:|-------:|---------------:|--------:|----------:|:---------- | 1,970,543.00 | 507.47 | 4.0% | 32,640,039.00 | 4,760,784.00 | 6.856 | 5,506,291.00 | 1.2% | 0.02 | CoinsViewCacheAsyncBenchmark Co-authored-by: l0rinc <pap.lorinc@gmail.com>

Allow the main thread to process unfetched inputs while waiting for a specific coin. Instead of blocking, GetCoinFromBase() calls ProcessInput() to make forward progress on the queue. This prepares for parallel fetching where the main thread can help workers complete the queue rather than idling while waiting. Co-authored-by: l0rinc <pap.lorinc@gmail.com>

Restructure TestCoinsView() to perform all checks that don't mutate the backend before accessing backend_coins_view with HaveCoin()/GetCoin(). This prepares for CoinsViewCacheAsync testing, where we want to run as many checks as possible while async fetching is still active. Only at the very end do we call StopFetching() and perform the backend consistency checks that require mutating calls (HaveCoin/GetCoin call FetchCoin which writes to cacheCoins). Non-mutating operations like GetBestBlock(), EstimateSize(), and Cursor() can safely run on the backend while workers are still fetching. Co-authored-by: l0rinc <pap.lorinc@gmail.com>

Rename ProcessInput to ProcessInputInBackground. Add thread-safe synchronization primitives to allow any thread to safely call ProcessInputInBackground once all threads arrive_and_wait() a std::barrier. Make m_input_head a std::atomic_unit32_t, so workers can claim inputs atomically in ProcessInputInBackground. Make ready flag a std::atomic_flag per InputToFetch to act as an atomic memory fence. Workers release and the main thread acquires the flag to ensure the coin is seen correctly no matter which thread has written it. Add StopFetching() private method that skips all remaining inputs, waits for all threads to arrive at the std::barrier, and resets all state in CoinsViewCacheAsync. Override Flush(), Sync(), and SetBackend() to call StopFetching() before calling CCoinsViewCache base class methods. This ensures no worker threads can access base while it is being mutated. Co-authored-by: l0rinc <pap.lorinc@gmail.com>

Spawn a fixed pool of worker threads (default 4) that fetch coins in parallel. Workers wait at the barrier until StartFetching() signals work is available, then race to claim and fetch inputs from the queue. Once all inputs have been fetched, the workers wait at the barrier until the main thread arrives via StopFetching(). The destructor arrives at the barrier a final time with an empty m_inputs, which signals to the threads to exit their loop. > coins: filter inputs created in same block before fetching | ns/op | op/s | err% | ins/op | cyc/op | IPC | bra/op | miss% | total | benchmark |--------------------:|--------------------:|--------:|----------------:|----------------:|-------:|---------------:|--------:|----------:|:---------- | 1,970,543.00 | 507.47 | 4.0% | 32,640,039.00 | 4,760,784.00 | 6.856 | 5,506,291.00 | 1.2% | 0.02 | CoinsViewCacheAsyncBenchmark > validation: fetch inputs on parallel threads | ns/op | op/s | err% | ins/op | cyc/op | IPC | bra/op | miss% | total | benchmark |--------------------:|--------------------:|--------:|----------------:|----------------:|-------:|---------------:|--------:|----------:|:---------- | 1,601,969.00 | 624.23 | 2.9% | 8,345,989.00 | 2,232,468.00 | 3.738 | 1,089,340.00 | 1.8% | 0.03 | CoinsViewCacheAsyncBenchmark Co-authored-by: l0rinc <pap.lorinc@gmail.com>

willcl-ark force-pushed the master branch from 8d5d67e to f892fc6 Compare January 14, 2026 03:00

willcl-ark force-pushed the pr31132-test branch from 9197776 to 3a78b5f Compare January 14, 2026 14:46

willcl-ark force-pushed the master branch from c660ac8 to da27a85 Compare January 14, 2026 14:47

willcl-ark force-pushed the pr31132-test branch from 3a78b5f to ee2c32a Compare January 14, 2026 14:47

willcl-ark and others added 21 commits January 15, 2026 02:54

benchcoin: add tooling

b089402

Adds build configuration, benchmarking CI workflows, Python dependencies, plotting tools, and documentation for benchcoin. Co-authored-by: David Gumberg <davidzgumberg@gmail.com> Co-authored-by: Lőrinc <pap.lorinc@gmail.com>

don't compare to master in prs

87be800

only run single bins in prs

6fa2fe6

rebase at 0100 GMT

696bd15

make charts taller

3d4dd3f

update machine configs and charts

3dbeb79

chart: make chart series dynamic and unique

fd87b71

rename history file

702c85a

use better colours in charts

fb591c8

don't use inline html

8568094

use commit date in chart data points

28b9eed

use nix flake in both publish workflow steps

758b13d

fix nightly-history mismatch

6544564

fix instrumented suffixes in reports

39c0c30

add clickable plotly links

b00689b

use corect path in index

a0d06ec

use scatter plot for leveldb compaction

ea022f6

add debug logs to artifacts

1695858

dynamic charts test

2002203

fix theme render order

944c924

willcl-ark force-pushed the master branch from da27a85 to 944c924 Compare January 15, 2026 02:54

add ruff and ty to flake

f0d78f9

willcl-ark and others added 15 commits January 15, 2026 10:40

add ty.toml

66e1228

add ruff.toml

44ce167

support a full IBD PR run

2908ea8

configure full IBD uninstrumented run

d7109ab

willcl-ark force-pushed the pr31132-test branch from ee2c32a to d7109ab Compare January 15, 2026 10:43

willcl-ark force-pushed the master branch from 2908ea8 to 5f20e15 Compare January 16, 2026 02:56

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

bench 31132 #178

bench 31132 #178

Uh oh!

willcl-ark commented Jan 13, 2026

Uh oh!

github-actions bot commented Jan 13, 2026

Uh oh!

github-actions bot commented Jan 14, 2026

Uh oh!

github-actions bot commented Jan 14, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

bench 31132 #178

Are you sure you want to change the base?

bench 31132 #178

Uh oh!

Conversation

willcl-ark commented Jan 13, 2026

Uh oh!

github-actions bot commented Jan 13, 2026

Benchmark Results

Uh oh!

github-actions bot commented Jan 14, 2026

Benchmark Results

Uh oh!

github-actions bot commented Jan 14, 2026

Benchmark Results

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants