Configurable block sizes in sealed fraction

For search requests on real seq-db instances there are two major contributors of CPU usage:
- iterating through query tree (left side on the flamegraph)
- reading LID blocks from disk (right side on the flamegraph)

<img width="1866" height="752" alt="Image" src="https://github.com/user-attachments/assets/4ab98bcb-647c-4cce-8b2e-886486333b43" />

The first part will be addressed through batcher query execution and block skipping. The second part can be partially addressed though lower LID pages. Databases strive to align page size to 4kb which is equal to the minimum amount which can be read from disk.

Pros:
- higher cache granularity and efficiency
- better skipping granurality
- faster performance on search query and histograms

Cons:
- more independent disk reads on aggregations
- more independent read on S3 searches

Most cons can be addressed through intelligent prefetching and repacking fracs before uploading to S3.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Configurable block sizes in sealed fraction #330

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Configurable block sizes in sealed fraction #330

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions