Add versioned cache invalidation and materialize() to Series for production streaming use cases #66

Copilot · 2025-12-05T07:57:49Z

The Series class had no automatic cache invalidation when underlying data mutated, and nested operations created O(N) closure chains that retained all intermediate Series objects in memory.

Changes

BarData wrapper for automatic cache invalidation

Wraps Bar[] with version tracking that increments on mutations (push(), pop(), set(), updateLast(), setAll())
Series now checks version on toArray() and recomputes only when data changed
Enables real-time streaming scenarios where bars are updated incrementally
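A minimal sketch of how such a versioned wrapper could look. The `Bar` fields, the `at()` accessor, the `currentVersion` field name, and the exact method signatures are assumptions made for illustration; the class merged in this PR may differ.

```typescript
// Hypothetical Bar shape for illustration; the real type may carry more fields.
interface Bar {
  time: number;
  open: number;
  high: number;
  low: number;
  close: number;
}

// Sketch of a versioned wrapper: every mutating method bumps `version`.
class BarData {
  private bars: Bar[];
  private currentVersion = 0;

  constructor(bars: Bar[] = []) {
    this.bars = bars;
  }

  get version(): number {
    return this.currentVersion;
  }

  get length(): number {
    return this.bars.length;
  }

  at(index: number): Bar | undefined {
    return this.bars[index];
  }

  push(bar: Bar): void {
    this.bars.push(bar);
    this.currentVersion++;
  }

  pop(): Bar | undefined {
    const removed = this.bars.pop();
    this.currentVersion++;
    return removed;
  }

  set(index: number, bar: Bar): void {
    this.bars[index] = bar;
    this.currentVersion++;
  }

  // Replaces the most recent bar, e.g. while a live candle is still forming.
  updateLast(bar: Bar): void {
    if (this.bars.length === 0) return; // nothing to update (assumed behavior)
    this.bars[this.bars.length - 1] = bar;
    this.currentVersion++;
  }

  setAll(bars: Bar[]): void {
    this.bars = bars;
    this.currentVersion++;
  }
}
```

A consumer only needs to remember the version it last saw; any difference means the bars changed in some way since then.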

materialize() method to break closure chains

Eagerly computes values and returns fresh Series without closure references
Example: a.add(b).mul(c).materialize().div(d) frees references to a, b, c
Critical for complex indicators with deep operation chains
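To illustrate why materializing releases the parents, here is a small sketch built on a hypothetical `LazySeries` stand-in (not the actual Series class). It assumes values are produced by a per-index closure, as in the original implementation.

```typescript
// Hypothetical stand-in for Series: values come from a per-index closure.
class LazySeries {
  constructor(
    readonly length: number,
    private readonly valueAt: (i: number) => number,
  ) {}

  static fromArray(values: number[]): LazySeries {
    const copy = values.slice();
    // This closure captures only `copy`, not any other series.
    return new LazySeries(copy.length, (i) => copy[i] ?? NaN);
  }

  add(other: LazySeries): LazySeries {
    // The new closure captures `this` and `other`, keeping both reachable.
    return new LazySeries(this.length, (i) => this.valueAt(i) + other.valueAt(i));
  }

  mul(other: LazySeries): LazySeries {
    return new LazySeries(this.length, (i) => this.valueAt(i) * other.valueAt(i));
  }

  toArray(): number[] {
    return Array.from({ length: this.length }, (_, i) => this.valueAt(i));
  }

  // Evaluate once, then return a series backed only by the resulting numbers.
  materialize(): LazySeries {
    return LazySeries.fromArray(this.toArray());
  }
}
```

After `materialize()`, the returned object no longer references the series it was computed from, so those can be garbage collected once nothing else holds them. The trade-off in this sketch is that the result is a snapshot and no longer follows later changes to its inputs.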

Backward compatibility

Existing Bar[] arrays automatically wrapped in BarData internally
All factory methods (fromBars(), constant(), fromArray()) accept both types
No API changes required for existing code
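One way the internal wrapping could work is a small normalizing helper. The `toBarData` name and the reduced `Bar`/`BarData` shapes below are hypothetical and exist only to keep the sketch self-contained.

```typescript
// Reduced, hypothetical shapes for illustration only.
interface Bar {
  close: number;
}

class BarData {
  private currentVersion = 0;

  constructor(private readonly bars: Bar[]) {}

  get version(): number {
    return this.currentVersion;
  }

  get length(): number {
    return this.bars.length;
  }

  push(bar: Bar): void {
    this.bars.push(bar);
    this.currentVersion++;
  }
}

// Accept either input type and always hand back a BarData.
function toBarData(source: Bar[] | BarData): BarData {
  return source instanceof BarData ? source : new BarData(source);
}
```

Note that in this sketch, mutating the raw array directly (for example `bars.push(newBar)`) bypasses the wrapper and leaves the version unchanged, so automatic invalidation applies only to mutations made through the BarData methods.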

Example

// Before: manual invalidation, stale cache risk
const close = Series.fromBars(bars, 'close');
const values1 = close.toArray();  // caches
bars.push(newBar);                // Series unaware
close._invalidate();              // must remember to call
const values2 = close.toArray();  // recomputes

// After: automatic invalidation
const barData = new BarData(bars);
const close = Series.fromBars(barData, 'close');
const values1 = close.toArray();  // caches
barData.push(newBar);             // version++
const values2 = close.toArray();  // auto-detects, recomputes

// Memory management
const chained = a.add(b).mul(c).div(d).sub(e);  // holds a,b,c,d,e in memory
const compact = a.add(b).mul(c).materialize().div(d).sub(e);  // releases references to a,b,c

Performance impact: O(1) version check overhead on cache hits, otherwise identical.

Original prompt

Problem Statement

The Series class in packages/oakscriptjs/src/runtime/series.ts has two critical issues that need to be addressed for production use cases, especially real-time/streaming applications.

Issue 1: Series Cache Invalidation

Current Behavior

The Series class caches computed values in toArray() but has no automatic mechanism to detect when the underlying data changes:
toArray(): number[] {
  if (this.cached !== null) {
    return this.cached;  // Returns stale data if underlying bars changed
  }
  this.cached = this.data.map((bar, i) => this.extractor(bar, i, this.data));
  return this.cached;
}

_invalidate(): void {
  this.cached = null;  // Must be called manually
}

Problems

Manual invalidation only: _invalidate() must be called explicitly—there's no automatic detection when this.data changes

No derived Series tracking: When you compose Series (e.g., close.add(open)), the derived Series has its own cache but doesn't know when parent caches should be invalidated

Mutable data reference: this.data is stored by reference—if the original Bar[] array is mutated externally, cached values become stale

Required Solution

Implement a versioned data source pattern:

Create a BarData class (or similar) that wraps Bar[] and tracks a version number that increments on mutations

Modify Series to store a reference to the data source and track the version when cache was computed

On toArray(), check if the current data version matches the cached version before returning cached results

Optionally add an invalidateAll() mechanism for derived Series coordination
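The Series side of this pattern can be sketched as follows. `VersionedList`, `CachedSeries`, and `computeCount` are hypothetical names used only for illustration, not the classes from this repository.

```typescript
// Hypothetical versioned source: bumps `version` on every mutation.
class VersionedList<T> {
  private currentVersion = 0;

  constructor(private readonly data: T[] = []) {}

  get version(): number {
    return this.currentVersion;
  }

  get items(): readonly T[] {
    return this.data;
  }

  push(item: T): void {
    this.data.push(item);
    this.currentVersion++;
  }
}

// Hypothetical series that remembers which source version its cache reflects.
class CachedSeries<T> {
  private cached: number[] | null = null;
  private cachedVersion = -1;
  computeCount = 0; // exposed only to make recomputation observable

  constructor(
    private readonly source: VersionedList<T>,
    private readonly extractor: (item: T, i: number) => number,
  ) {}

  toArray(): number[] {
    // O(1) comparison; recompute only if the source changed since caching.
    if (this.cached !== null && this.cachedVersion === this.source.version) {
      return this.cached;
    }
    this.computeCount++;
    this.cached = this.source.items.map((item, i) => this.extractor(item, i));
    this.cachedVersion = this.source.version;
    return this.cached;
  }

  add(other: CachedSeries<T>): CachedSeries<T> {
    // The derived series shares the same source, so it sees the same version.
    return new CachedSeries<T>(
      this.source,
      (item, i) => this.extractor(item, i) + other.extractor(item, i),
    );
  }
}
```

Because derived series in this sketch share one source and compare against its version, a single mutation is visible to all of them, which may make a separate invalidateAll() unnecessary when every series reads from the same data source.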

Issue 2: Closure Chain Memory Leak

Current Behavior

Every Series operation creates a new closure that captures the parent Series:
add(other: Series | number): Series {
  return new Series(this.data, (bar, i, data) => {
    const a = this.extractor(bar, i, data);  // Captures 'this'
    const b = typeof other === 'number' ? other : other.extractor(bar, i, data);  // Captures 'other'
    return a + b;
  });
}

Problem

For a chain like a.add(b).mul(c).div(d).sub(e):

Each step creates a closure capturing its operand Series, which in turn keep every earlier Series in the chain reachable

Results in O(N) Series objects kept alive for a chain of N operations

Each Series holds: data (Bar[]), extractor (function), cached (number[] | null)

Deep chains in complex indicators can retain significant memory

Required Solution

Implement one or more of these approaches:

Add a materialize() method that eagerly computes values and creates a fresh Series, breaking the closure chain:
materialize(): Series {
  const values = this.toArray();
  return Series.fromArray(this.data, values);
}

Expression tree approach (more complex): Replace nested closures with an expression AST that can be evaluated in a single pass without holding references to intermediate Series

Automatic materialization heuristic: Track chain depth and auto-materialize when exceeding a threshold
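As a rough illustration of the expression tree approach, the sketch below models operations as plain data nodes and evaluates them index by index. The `Expr` type, the node kinds, and the helper names are all hypothetical; this PR implements materialize() rather than an AST.

```typescript
// Hypothetical expression nodes: plain data, no caches and no Series objects.
type Op = "+" | "-" | "*" | "/";

type Expr =
  | { kind: "field"; values: readonly number[] }
  | { kind: "const"; value: number }
  | { kind: "binary"; op: Op; left: Expr; right: Expr };

function applyOp(op: Op, l: number, r: number): number {
  switch (op) {
    case "+":
      return l + r;
    case "-":
      return l - r;
    case "*":
      return l * r;
    case "/":
      return l / r;
  }
}

// Evaluate the whole tree for one index.
function evalAt(expr: Expr, i: number): number {
  switch (expr.kind) {
    case "field":
      return expr.values[i] ?? NaN;
    case "const":
      return expr.value;
    case "binary":
      return applyOp(expr.op, evalAt(expr.left, i), evalAt(expr.right, i));
  }
}

// Single pass over all indices; no intermediate arrays are allocated.
function evaluate(expr: Expr, length: number): number[] {
  return Array.from({ length }, (_, i) => evalAt(expr, i));
}
```

The tree nodes still reference each other, but each node is a small object without its own cached array, which is where most of the memory cost of a deep closure chain would come from under the description above.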

Files to Modify

packages/oakscriptjs/src/runtime/series.ts - Main Series class implementation

packages/oakscriptjs/src/types.ts - May need new types for BarData/DataSource

packages/oakscriptjs/src/index.ts - Export new classes if added

packages/oakscriptjs/tests/ - Add tests for new functionality

Acceptance Criteria

Series cache is automatically invalidated when underlying data changes (for streaming/real-time use)

A materialize() method (or similar) exists to break closure chains and free memory

Existing API remains backward compatible

Unit tests cover cache invalidation scenarios

Unit tests cover memory management with materialize()

Documentation updated to explain new features

References

Current Series implementation: https://github.com/deepentropy/oakscriptJS/blob/434521cbef06ea0f2bf0476c7f6b5fa5fdc7e312/packages/oakscriptjs/src/runtime/series.ts

This pull request was created as a result of the prompt above, from Copilot chat.

- Add BarData class with version tracking for automatic cache invalidation
- Update Series to use BarData and track cache version
- Add materialize() method to break closure chains and free memory
- Maintain backward compatibility with Bar[] arrays
- Add comprehensive test suite with 33 tests covering all new features

Co-authored-by: deepentropy <8287111+deepentropy@users.noreply.github.com>

Copilot

Pull request overview

This PR adds production-ready cache invalidation and memory management to the Series class for streaming use cases. It introduces a BarData wrapper that tracks data mutations via version numbers, enabling automatic cache invalidation, and adds a materialize() method to break closure chains that cause memory leaks in complex indicator calculations.

Key Changes

BarData wrapper: Automatically increments version on mutations (push(), pop(), set(), updateLast(), setAll()), enabling Series to detect stale caches
Automatic cache invalidation: Series checks BarData version before returning cached values, recomputing only when data has changed
materialize() method: Breaks closure chains by eagerly computing values and creating a fresh Series without parent references, critical for complex multi-operation chains

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 6 comments.

File	Description
packages/oakscriptjs/src/runtime/series.ts	Adds BarData class with version tracking, updates Series constructor to accept Bar[] or BarData, implements version-based cache invalidation in toArray(), adds materialize() method
packages/oakscriptjs/src/index.ts	Exports BarData class alongside Series for public API
packages/oakscriptjs/tests/runtime/series.test.ts	Comprehensive test coverage for BarData mutations, cache invalidation scenarios, materialize() functionality, backward compatibility, and integration tests

Copilot · 2025-12-05T08:18:15Z