fix(riva): avoid stale interim carryover in segment assembly by rbright · Pull Request #3 · rbright/sotto

rbright · 2026-02-23T00:18:24Z

Summary

- Stop pre-committing divergent interim ASR hypotheses in the Riva receive loop.
- Keep only the latest interim hypothesis until a final result is emitted.
- Add regression tests to prevent stale or duplicated leading text in assembled transcripts.

Why

Riva interim hypotheses can reset boundaries across updates. Pre-committing previous interim text could prepend stale text or duplicate leading phrases in final output.
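The failure mode described above can be illustrated with a small, self-contained sketch. It is not the code from this PR: the `update` type and both assembly functions are hypothetical, and the "divergence" test is reduced to a plain prefix check for brevity.

```go
package main

import (
	"fmt"
	"strings"
)

// update is a hypothetical stand-in for one streaming ASR result.
type update struct {
	text  string
	final bool
}

// assemblePrecommit mimics the old behaviour in simplified form: when a new
// interim does not extend the previous one, the previous interim is committed.
func assemblePrecommit(updates []update) string {
	var segments []string
	last := ""
	for _, u := range updates {
		if u.final {
			segments = append(segments, u.text)
			last = ""
			continue
		}
		if last != "" && !strings.HasPrefix(u.text, last) {
			segments = append(segments, last) // stale text gets locked in here
		}
		last = u.text
	}
	if last != "" {
		segments = append(segments, last)
	}
	return strings.Join(segments, " ")
}

// assembleLatestOnly keeps only the newest interim until a final arrives.
func assembleLatestOnly(updates []update) string {
	var segments []string
	last := ""
	for _, u := range updates {
		if u.final {
			segments = append(segments, u.text)
			last = ""
			continue
		}
		last = u.text // replace, never pre-commit
	}
	if last != "" {
		segments = append(segments, last)
	}
	return strings.Join(segments, " ")
}

func main() {
	updates := []update{
		{"hello word how", false},
		{"hello world how are", false}, // correction, not a prefix extension
		{"hello world how are you", true},
	}
	fmt.Println(assemblePrecommit(updates))
	fmt.Println(assembleLatestOnly(updates))
}
```

In this sketch the pre-committing variant keeps the misrecognized `hello word how` in front of the final text, while the latest-only variant yields just the final.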

Testing

go test ./apps/sotto/internal/riva
just ci-check
nix build 'path:.#sotto'

Summary by CodeRabbit

Bug Fixes
- Improved handling of interim speech-recognition results: stale interim text is no longer prepended to finals; prior interim is cleared or preserved based on stability, suffix/continuation checks, and sentence-ending punctuation. Introduces a high-confidence stability threshold (~0.85) and resets interim stability on final results.
Tests
- Added and updated tests covering divergent/stale interim scenarios, stability-threshold behavior, suffix-correction handling, and final-result consolidation.

Do not pre-commit divergent interim hypotheses before final results. Riva can reset interim boundaries between updates; pre-committing the prior interim can prepend stale text or duplicate leading phrases in the final transcript.

Tests:
- go test ./apps/sotto/internal/riva
- just ci-check
- nix build 'path:.#sotto'

coderabbitai · 2026-02-23T00:18:41Z

📝 Walkthrough

Stream interim handling now tracks interim stability. When a new interim diverges from the prior one, a rule combining stability, sentence-ending punctuation, and suffix/prefix similarity decides whether to commit the prior interim. Stream state and processing were updated, and tests were added or renamed to validate the replacement and commit behaviors.

Changes

| Cohort / File(s) | Summary |
| --- | --- |
| Core stream state `apps/sotto/internal/riva/client.go` | Added `lastInterimStability` field to `Stream` and adjusted related field comments/ordering to track interim stability. |
| Stream processing logic `apps/sotto/internal/riva/stream_receive.go` | Reset `lastInterimStability` on final results; replaced prior interim-commit check with `shouldCommitPriorInterimOnDivergence(...)`; set `lastInterim` and `lastInterimStability` after interim results and commit prior interim conditionally. |
| Interim decision logic `apps/sotto/internal/riva/transcript_segments.go` | Added `stableInterimBoundaryThreshold = 0.85`; extended `isInterimContinuation` with suffix checks and stronger prefix/suffix heuristics; added `shouldCommitPriorInterimOnDivergence(previous, previousStability, current) bool` and `endsWithSentencePunctuation(transcript) bool` to decide commits based on stability or sentence-ending punctuation. |
| Tests `apps/sotto/internal/riva/client_test.go` | Renamed `TestRecordResponseCommitsInterimAcrossPauseLikeReset` → `TestRecordResponseReplacesDivergentInterimWithoutPrecommit`; added tests `TestRecordResponseCommitsStableDivergentInterimForPartialRecovery`, `TestRecordResponseDoesNotPrependStaleInterimBeforeFinal`, `TestRecordResponseTreatsSuffixCorrectionAsContinuation`; added assertions about clearing segments, interim replacement, and stability-driven commit behavior. |
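As a rough illustration of the decision rule summarized in the table, the sketch below reuses the function names, the signature shape, and the 0.85 threshold mentioned there. The bodies are assumptions, not the PR's actual code: the parameter types (`string`, `float32`), the punctuation set (`.`, `!`, `?`), and the empty-input guard are all guesses, and the caller is assumed to have already established that `current` is not a continuation of `previous`.

```go
package main

import (
	"fmt"
	"strings"
)

// Threshold value as reported in the PR summary.
const stableInterimBoundaryThreshold = 0.85

// endsWithSentencePunctuation reports whether the trimmed transcript ends in
// '.', '!' or '?'. The exact punctuation set is an assumption.
func endsWithSentencePunctuation(transcript string) bool {
	t := strings.TrimSpace(transcript)
	if t == "" {
		return false
	}
	switch t[len(t)-1] {
	case '.', '!', '?':
		return true
	}
	return false
}

// shouldCommitPriorInterimOnDivergence decides whether a prior interim should
// be kept as its own segment once the next interim has diverged from it.
// Assumed rule: commit when stability is high OR the text looks sentence-complete.
func shouldCommitPriorInterimOnDivergence(previous string, previousStability float32, current string) bool {
	if strings.TrimSpace(previous) == "" || strings.TrimSpace(current) == "" {
		return false // nothing meaningful to commit or compare against
	}
	if previousStability >= stableInterimBoundaryThreshold {
		return true
	}
	return endsWithSentencePunctuation(previous)
}

func main() {
	fmt.Println(shouldCommitPriorInterimOnDivergence("turn on the lights", 0.9, "what time is it"))
	fmt.Println(shouldCommitPriorInterimOnDivergence("turn on the", 0.3, "what time"))
	fmt.Println(shouldCommitPriorInterimOnDivergence("Turn on the lights.", 0.1, "what time is it"))
}
```

Low-stability, unpunctuated interims fall through to replacement, which is the case the PR title is concerned with.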

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~45 minutes

Poem

🐇 I hopped through interims, sniffed each line,
Kept the steady ones, let the shaky resign,
If punctuation stood proud or stability high,
I planted it firm — else the old would fly,
Fresh hops, no stale echoes — that's my sign.

🚥 Pre-merge checks | ✅ 2 | ❌ 1

❌ Failed checks (1 warning)

| Check name | Status | Explanation | Resolution |
| --- | --- | --- | --- |
| Docstring Coverage | ⚠️ Warning | Docstring coverage is 46.67% which is insufficient. The required threshold is 80.00%. | Write docstrings for the functions missing them to satisfy the coverage threshold. |

✅ Passed checks (2 passed)

| Check name | Status | Explanation |
| --- | --- | --- |
| Description Check | ✅ Passed | Check skipped - CodeRabbit’s high-level summary is enabled. |
| Title check | ✅ Passed | The title clearly and concisely summarizes the main change: fixing stale interim carryover in segment assembly, which aligns with the PR's core objective of preventing pre-committed divergent interim hypotheses from creating stale text in assembled transcripts. |

Warning

There were issues while running some tools. Please review the errors and either fix the tool's configuration or disable the tool if it's a critical failure.

🔧 golangci-lint (2.5.0)

Error: can't load config: unsupported version of the configuration: "" See https://golangci-lint.run/docs/product/migration-guide for migration instructions
The command is terminated due to an error: can't load config: unsupported version of the configuration: "" See https://golangci-lint.run/docs/product/migration-guide for migration instructions

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 38d2798505

apps/sotto/internal/riva/stream_receive.go

Address PR review feedback by preserving prior interim text only when divergence indicates a likely utterance boundary.

- track interim stability in stream state
- commit prior interim on divergence only when high-confidence or sentence-complete
- keep low-confidence divergence replacement to avoid stale/duplicated leading text
- add regression coverage for stable partial recovery and stale-prepend avoidance

Tests:
- go test ./apps/sotto/internal/riva
- just ci-check
- nix build 'path:.#sotto'

Only preserve divergent interim text when it looks like a completed sentence and has high stability. This avoids prepending corrected-but-stale leading words while still keeping partial-recovery support for likely utterance boundaries in no-final EOF paths.

Tests:
- go test ./apps/sotto/internal/riva
- just ci-check
- nix build 'path:.#sotto'

Relax divergent interim boundary commits to preserve stable partial speech in long dictation while treating suffix-based corrections as continuation updates.

- commit divergent interim when prior stability is high or sentence punctuation is present
- detect suffix-overlap corrections and avoid pre-committing stale leading words
- add regression tests for long-stream partial recovery and suffix correction behavior

Tests:
- go test ./apps/sotto/internal/riva
- just ci-check
- nix build 'path:.#sotto'
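The state handling these commits describe might look roughly like the sketch below. It is only an approximation under stated assumptions: the `stream` type, its `record` and `transcript` methods, and the helper names are hypothetical, the continuation check is a bare prefix comparison rather than the PR's `isInterimContinuation`, and only the field names `lastInterim` and `lastInterimStability` and the 0.85 threshold come from the PR text.

```go
package main

import (
	"fmt"
	"strings"
)

const stableThreshold = 0.85 // value reported in the PR summary

// stream is a hypothetical, stripped-down stand-in for the PR's Stream type.
type stream struct {
	segments             []string
	lastInterim          string
	lastInterimStability float32
}

// isContinuation is a simplified placeholder: one text is a prefix of the other.
func isContinuation(prev, curr string) bool {
	return strings.HasPrefix(curr, prev) || strings.HasPrefix(prev, curr)
}

// endsSentence checks for '.', '!' or '?' at the end (assumed punctuation set).
func endsSentence(s string) bool {
	s = strings.TrimSpace(s)
	return strings.HasSuffix(s, ".") || strings.HasSuffix(s, "!") || strings.HasSuffix(s, "?")
}

// record applies one result to the stream state.
func (s *stream) record(text string, stability float32, final bool) {
	if final {
		// A final supersedes whatever interim was pending.
		s.segments = append(s.segments, text)
		s.lastInterim = ""
		s.lastInterimStability = 0
		return
	}
	prev := s.lastInterim
	if prev != "" && !isContinuation(prev, text) &&
		(s.lastInterimStability >= stableThreshold || endsSentence(prev)) {
		// Divergence that looks like a real utterance boundary: keep the prior text.
		s.segments = append(s.segments, prev)
	}
	// Otherwise the new interim simply replaces the old one.
	s.lastInterim = text
	s.lastInterimStability = stability
}

// transcript joins committed segments plus any pending interim (no-final EOF path).
func (s *stream) transcript() string {
	parts := append([]string{}, s.segments...)
	if s.lastInterim != "" {
		parts = append(parts, s.lastInterim)
	}
	return strings.Join(parts, " ")
}

func main() {
	var s stream
	s.record("hello word how", 0.2, false)
	s.record("hello world how are", 0.3, false)
	s.record("hello world how are you", 0, true)
	fmt.Println(s.transcript())
}
```

The trade-off sketched here is the one the commit messages name: low-confidence divergence is treated as a correction and replaced, while high-stability or sentence-complete text survives a stream that ends without a final.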

coderabbitai

🧹 Nitpick comments (2)

apps/sotto/internal/riva/transcript_segments.go (2)

72-75: Consider clarifying the minimum word count rationale.

The shorter >= 3 condition prevents false positives on very short transcripts, but this intent isn't immediately obvious. A brief inline comment would help.

📝 Suggested comment

+	// Require at least 3 words to use suffix similarity; shorter transcripts
+	// risk coincidental trailing-word matches.
 	commonSuffix := commonSuffixWords(prevWords, currWords)
 	if shorter >= 3 && commonSuffix*2 >= shorter {
 		return true
 	}
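For readers without the file open, here is one way the helper and the condition in the snippet above could fit together. Only the call shape `commonSuffixWords(prevWords, currWords)` and the `shorter >= 3 && commonSuffix*2 >= shorter` check come from the diff; the helper body, the wrapper function `looksLikeSuffixCorrection`, and the lowercase, whitespace-based tokenization are assumptions made for this sketch.

```go
package main

import (
	"fmt"
	"strings"
)

// commonSuffixWords counts how many trailing words two word slices share.
// The signature is inferred from the call site; the body is a guess.
func commonSuffixWords(a, b []string) int {
	n := 0
	for n < len(a) && n < len(b) && a[len(a)-1-n] == b[len(b)-1-n] {
		n++
	}
	return n
}

// looksLikeSuffixCorrection is a hypothetical wrapper around the condition in
// the diff: at least 3 words in the shorter transcript, and the shared suffix
// covers at least half of it.
func looksLikeSuffixCorrection(previous, current string) bool {
	prevWords := strings.Fields(strings.ToLower(previous))
	currWords := strings.Fields(strings.ToLower(current))
	shorter := len(prevWords)
	if len(currWords) < shorter {
		shorter = len(currWords)
	}
	commonSuffix := commonSuffixWords(prevWords, currWords)
	return shorter >= 3 && commonSuffix*2 >= shorter
}

func main() {
	// Leading word corrected, tail unchanged: treated as a continuation.
	fmt.Println(looksLikeSuffixCorrection("i want to go to the store", "we want to go to the store"))
	// Too short for the heuristic to apply.
	fmt.Println(looksLikeSuffixCorrection("the store", "a store"))
}
```

The 3-word floor matters because with two-word transcripts a single shared trailing word would already count as a majority suffix.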

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@apps/sotto/internal/riva/transcript_segments.go` around lines 72 - 75, Add a
brief inline comment above the conditional that uses commonSuffixWords to
explain the rationale for the minimum-word cutoff: clarify that the check
(shorter >= 3 && commonSuffix*2 >= shorter) is intended to avoid false positives
on very short transcripts by requiring at least 3 words before applying the
"majority suffix" heuristic; reference the variables commonSuffix, shorter and
the helper commonSuffixWords so reviewers can quickly understand why the
threshold exists.

5-6: Consider documenting the threshold rationale.

The 0.85 threshold is a magic number that determines when interim results are stable enough to commit on divergence. A brief comment explaining the empirical basis or design rationale would help future maintainers understand when this value might need adjustment.

📝 Suggested documentation

-const stableInterimBoundaryThreshold = 0.85
+// stableInterimBoundaryThreshold is the minimum stability score at which a prior
+// interim hypothesis is considered reliable enough to commit when the next
+// hypothesis diverges (i.e., is not a continuation).
+const stableInterimBoundaryThreshold = 0.85

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@apps/sotto/internal/riva/transcript_segments.go` around lines 5 - 6, The
constant stableInterimBoundaryThreshold = 0.85 is a magic number; add a short
inline comment above the stableInterimBoundaryThreshold declaration that
documents its purpose and rationale (e.g., how 0.85 was chosen—empirically tuned
for X dataset/latency vs stability tradeoff—or a pointer to tests/experiments
that justify it), note the units/meaning (probability/confidence), and describe
when to adjust it or where to find related tests/benchmarks that should be
updated if it changes.

chatgpt-codex-connector bot reviewed Feb 23, 2026

rbright added 3 commits February 22, 2026 18:20

coderabbitai bot reviewed Feb 23, 2026

rbright merged commit 7c7f91e into main Feb 23, 2026
2 checks passed

rbright deleted the fix/riva-interim-hypothesis-reset branch February 23, 2026 03:17

coderabbitai bot mentioned this pull request Feb 23, 2026

fix(riva): rebuild segment assembly for long dictation #4

Merged

rbright merged 4 commits into main from fix/riva-interim-hypothesis-reset
