Conversation

@cachafla cachafla commented Dec 5, 2025

Pull Request Description

What and why?

Eliminate Network Calls on Import with Lazy Tiktoken Loading

This PR refactors the validmind.ai.test_descriptions module so that importing it no longer requires network access, by loading tiktoken lazily and tolerating failure. Previously, tiktoken.encoding_for_model("gpt-4o") would trigger a network request to download encoding data, causing delays and failures in environments without network access.

The solution takes a hybrid approach: it attempts to import tiktoken once at module load inside a try/except block, caching the result in module-level flags (_TIKTOKEN_AVAILABLE and _TIKTOKEN_ENCODING). The _truncate_summary function then checks these cached flags with negligible runtime overhead:

Before: Direct import causes network call

import tiktoken

def _truncate_summary(summary, test_id, max_tokens=100_000):
    encoding = tiktoken.encoding_for_model("gpt-4o")  # Called every time
    summary_tokens = encoding.encode(summary)
    ...

After: Cached import with character-based fallback

_TIKTOKEN_AVAILABLE = False
_TIKTOKEN_ENCODING = None

try:
    import tiktoken
    _TIKTOKEN_ENCODING = tiktoken.encoding_for_model("gpt-4o")
    _TIKTOKEN_AVAILABLE = True
except Exception:  # ImportError, or a failed encoding download
    pass  # Fall back to character-based estimation

def _truncate_summary(summary, test_id, max_tokens=100_000):
    if _TIKTOKEN_AVAILABLE:
        summary_tokens = _TIKTOKEN_ENCODING.encode(summary)  # Use cached encoding
        ...
    else:
        estimated_tokens = len(summary) // 4  # Simple fallback
        ...

When tiktoken is available, the implementation uses accurate token counting. When unavailable (no network, import failure), it gracefully falls back to character-based estimation (~4 characters per token). This ensures the library works reliably in all environments while maintaining accuracy when possible. Comprehensive unit tests verify both code paths execute correctly with proper assertions on mocked function calls.

How to test

What needs special review?

Dependencies, breaking changes, and deployment notes

Release notes

Checklist

  • What and why
  • Screenshots or videos (Frontend)
  • How to test
  • What needs special review
  • Dependencies, breaking changes, and deployment notes
  • Labels applied
  • PR linked to Shortcut
  • Unit tests added (Backend)
  • Tested locally
  • Documentation updated (if required)
  • Environment variable additions/changes documented (if required)

@cachafla cachafla requested a review from nibalizer December 5, 2025 00:12
@cachafla cachafla added the internal Not to be externalized in the release notes label Dec 5, 2025
github-actions bot commented Dec 5, 2025

PR Summary

This PR introduces significant enhancements to the token estimation and summary truncation logic within the project. The changes include:

  1. Implementation of a character-based token estimation function (_estimate_tokens_simple) and a corresponding text truncation function (_truncate_text_simple) that are used as a fallback when the tiktoken library is unavailable.

  2. Modification of the _truncate_summary function to dynamically choose between using tiktoken for accurate token counting and falling back to the character-based methods. This ensures that summary truncation works reliably in different runtime environments.

  3. Addition of comprehensive unit tests in tests/test_test_descriptions.py that validate both the tiktoken and fallback code paths. These tests cover scenarios such as:

    • Token estimation for texts of varying lengths.
    • Proper truncation behavior for both short and excessively long texts.
    • Correct selection of the code path based on the availability of the tiktoken module using patching techniques.
  4. Minor version updates in configuration files to reflect the new release version.

Overall, these changes enhance the robustness of the module by ensuring that summary truncation is both accurate (using tiktoken when possible) and resilient (with a reliable fallback).
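The path-selection testing described in point 3 can be illustrated with a self-contained sketch; all names here are illustrative stand-ins, while the real tests in tests/test_test_descriptions.py patch the actual module flags:

```python
from types import SimpleNamespace
from unittest.mock import patch

# Stand-in for the real module's cached flag; the actual tests patch
# validmind.ai.test_descriptions._TIKTOKEN_AVAILABLE instead.
mod = SimpleNamespace(_TIKTOKEN_AVAILABLE=True)

def truncate_summary(summary, max_tokens=100):
    if mod._TIKTOKEN_AVAILABLE:
        # The real code would encode with the cached tiktoken encoding here.
        return summary  # placeholder for the accurate path
    return summary[: max_tokens * 4]  # character-based fallback

# Force the fallback path regardless of whether tiktoken is installed.
with patch.object(mod, "_TIKTOKEN_AVAILABLE", False):
    out = truncate_summary("x" * 10_000)

print(len(out))  # 400
```

Patching the module-level flag (rather than mocking the import machinery) keeps the tests deterministic and lets both branches run in the same environment.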

Test Suggestions

  • Test with multi-byte or Unicode characters to ensure the character-based estimation remains consistent.
  • Add edge case tests where the summary length is just around the max_tokens threshold to verify boundaries.
  • Include tests that simulate failures in tiktoken functions (e.g., encoding/decoding errors) to further validate fallback behavior.
  • Run performance benchmarks for long text inputs to ensure the fallback method scales well.
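The boundary-condition suggestion above could be sketched as follows, using the ~4 characters-per-token heuristic; the helper names are stand-ins, not the module's real API:

```python
# Hypothetical boundary test around the max_tokens threshold.
def estimate_tokens(text: str) -> int:
    return len(text) // 4  # ~4 characters per token

def truncate(text: str, max_tokens: int) -> str:
    if estimate_tokens(text) <= max_tokens:
        return text
    return text[: max_tokens * 4]

# Just below, exactly at, and just above the 100-token threshold.
assert truncate("a" * 399, 100) == "a" * 399   # under: untouched
assert truncate("a" * 400, 100) == "a" * 400   # at: untouched
assert len(truncate("a" * 405, 100)) == 400    # over: clipped to 400 chars
```

Note that integer division makes the threshold slightly forgiving: a 401-character string still estimates to 100 tokens, so only inputs past the next multiple of four get clipped.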

@cachafla cachafla merged commit f0edca4 into main Dec 5, 2025
17 checks passed
@cachafla cachafla deleted the cacahfla/tiktoken-fallback branch December 5, 2025 16:36