Refactor code structure for improved readability and maintainability #2
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
This pull request updates the documentation and dependencies to switch the RTF file handler from
pyrtf-ngtostriprtfacross the project. It also improves error handling for the DOC handler by detecting missing LibreOffice dependencies when usingantiword. These changes ensure more reliable RTF extraction and clearer installation instructions.Dependency and Documentation Updates:
pyrtf-ngtostriprtfinpyproject.toml, and updated all related documentation files (docs/api.md,docs/index.md,docs/installation.md,docs/usage.md) to reflect this new requirement. [1] [2] [3] [4] [5]textxtract/core/registry.pyto referencestriprtfinstead ofpyrtf-ngwhen the RTF handler is not installed.Error Handling Improvements:
textxtract/handlers/doc.pyto detect missing LibreOffice dependencies when usingantiword, and raise aFileNotFoundErrorto trigger fallback extraction methods.Minor Codebase Cleanup:
Pathintextxtract/core/registry.py.Python Version Documentation:
docs/installation.mdto include Python 3.13.