@Chain-Frost
Owner

Summary

  • account for header rows when determining if a workbook exceeds Excel size limits
  • export all oversized worksheets to Parquet before beginning any CSV writes
  • keep Parquet dependency failures non-fatal while informing the user
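As a rough illustration of the first bullet, here is a minimal sketch of a size check that counts header rows. It assumes the documented Excel worksheet limits of 1,048,576 rows and 16,384 columns; `exceeds_excel_limits` and its parameters are hypothetical names for illustration, not the functions in `misc_functions.py`.

```python
# Documented per-worksheet limits for the .xlsx format.
EXCEL_MAX_ROWS = 1_048_576
EXCEL_MAX_COLS = 16_384


def exceeds_excel_limits(
    data_rows: int,
    data_cols: int,
    header_rows: int = 1,
    index_cols: int = 0,
) -> bool:
    """Return True if the sheet would not fit once headers are written.

    Hypothetical helper: header rows and any index columns occupy cells
    too, so they are added before comparing against the limits.
    """
    total_rows = data_rows + header_rows
    total_cols = data_cols + index_cols
    return total_rows > EXCEL_MAX_ROWS or total_cols > EXCEL_MAX_COLS


# A frame with exactly 1,048,576 data rows fits only if no header is written.
print(exceeds_excel_limits(1_048_576, 10))                  # header pushes it over
print(exceeds_excel_limits(1_048_576, 10, header_rows=0))   # fits without a header
```

The point of the sketch is only that comparing the raw row count against the limit is off by the number of header rows.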

Testing

  • python -m black ryan_library/functions/misc_functions.py

Codex Task

@github-actions

github-actions bot commented Nov 7, 2025

@codex review

@chatgpt-codex-connector

To use Codex here, create a Codex account and connect to GitHub.

@Githubcopilot111 Githubcopilot111 merged commit 2d03e80 into main Nov 7, 2025
2 checks passed

@chatgpt-codex-connector chatgpt-codex-connector bot left a comment

💡 Codex Review

Here are some automated review suggestions for this pull request.

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

  • Open a pull request for review
  • Mark a draft as ready
  • Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

Comment on lines +262 to +266
    for df, sheet in zip(dataframes, sheets):
        sanitized_sheet: str = self._sanitize_name(sheet)
        base_filename: str = f"{datetime_string}_{file_name}_{sanitized_sheet}"

        parquet_path: Path = self._build_output_path(

P1: Prevent overwriting when sanitized sheet names collide

Output filenames are derived from self._sanitize_name(sheet) and then reused for both Parquet and CSV exports, but there is no guarantee that the sanitized names are unique. If two sheets differ only by characters that _sanitize_name strips (e.g. "Sales#1" vs "Sales@1"), both exports target the same base_filename, so the later write silently replaces the earlier files and those data are lost. Consider de‑duplicating sanitized names or appending a counter when a collision occurs before writing.
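One possible shape for the counter approach is sketched below. This is a sketch under assumptions, not the repository's code: `sanitize_name` is a hypothetical stand-in for `_sanitize_name` (here it replaces unsafe characters with `_`), and `unique_sanitized_names` is an illustrative helper name. Comparison is case-insensitive on the assumption that output may land on a case-insensitive file system.

```python
import re


def sanitize_name(name: str) -> str:
    """Hypothetical stand-in for _sanitize_name: replace unsafe characters."""
    return re.sub(r"[^A-Za-z0-9_-]+", "_", name).strip("_") or "sheet"


def unique_sanitized_names(sheets: list[str]) -> list[str]:
    """Sanitize each sheet name and append a counter on collision."""
    used: set[str] = set()  # lower-cased names already handed out
    result: list[str] = []
    for sheet in sheets:
        base = sanitize_name(sheet)
        candidate = base
        counter = 2
        # Keep bumping the suffix until the name is unused.
        while candidate.lower() in used:
            candidate = f"{base}_{counter}"
            counter += 1
        used.add(candidate.lower())
        result.append(candidate)
    return result


names = unique_sanitized_names(["Sales#1", "Sales@1", "Other"])
print(names)  # ['Sales_1', 'Sales_1_2', 'Other']
```

Computing the unique names once, before the export loop, would let the Parquet and CSV writes for a given sheet share the same de-duplicated `base_filename`.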
