Support algorithms by yannrichet-asnr · Pull Request #40 · Funz/fz

yannrichet-asnr · 2025-10-18T19:25:33Z

This commit adds support for using pandas DataFrames as input_variables,
enabling non-factorial parametric study designs alongside the existing
dict-based factorial (Cartesian product) approach.

Implementation (fz/helpers.py):

Updated generate_variable_combinations() to detect DataFrame input
- DataFrame: each row represents one case (non-factorial design)
- Dict: existing Cartesian product behavior (factorial design)
Added optional pandas import with HAS_PANDAS flag
Enhanced type hints and docstring with examples
Added informative logging when DataFrame detected
Raises TypeError for invalid input types
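
The behavior listed above can be sketched roughly as follows. This is an illustration only, not the code in fz/helpers.py: the logger name, the error message, and the treatment of scalar dict values as single-value lists are assumptions.

```python
import itertools
import logging
from typing import Any, Dict, List, Union

try:
    import pandas as pd
    HAS_PANDAS = True
except ImportError:  # pandas is an optional dependency
    pd = None
    HAS_PANDAS = False

log = logging.getLogger(__name__)  # assumed logger name


def generate_variable_combinations(
    input_variables: Union[Dict[str, Any], "pd.DataFrame"],
) -> List[Dict[str, Any]]:
    """Return one dict of variable values per case."""
    # Non-factorial design: each DataFrame row is exactly one case.
    if HAS_PANDAS and isinstance(input_variables, pd.DataFrame):
        log.info("DataFrame input: %d cases (non-factorial)", len(input_variables))
        return input_variables.to_dict(orient="records")

    # Factorial design: Cartesian product of all value lists.
    if isinstance(input_variables, dict):
        names = list(input_variables)
        # Assumption: a scalar value is treated as a one-element list.
        value_lists = [
            v if isinstance(v, (list, tuple)) else [v]
            for v in input_variables.values()
        ]
        return [dict(zip(names, combo)) for combo in itertools.product(*value_lists)]

    raise TypeError(
        f"input_variables must be a dict or a pandas DataFrame, "
        f"got {type(input_variables).__name__}"
    )


cases = generate_variable_combinations({"x": [1, 2], "y": [3, 4]})
```

With the dict above, `cases` holds four entries, one per combination of `x` and `y`; a two-row DataFrame with the same columns would give two.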

Key features:

Factorial design (dict): Creates ALL combinations (Cartesian product)
- Example: {"x": [1,2], "y": [3,4]} → 4 cases
Non-factorial design (DataFrame): Only specified combinations
- Example: pd.DataFrame({"x":[1,2], "y":[3,4]}) → 2 cases (rows)

Use cases for DataFrames:

Variables with constraints or dependencies
Latin Hypercube Sampling, Sobol sequences
Imported designs from DOE software
Optimization algorithm sample points
Sensitivity analysis (one-at-a-time)
Sparse or adaptive sampling
Any irregular design pattern
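
As an illustration of one of these use cases, a one-at-a-time sensitivity design can be built as a list of rows in which only one variable departs from a baseline at a time. The helper name `one_at_a_time` and the variable values are hypothetical; wrapping the result in `pd.DataFrame(rows)` would give a table with one case per row.

```python
from typing import Any, Dict, List


def one_at_a_time(
    baseline: Dict[str, Any], levels: Dict[str, List[Any]]
) -> List[Dict[str, Any]]:
    """Build rows that vary a single variable away from the baseline."""
    rows = [dict(baseline)]  # first case: the baseline itself
    for name, values in levels.items():
        for value in values:
            if value == baseline[name]:
                continue  # skip duplicates of the baseline case
            row = dict(baseline)
            row[name] = value
            rows.append(row)
    return rows


rows = one_at_a_time(
    baseline={"temp": 100, "pressure": 1.0},
    levels={"temp": [200, 300], "pressure": [2.0]},
)
# pd.DataFrame(rows) would then hold one case per row.
```

This gives 4 cases, whereas the full factorial over temp in {100, 200, 300} and pressure in {1.0, 2.0} would give 6.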

Tests (tests/test_dataframe_input.py):

12 comprehensive tests covering all scenarios
Unit tests for generate_variable_combinations()
Integration tests with fzr()
Tests for DataFrame vs dict behavior comparison
Tests for mixed types, constraints, repeated values
Input validation tests
All 12 tests pass successfully

Documentation:

README.md: New "Input Variables: Factorial vs Non-Factorial Designs" section
- Comparison of dict (factorial) vs DataFrame (non-factorial)
- When to use each approach
- Examples with LHS, constraint-based designs
examples/dataframe_input.md: Comprehensive guide with:
- 7 practical examples (constraints, LHS, Sobol, DOE import, etc.)
- Comparison table
- Tips and best practices
- Common patterns and workflows
Updated Features section to mention both design types
Updated DataFrame I/O description

Backward compatibility:

Existing dict-based code continues to work unchanged
DataFrame support requires pandas (optional dependency)
Graceful handling when pandas not installed

Example usage:

import pandas as pd
from fz import fzr

# Non-factorial: specific combinations only
input_variables = pd.DataFrame({
    "temp": [100, 200, 100, 300],
    "pressure": [1.0, 1.0, 2.0, 1.5]
})
# Creates 4 cases: (100,1.0), (200,1.0), (100,2.0), (300,1.5)

results = fzr(input_file, input_variables, model, calculators)

Copilot

Pull Request Overview

Adds support for pandas DataFrame input to enable non-factorial (row-wise) parametric designs alongside existing dict-based factorial Cartesian product generation.

Extends generate_variable_combinations to accept DataFrames and return one case per row.
Adds comprehensive tests and documentation/examples differentiating factorial dict vs non-factorial DataFrame usage.
Updates README and adds a detailed example guide for DataFrame-driven designs.

Reviewed Changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated 4 comments.

| File | Description |
| --- | --- |
| fz/helpers.py | Implements DataFrame handling in generate_variable_combinations with optional pandas import and logging. |
| tests/test_dataframe_input.py | Adds unit and integration tests for DataFrame vs dict behavior and input validation. |
| examples/dataframe_input.md | New extensive guide on using DataFrames for non-factorial designs with multiple sampling patterns. |
| README.md | Updates feature list and documents factorial vs non-factorial input variable formats with examples. |

Copilot · 2025-10-18T19:26:26Z

fz/helpers.py



-def generate_variable_combinations(input_variables: Dict) -> List[Dict]:
+def generate_variable_combinations(input_variables: Union[Dict, Any]) -> List[Dict]:


The type hint Union[Dict, Any] effectively collapses to Any and advertises acceptance of all types, while the function raises TypeError for non-dict/non-DataFrame inputs. Narrow the annotation to accepted types only, e.g. Union[Dict[str, Any], 'pd.DataFrame'] guarded by a TYPE_CHECKING block or a Protocol to improve static analysis.
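
One way the narrower annotation could look, as a signature-only sketch: the alias name `InputVariables` is hypothetical, and the body is a placeholder rather than the PR's implementation.

```python
from typing import TYPE_CHECKING, Any, Dict, List, Union

if TYPE_CHECKING:
    # Evaluated only by static type checkers, so pandas stays optional at runtime.
    import pandas as pd

# Hypothetical alias: only the two accepted input types are advertised.
InputVariables = Union[Dict[str, Any], "pd.DataFrame"]


def generate_variable_combinations(
    input_variables: InputVariables,
) -> List[Dict[str, Any]]:
    """Signature-only sketch; the body is intentionally a placeholder."""
    raise NotImplementedError("illustrative signature only")
```

Because "pd.DataFrame" is a string forward reference, the module still imports when pandas is not installed.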

Copilot · 2025-10-18T19:26:26Z

fz/helpers.py

+        var_combinations = []
+        for _, row in input_variables.iterrows():
+            var_combinations.append(row.to_dict())


Using iterrows is relatively slow and may coerce dtypes; you can replace this block with var_combinations = input_variables.to_dict(orient='records') for a vectorized, faster conversion that preserves dtypes.

Suggested change

-        var_combinations = []
-        for _, row in input_variables.iterrows():
-            var_combinations.append(row.to_dict())
+        var_combinations = input_variables.to_dict(orient='records')
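
The dtype concern can be seen on a small frame with one integer and one float column. The column names here are made up for illustration. Each row produced by iterrows is a Series with a single dtype, so the integer is upcast to float, while to_dict(orient='records') converts column by column.

```python
import pandas as pd

# Hypothetical design table: one integer column, one float column.
df = pd.DataFrame({"n_steps": [10, 20], "tolerance": [0.5, 0.25]})

# Row-wise: each row is a float64 Series, so n_steps comes back as a float.
via_iterrows = [row.to_dict() for _, row in df.iterrows()]

# Column-aware: n_steps stays an integer.
via_records = df.to_dict(orient="records")

print(type(via_iterrows[0]["n_steps"]).__name__)
print(type(via_records[0]["n_steps"]).__name__)
```

The values compare equal either way (10.0 == 10), but the difference matters if a value is later substituted into an input file where `10.0` and `10` are not interchangeable.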

Copilot · 2025-10-18T19:26:26Z

examples/dataframe_input.md

+| **Example** | `{"x": [1,2], "y": [3,4]}` → 4 cases | `pd.DataFrame({"x":[1,2], "y":[3,4]})` → 2 cases |
+| **Constraints** | Cannot handle constraints | Can handle constraints |
+| **Sampling** | Grid-based | Any sampling method |
+


Each line has a double leading pipe '||', which will render an extra empty column or break the table. Remove one leading pipe per line so the table starts with a single | (e.g. | Aspect | Dict (Factorial) | DataFrame (Non-Factorial) |).

Copilot · 2025-10-18T19:26:26Z

tests/test_dataframe_input.py

+        model = {
+            "formulaprefix": "@",
+            "delim": "{}",
+            "commentline": "#",
+            "output": {
+                "result": "grep 'result:' output.txt | awk '{print $2}'"
+            }
+        }


[nitpick] The same model dict is duplicated across multiple tests (e.g., lines 171–178, 201–208, 257–263). Consider extracting it into a fixture or a class attribute to reduce repetition and ease future changes.
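
A minimal sketch of the class-attribute variant of this suggestion, assuming the tests are grouped in a class. The class name, the `make_model` helper, and the test method are hypothetical; the model dict is the one shown in the diff above.

```python
import copy


class TestDataFrameInput:
    # Single shared definition of the model used by every test in the class.
    MODEL = {
        "formulaprefix": "@",
        "delim": "{}",
        "commentline": "#",
        "output": {
            "result": "grep 'result:' output.txt | awk '{print $2}'"
        },
    }

    def make_model(self):
        """Return a deep copy so one test cannot mutate another test's model."""
        return copy.deepcopy(self.MODEL)

    def test_model_copy_is_independent(self):
        model = self.make_model()
        model["output"]["result"] = "changed"
        # The shared definition is untouched by the edit above.
        assert self.MODEL["output"]["result"].startswith("grep")
```

Returning a deep copy keeps the nested "output" dict from being shared between tests, which a plain class attribute alone would not guarantee.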

Resolves PermissionError on Windows during temporary directory cleanup by restoring the original working directory before the TemporaryDirectory context manager exits.

On Windows, you cannot delete a directory that is the current working directory. The tests were calling os.chdir(tmpdir) and then attempting to clean up the directory when the context exited, causing:
- PermissionError: [WinError 32] The process cannot access the file because it is being used by another process
- PermissionError: [WinError 5] Access is denied

Solution: Wrap test logic in try/finally blocks that save and restore the original working directory, allowing Windows to successfully delete temporary directories during cleanup.

Fixes #40 (Windows CI failure in test_dict_flattening.py)

Co-authored-by: Claude <noreply@anthropic.com>
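
The try/finally pattern described in this commit message can be sketched as below. The helper name `run_in_tempdir` is hypothetical; the actual tests may inline the same logic rather than use a helper.

```python
import os
import tempfile


def run_in_tempdir(work):
    """Run work() with a temporary directory as the current working directory."""
    original_cwd = os.getcwd()
    with tempfile.TemporaryDirectory() as tmpdir:
        try:
            os.chdir(tmpdir)
            return work()
        finally:
            # Leave the directory before TemporaryDirectory tries to delete it;
            # Windows refuses to remove the current working directory.
            os.chdir(original_cwd)


before = os.getcwd()
seen_inside = run_in_tempdir(os.getcwd)
```

The `finally` block runs before the `with` block exits, so the working directory is already restored when cleanup starts, even if `work()` raises.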

Squash-merge of implement-algorithms branch onto current main (v0.9.1).

Resolved conflicts in:
- fz/__init__.py: Added both fzl and fzd exports
- fz/core.py: Merged imports, kept callbacks from main + added fzd functions
- fz/helpers.py: Kept main's format_time (Windows bash moved to shell.py)
- fz/cli.py: Kept fzl list command + added algorithm install subcommands
- fz/shell.py: Kept main's safer regex replacement from #56
- README.md: Listed both fzl and fzd, kept improved env var docs
- tests/test_cli_commands.py: Added new test methods from PR
- Removed CLAUDE.md (moved to claude/ dir on main)
- Removed setup.py (replaced by pyproject.toml on main)

… flake8 errors (undefined variable names in fzd code):
  display_dict→analysis_dict, processed_final_analysis→processed_final_display,
  tmp_display_processed→tmp_analysis_processed, iter_display_processed→iter_analysis_processed
- Remove duplicate 'list' subparser conflicting with fzl's list command
- Remove shadowing local _resolve_calculators_arg (use imported version from helpers.py)
- Update tests for new fzl-style 'list' command (no more 'list models' subcommand)

Copilot AI review requested due to automatic review settings October 18, 2025 19:25

Copilot AI reviewed Oct 18, 2025

yannrichet changed the title ~~Add DataFrame input support for non-factorial parametric designs~~ Support algorithms Oct 23, 2025

yannrichet-asnr force-pushed the implement-algorithms branch from 5077247 to 6b43c56 Compare October 24, 2025 21:28

yannrichet-asnr force-pushed the implement-algorithms branch 2 times, most recently from 3b8c6c5 to 28d619f Compare November 22, 2025 12:58

yannrichet mentioned this pull request Nov 22, 2025

Fix Windows file deletion issue in test_dict_flattening.py #46

Merged

yannrichet-asnr force-pushed the implement-algorithms branch from 877028c to 4819568 Compare February 13, 2026 15:52

yannrichet-asnr merged commit df83b06 into main Feb 13, 2026
33 of 34 checks passed

yannrichet-asnr merged 2 commits into main from implement-algorithms

2 participants


