fixed bug in 2024 dataset + grouping data for sample and era letter#216
Conversation
kandrosov
left a comment
There was a problem hiding this comment.
Do not change the AnaTupleFileListBuilderTask branches definition. Only modify the logic inside AnaTupleFileList.py.
…r_anaTupleFileList
…r_anaTupleFileList
|
@cms-flaf-bot please test
|
|
pipeline#13941443 started |
|
pipeline#13941443 failed |
|
@cms-flaf-bot please test
|
|
@cms-flaf-bot please test
|
|
pipeline#13948509 started |
|
pipeline#13948509 failed |
|
@cms-flaf-bot please test
|
|
pipeline#13949523 started |
|
pipeline#13949523 passed |
|
@cms-flaf-bot please test
|
|
pipeline#13951659 started |
|
pipeline#13951659 failed |
config/Processors/stitching_DY_amcatnlo_Vpt_NpNLO_singleFlavor.yaml
Outdated
Show resolved
Hide resolved
|
@cms-flaf-bot please test
|
|
pipeline#13958487 started |
|
pipeline#13958487 passed |
There was a problem hiding this comment.
Pull request overview
This PR addresses data organization and dataset configuration issues by separating data files by era letter instead of grouping them together. The changes enable proper grouping of datasets like "Muon_2022C" and "Muon_2022D" separately.
Changes:
- Fixed invalid dataset names in Run3_2024 config (removed leading slashes)
- Updated cross-section definitions to support both all-flavors and single-flavor variants
- Modified
fileNamePatternhandling from string to dictionary keyed by nanoAOD version - Added missing type mapping for
int8_tin FuseAnaTuples
Reviewed changes
Copilot reviewed 13 out of 13 changed files in this pull request and generated no comments.
Show a summary per file
| File | Description |
|---|---|
| test/checkDatasetConfigConsistency.py | Updated logic to handle fileNamePattern as dictionary with version keys |
| run_tools/law_customizations.py | Extracted get_nano_version method for reusability |
| config/dataset_exceptions.yaml | Added exception for DYto2L samples in Run3_2024 |
| config/crossSections13p6TeV.yaml | Added _allFlavors/_singleFlavor cross-section variants |
| config/Run3_2024/datasets.yaml | Fixed dataset names, updated cross-sections, added new samples |
| config/Run3_2023/datasets.yaml | Converted fileNamePattern to dict, added allFlavors entries |
| config/Run3_2022EE/datasets.yaml | Similar updates for fileNamePattern and cross-sections |
| config/Run3_2022/datasets.yaml | Similar updates for fileNamePattern and cross-sections |
| config/Processors/stitching_DY_amcatnlo_Vpt_NpNLO_allFlavors.yaml | New stitching config for all-flavors |
| config/Processors/stitching_DY_amcatnlo_Vpt_NpNLO.yaml | Updated to use singleFlavor cross-sections |
| AnaProd/tasks.py | Updated to retrieve fileNamePattern from dict by version |
| AnaProd/FuseAnaTuples.py | Added int8_t type mapping |
This PR is intended to handle the data merge and file list definition.
Now the data are not grouped in one big data file but they are separated by era letter.
e.g.:
and they will be grouped in "Muon_2022C" and "Muon_2022D"