Skip to content

[8.3.2] pscratch raw pruning #103

@davramov

Description

@davramov

Using the handy Prefect Status Monitoring script I created in this PR: #102, I found an error with pruning raw data from pscratch after reconstruction, where it appends the full spot832 path to the base path for the endpoint, rather than the relative path.

Flow run encountered an exception.
TransferAPIError: ('GET', 'https://transfer.api.globus.org/v0.10/operation/endpoint/d40248e6-d874-4f7b-badd-2c06c16f1a58/ls?path=%2Fpscratch%2Fsd%2Fa%2Falsdev%2F8.3.2%2Fraw%2Fglobal%2Fraw%2F_als-13450_chin',
'Bearer', 404, 'ClientError.NotFound',
"Directory '/pscratch/sd/a/alsdev/8.3.2/raw/global/raw/_als-13450_chin' not found on endpoint 'd40248e6-d874-4f7b-badd-2c06c16f1a58'", 'fw3u9O0W4')

I will add this to the schedule_prune() function in orchestration/flows/832/nersc.py to make the correct path:

path = Path(raw_file_path)
folder_name = path.parent.name
file_name = path.name  # includes .h5 extension
pscratch_relative_path = f"{folder_name}/{file_name}"

Which would make the path to prune:
/pscratch/sd/a/alsdev/8.3.2/raw/_als-13450_chin/20251212_014539_T30_50_S.h5

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions