Releases: seqcode/seqchromloader
Releases · seqcode/seqchromloader
v0.6.8
v0.5.3 bug fix: wds braceexpand style filename
Full Changelog: 0.5.2...0.5.3
v0.5.2 Add get_genome_size and random_coords functions
Add get_genome_size and random_coords functions, also the corresponding test functions
Full Changelog: 0.5.1...0.5.2
v0.5.1
v0.5.0: Utility functions for prepraring training dataset
Add several utility functions to help easily prepare the training dataset:
- filter_chromosomes
- make_random_shift
- make_flank
- chop_genome
- dna2OneHot
- rev_comp
0.4.0: function to transform existing webdataset file, param to create DALI compatible webdataset
- add a new function
convert_data_webdatasetto transform an existing webdataset file - add
DALIoption todump_data_webdatasetto create NVIDIA DALI compatible webdataset for loading
Full Changelog: 0.3.0...0.4.0
0.3.0: add SeqChromDatasetByDataFrame function
New SeqChromDatasetByDataFrame function to load tensors given a region dataframe
What's Changed
New Contributors
Full Changelog: 0.2.4...0.3.0
0.2.4: SeqchromDatasetByWds accepts device rank info, bug fix
- Now SeqchromDatasetByWds can use device rank info to split files
- Fix the bug occurred when there are more workers than # wds files
- Allow empty Bigwig file list as input
Full Changelog: 0.2.3...0.2.4
Performance issue quick fix
Fix the performance issue due to each worker of webdataset loader uncompressed all files instead of uncompressing only the files assigned
Full Changelog: 0.2.2...0.2.3
Several bug fix: Duplicate webdataset sample index, default dataloader kwargs value
Fixes:
- Add unique id to each sample to avoid duplicate sample name in webdataset
- Change default dataloader kwargs to
{} - Initialize Exception class before calling
__init__()
Full Changelog: 0.2.1...0.2.2