Skip to content

davidecaroselli/qshuf

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

qshuf - Quick Shuffle

qshuf is a fast and memory-efficient command-line tool for shuffling very large text files. It uses memory mapping to minimize RAM usage, making it ideal for AI, machine learning, and data processing tasks that require randomized datasets.

Build from Source

To build qshuf, compile it using cmake:

mkdir build && cd build
cmake ..
make

Examples

Shuffle a large file and print to stdout:

qshuf data.txt

Shuffle and save output to a file:

qshuf data.txt -o shuffled.txt

Use a specific random seed:

qshuf data.txt -s 42 > shuffled.txt

License

This project is licensed under the MIT License. See LICENSE for details.

Author

Created by Davide Caroselli. Contributions welcome!

About

A fast, memory-mapped, tool to shuffle text file lines.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •