download in pdf format
Status: in progress
Motivation
- Have an idea of how long is it going to take
- Interruption does not mean corruption
- Faster downloading
aria2c
aria2c -x 16 -s 16 -c https://server/file.fastq.gz
parallel -j 4 aria2c -c {} :::: urls.txt
for sra
prefetch --max-size 100G SRR123456
fasterq-dump --threads 8 SRR123456
Simple examples and syntax
Understand how big a folder is
du -sh /path/to/folder
Dry run to understand what gets copied
rsync -avhn --stats source/ destination/
-afor sensible defaults (permissions, recursivity etc.)-vfor verbose-hfor human readible file sizes-nfor not run
Copy from source to destination displaying progress bar
rsync -ah --info=progress2 source/ destination/
Check all get copied by checking checksum
rsync -a --checksum --dry-run source/ destination/
Delete the source if needed
rm -r source/