Data Download - Error Processing

[Figure: rough pipeline of data processing]

The Issue

Downloaded data can be damaged during transfer. The objective is to identify the damaged files and transfer them again.

Validation

The downloaded files are in .fastq.gz format, so their integrity can be validated by decompressing them. A file that decompresses without error is moved to its final destination. If decompression fails, the file's source URL is automatically added to a list. Once all other data in the same batch has been downloaded, this list is processed again so that the damaged files are re-transferred.
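
The validation step could be sketched roughly as follows. This is only an illustration, not the actual moving script: the function names (`is_valid_gzip`, `sort_downloads`), the file-to-URL mapping, and the retry-list file are all assumptions made for the example. It reads each .fastq.gz file to the end with Python's standard `gzip` module, which fails on truncated or corrupted data, then moves good files and records the URLs of bad ones.

```python
import gzip
import shutil
import zlib
from pathlib import Path

CHUNK_SIZE = 1 << 20  # read 1 MiB at a time so large files are not held in memory


def is_valid_gzip(path):
    """Return True if the file decompresses completely without error."""
    if Path(path).stat().st_size == 0:
        # A zero-byte download is treated as damaged (assumption for this sketch).
        return False
    try:
        with gzip.open(path, "rb") as handle:
            while handle.read(CHUNK_SIZE):
                pass
        return True
    except (OSError, EOFError, zlib.error):
        # OSError covers bad headers and CRC mismatches,
        # EOFError covers truncated files.
        return False


def sort_downloads(url_by_file, final_dir, retry_list):
    """Move valid files to final_dir; write URLs of damaged files to retry_list.

    url_by_file is a hypothetical mapping {downloaded file path: source URL}.
    Returns the list of URLs that need to be downloaded again.
    """
    final_dir = Path(final_dir)
    final_dir.mkdir(parents=True, exist_ok=True)
    failed_urls = []
    for file_path, url in url_by_file.items():
        if is_valid_gzip(file_path):
            shutil.move(str(file_path), str(final_dir / Path(file_path).name))
        else:
            failed_urls.append(url)
    # One URL per line, ready to be fed back to the downloader.
    Path(retry_list).write_text("".join(u + "\n" for u in failed_urls))
    return failed_urls
```

The same check can be done from the shell with `gzip -t file.fastq.gz`, which exits with a non-zero status when the file is damaged. Note that a successful decompression only shows the gzip stream is intact; it does not verify that the FASTQ content itself is well formed.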

[Figure: sample image of the moving script]
