
Genomic Data Release

Submitted by Sheeba on


Pre-Aggregation QC pipeline

Version 1

This pipeline tests each WGS sample independently. A sample that fails is excluded from the aggregation and is neither released nor reported in this document. The tests detect sample swaps, cross-individual contamination, and sample preparation or sequencing errors. The specific QC processes are listed in the table below:

Aggregation Pipeline

Version 1

Samples that PASS the pre-aggregation QC pipeline (per-sample QC) are aggregated into one multi-sample VCF file. Samples are aggregated using the DRAGEN Iterative gVCF Genotyper. To accelerate the aggregation and allow the downstream analyses to run in parallel, the genome is split into 100 shards.
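The text above does not say how shard boundaries are chosen. As a minimal sketch, assuming shards are contiguous, near-equal slices of the contigs laid end to end, the split might look like the following. The function name `split_into_shards`, the toy contig names, and their lengths are hypothetical and are not taken from the DRAGEN tooling.

```python
from typing import Dict, List, Tuple

# A region is (contig, 1-based start, inclusive end).
Region = Tuple[str, int, int]


def split_into_shards(contig_lengths: Dict[str, int], n_shards: int) -> List[List[Region]]:
    """Partition the contigs, taken in dict order, into n_shards near-equal slices.

    Assumption: a shard may span a contig boundary, in which case it holds
    one region per contig it touches.
    """
    total = sum(contig_lengths.values())
    if not 0 < n_shards <= total:
        raise ValueError("n_shards must be between 1 and the genome length")

    # Genome-wide 0-based cut points; shard i covers [bounds[i], bounds[i + 1]).
    bounds = [i * total // n_shards for i in range(n_shards + 1)]
    shards: List[List[Region]] = [[] for _ in range(n_shards)]

    offset = 0  # genome-wide position where the current contig starts
    for contig, length in contig_lengths.items():
        for i in range(n_shards):
            lo = max(bounds[i], offset)
            hi = min(bounds[i + 1], offset + length)
            if lo < hi:
                # Convert back to 1-based, inclusive contig coordinates.
                shards[i].append((contig, lo - offset + 1, hi - offset))
        offset += length
    return shards


# Toy genome (hypothetical contigs), split into the 100 shards mentioned above.
toy_genome = {"chrA": 1000, "chrB": 500, "chrC": 250}
for shard_id, regions in enumerate(split_into_shards(toy_genome, 100)[:3]):
    print(shard_id, regions)
```

Each shard can then be genotyped independently, which is what makes the parallel processing possible.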

The output of this pipeline is a multi-sample VCF. One multi-sample VCF is generated per shard, in addition to the global multi-sample VCF (all shards concatenated).
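To illustrate what "all shards concatenated" means, here is a minimal sketch. It assumes the per-shard VCFs are uncompressed text, share an identical header and sample order, and are supplied in genomic order. The function name `concat_shard_vcfs` and the example records are hypothetical. In practice a dedicated tool such as `bcftools concat` would normally do this job; the pipeline's actual method is not stated here.

```python
from typing import Iterable, List


def concat_shard_vcfs(shard_texts: Iterable[str]) -> str:
    """Concatenate per-shard VCF texts into one VCF text.

    Header lines (starting with '#') are kept from the first shard only;
    data records are appended from every shard in the order given.
    """
    out: List[str] = []
    for index, text in enumerate(shard_texts):
        for line in text.splitlines():
            if line.startswith("#"):
                if index == 0:
                    out.append(line)
            elif line:  # skip blank lines
                out.append(line)
    return "\n".join(out) + "\n"


# Hypothetical two-sample shards, each with one record.
header = (
    "##fileformat=VCFv4.2\n"
    "#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO\tFORMAT\tS1\tS2\n"
)
shard_1 = header + "chr1\t100\t.\tA\tG\t.\tPASS\t.\tGT\t0/1\t0/0\n"
shard_2 = header + "chr1\t900\t.\tC\tT\t.\tPASS\t.\tGT\t0/0\t1/1\n"

print(concat_shard_vcfs([shard_1, shard_2]), end="")
```

Because the shards partition the genome without overlap, no records need to be merged or deduplicated; the global file is the shard files placed end to end under a single header.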
 
