This page contains a Flash digital edition of a book.
70 BIOTECHNOLOGY


GIAB’s NA12878 high confidence regions; and GIAB’s NA12878 VCF file of high confidence variant calls. At runtime, the VCF is filtered down to those positions delineated by the intersected BED file.


Te data is then assembled against the human genome reference sequence using SeqMan NGen running on a standard desktop computer. Fully gapped alignments are analysed in-stream using a modified version of the MAQ variant caller to produce variant and reference call files for each position in the intersected BED file. Key metrics, including the depth of coverage and probability scores, are recorded for each position. Assemblies can be visualised in SeqMan Pro, allowing evidence for variants and regions of low coverage to be assessed.


For accuracy calculations, variant and reference call files are automatically loaded with the


www.scientistlive.com


filtered VCF file into ArrayStar post-assembly. Only positions within the intersected BED are considered. Positions are classified as: true positives (TP), called variants also present in the VCF file; false positives (FP), called variants not in the VCF file; true negatives (TN), called reference bases not in the VCF file; and false negatives (FN), called reference bases that are present in the VCF file. Te counts of each class are then used to calculate various accuracy metrics, including the true positive rate (TPR, “sensitivity”), the true negative rate (TNR, “specificity”) and the false discovery rate (FDR).


Summary report A summary report is produced with the absolute number of positions in each class and the corresponding statistics stratified based on two thresholds: minimum depth of coverage that a position must match or exceed to be considered in the analysis; and


minimum p-value a position must have to be considered a variant. Tree p-values (“Pnotref” of 90.0%, 99.0% and 99.9% corresponding to phred-like scores of 10, 20 and 30, respectively) are employed for each of 12 different depths of coverage cutoffs (ranging from 1 to 100). Te number and percentage of targeted bases meeting each depth cutoff are also presented.


Lessons learned NGS-based genetic tests promise to greatly enhance patient care in this era of personalised medicine. Given their potential impact on treatment decisions, it is critical that those tests be appropriately validated as part of their regulatory approval and as a measure of routine assessment of the test’s performance. DNASTAR’s validation control workflow using NIST/GIAB’s NA12878 reference materials greatly facilitates this process.


For more information ✔ at www.scientistlive.com/eurolab


Screenshot from Lasergene Genomics Suite


Frederick R. Blattner & Tim Durfee are with DNASTAR. www.dnastar.com


Page 1  |  Page 2  |  Page 3  |  Page 4  |  Page 5  |  Page 6  |  Page 7  |  Page 8  |  Page 9  |  Page 10  |  Page 11  |  Page 12  |  Page 13  |  Page 14  |  Page 15  |  Page 16  |  Page 17  |  Page 18  |  Page 19  |  Page 20  |  Page 21  |  Page 22  |  Page 23  |  Page 24  |  Page 25  |  Page 26  |  Page 27  |  Page 28  |  Page 29  |  Page 30  |  Page 31  |  Page 32  |  Page 33  |  Page 34  |  Page 35  |  Page 36  |  Page 37  |  Page 38  |  Page 39  |  Page 40  |  Page 41  |  Page 42  |  Page 43  |  Page 44  |  Page 45  |  Page 46  |  Page 47  |  Page 48  |  Page 49  |  Page 50  |  Page 51  |  Page 52  |  Page 53  |  Page 54  |  Page 55  |  Page 56  |  Page 57  |  Page 58  |  Page 59  |  Page 60  |  Page 61  |  Page 62  |  Page 63  |  Page 64  |  Page 65  |  Page 66  |  Page 67  |  Page 68  |  Page 69  |  Page 70  |  Page 71  |  Page 72  |  Page 73  |  Page 74  |  Page 75  |  Page 76  |  Page 77  |  Page 78  |  Page 79  |  Page 80  |  Page 81  |  Page 82  |  Page 83  |  Page 84  |  Page 85  |  Page 86  |  Page 87  |  Page 88  |  Page 89  |  Page 90  |  Page 91  |  Page 92