Note: Library files for some Axiom™ arrays may use alternative names for the Step1 and Step2 .xml
files:
·apt-probeset-genotype.AxiomGT1.apt2.xml
·apt-axiom-genotype.AxiomGT1.apt2.xml
Introduction
The success of a genome-wide association study (GWAS) in finding or confirming the association
between an allele and disease and traits in human, plant, and animal genomes is greatly influenced by
proper study design and the data analysis workflow, including the use of quality control (QC) checks
for genotyping data. Although the number of replicated allele/complex disease associations that are
discovered through human GWAS has been steadily increasing, most variants that are detected to
date have small eects, and very large sample sizes have been required to identify and validate these
findings (Manolio & Collins, 2009; de Bakker, et. al. 2008; Baker, 2010). As a result, even small sources
of systematic or random error can cause false positive results or obscure real eects. This reinforces
the need for careful attention to study design and data quality (Laurie, et. al. 2010). In addition, most
genotyping methods assume three genotype clusters (AA, AB, BB) for two alleles. This assumption
does not always hold, especially in plant and animal studies, due to the existence of subpopulation
genome structural variation and/or auto-polyploid genomes.
This guide presents the Best Practices Genotyping Analysis Workflow to address these challenges,
along with instructions for using Axiom™ software for all Axiom™ Genotyping Arrays (human,
plant, and animal). The Axiom™ Genotyping Solution produces calls for both SNPs and indels
(insertions/deletions). For simplicity, in this document, the term SNPs refers to both SNPs and indels.
Additional chapters in the document include:
•Chapter 2, “Background” provides information that is needed for understanding the remainder of
the document.
•Chapter 3, “Best Practices Genotyping Analysis Workflow” discusses the required eight steps for
producing high quality and appropriate genotypes for downstream statistical analysis and guidance
on interpreting SNP cluster plots. Instructions for executing the steps and visualizing SNP cluster
plots are provided in Chapters 7, 8, and 9.
•Chapter 4, “Additional functions and genotyping methods” discusses methods for changing
genotype calls and advanced methods for genotyping more than three genotype clusters.
•Chapter 5, “Additional sample and plate QC” discusses QC considerations for samples, and plates
that are in addition to those in the required Best Practices steps (Chapter 3, “Best Practices
Genotyping Analysis Workflow”).
•Chapter6, “SNP QC metrics and classification” describes metrics that are used in the
Best Practices workflow (Chapter 3, “Best Practices Genotyping Analysis Workflow”) for SNP
classification and additional metrics that are used in the field for SNP QC.
•Chapter 7, “Execute Best Practices steps with Axiom™ Analysis Suite” provides instructions for
executing all Best Practices Steps with Axiom™ Analysis Suite. Instructions for visualizing SNP
cluster plots with the suite are also provided in this chapter.
•Chapter 8, “Executing Best Practices steps with command line software” provides instructions
for executing the Best Practices with APT. Instructions for visualizing SNP cluster plots from the
command line are provided in this chapter.
Chapter1Introduction to Axiom™data analysis
Introduction
1
10 Axiom™ Genotyping Solution Data Analysis User Guide