STITCH

Last update: October 2, 2016

Overview of STITCH - more text

STITCH is an R program for reference panel free, read aware, low coverage sequencing genotype imputation. STITCH runs on a set of samples with sequencing reads in BAM format, as well as a list of positions to genotype, and outputs imputed genotypes in VCF format.

STITCH works by modelling each chromosome in the set of samples as a mosaic of K unknown founders or ancestral haplotypes. STITCH employs a hidden Markov model, whose parameters are sequentially updated using expectation maximization. Both steps are handled in a read aware fashion done without using external reference haplotype sets.

STITCH has been tested on low coverage mouse and human data. STTICH has been run on mouse genotype-by-sequencing (GBS) data, although it may encounter issues when the number of reads becomes close to 0. For guidance on parameter options (like K), please see the supplementary note in Davies et al. given below

License

STITCH is free for academic use. For commercial inquiries please contact Robert Davies (robertwilliamdavies at gmail dot com) and Simon Myers (myers at stats dot ox dot ac dot uk)

Changelog

Downloads

Complete working example

  1. Install R if not already installed. Install R dependencies parallel, Rcpp and RcppArmadillo from CRAN (using the "install.packages" option within R), and Rsamtools from BioConductor
  2. Download STITCH from above. Install by opening R and using install.packages, giving install.packages the path to the downloaded STITCH tar.gz file above
  3. Download example tar.gz from above and open using a command such as tar -xzvf
  4. Run STITCH. Open R, change your working directory using setwd() to the directory where the example tar.gz was unzipped, and then run STITCH(tempdir = tempdir(), chr = "chr19", bamlist = "bamlist.txt", posfile = "pos.txt", genfile = "gen.txt", outputdir = paste0(getwd(), "/"), K = 4, nGen = 100, nCores = 1). Once complete, a VCF should appear in the current working directory named stitch.chr19.vcf.gz

Contact

If you have any problems running STITCH please contact Robert Davies (robertwilliamdavies at gmail dot com)

Citation

If you use STITCH, please cite Davies, R. W., Flint J, Myers S., Mott R. Rapid genotype imputation from sequence without reference panels. Nat. Genet. 48, 965-969 (2016)