BioConductor
doppelgangR 1.8.0
Identify likely duplicate samples from genomic or meta-data
Released Apr 25, 2017 by Levi Waldron
This package cannot yet be used with Renjin because it depends on other packages that are not available: sva 3.28.0
Dependencies
sva 3.28.0 Biobase 2.40.0 digest 0.6.16 impute 1.54.0 mnormt 1.5-5 BiocParallel 1.14.2
The main function is doppelgangR(), which takes as minimal input a list of ExpressionSet objects and searches every pair of list elements for duplicated samples. The search is based on the genomic data (exprs(eset)), the phenotype/clinical data (pData(eset)), and "smoking guns", which are supposedly unique identifiers found in pData(eset).
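The idea of comparing genomic profiles across every pair of datasets can be illustrated with a minimal sketch. It is written in Python for self-containment and is not the package's implementation: the function names (pearson, find_candidate_duplicates), the fixed correlation threshold of 0.99, and the toy data are all assumptions made for illustration. It covers only the expression-based comparison, not the phenotype or "smoking gun" checks.

```python
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant profile carries no correlation signal
    return cov / sqrt(var_x * var_y)


def find_candidate_duplicates(datasets, threshold=0.99):
    """Flag cross-dataset sample pairs with highly correlated profiles.

    datasets: dict mapping dataset name -> dict of sample id -> list of
    expression values, with genes in the same order everywhere.
    threshold: hypothetical fixed cutoff chosen for this sketch only.
    """
    hits = []
    # Visit every unordered pair of datasets exactly once.
    for (name_a, samples_a), (name_b, samples_b) in combinations(datasets.items(), 2):
        for id_a, profile_a in samples_a.items():
            for id_b, profile_b in samples_b.items():
                r = pearson(profile_a, profile_b)
                if r >= threshold:
                    hits.append((name_a, id_a, name_b, id_b, r))
    return hits


# Toy data (invented): sample "t1" is a slightly perturbed copy of "s1".
datasets = {
    "study1": {"s1": [1.0, 5.2, 3.1, 8.4, 2.2], "s2": [7.1, 1.3, 4.4, 0.9, 6.0]},
    "study2": {"t1": [1.1, 5.0, 3.2, 8.5, 2.1], "t2": [3.0, 3.5, 9.1, 2.2, 4.8]},
}

hits = find_candidate_duplicates(datasets)
for name_a, id_a, name_b, id_b, r in hits:
    print(f"{name_a}:{id_a} ~ {name_b}:{id_b} (r = {r:.3f})")
```

A fixed cutoff is the simplest possible rule and is used here only to keep the sketch short; on real expression data a suitable cutoff would depend on the platform and on any batch effects between studies.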