BioConductor

DupChecker 1.18.0

a package for checking high-throughput genomic data redundancy in meta-analysis

Released Oct 7, 2014 by "Quanhu SHENG"

This package cannot yet be used with Renjin it depends on other packages which are not available: R.utils 2.6.0

Dependencies

R.utils 2.6.0 RCurl

Meta-analysis has become a popular approach for high-throughput genomic data analysis because it often can significantly increase power to detect biological signals or patterns in datasets. However, when using public-available databases for meta-analysis, duplication of samples is an often encountered problem, especially for gene expression data. Not removing duplicates would make study results questionable. We developed a Bioconductor package DupChecker that efficiently identifies duplicated samples by generating MD5 fingerprints for raw data.

Source

R

Release History