2

Detecting Multiple Replicating Signals using Adaptive Filtering Procedures

Replicability is a fundamental quality of scientific discoveries: we are interested in those signals that are detectable in different laboratories, different populations, across time etc. Unlike meta-analysis which accounts for experimental …

Causal Inference for Heritable Phenotypic Risk Factors Using Heterogeneous Genetic Instruments

Progenitor cell diversity in the developing mouse neocortex

In the mammalian neocortex, projection neuron types are sequentially generated by the same pool of neural progenitors. How neuron type specification is related to developmental timing remains unclear. To determine whether temporal gene expression in …

Data Denoising and Post-Denoising Corrections in Single Cell RNA Sequencing

Single cell sequencing technologies are transforming biomedical research. However, due to the inherent nature of the data, single cell RNA sequencing analysis poses new computational and statistical challenges. We begin with a survey of a selection …

Statistical inference in two-sample summary-data Mendelian randomization using robust adjusted profile score

Mendelian randomization (MR) is a method of exploiting genetic variation to unbiasedly estimate a causal effect in presence of unmeasured confounding. MR is being widely used in epidemiology and other related areas of population science. In this …

Surface protein imputation from single cell transcriptomes by deep neural networks

While single cell RNA sequencing (scRNA-seq) is invaluable for studying cell populations, cell-surface proteins are often integral markers of cellular function and serve as primary targets for therapeutic intervention. Here we propose a transfer …

Powerful genome-wide design and robust statistical inference in two-sample summary-data Mendelian randomization

We develop an empirical partially Bayes statistical analysis approach where instruments are weighted according to their strength; thus weak instruments bring less variation to the estimator. The estimator is highly efficient with many weak genetic …

Admissibility in Partial Conjunction Testing

Meta-analysis combines results from multiple studies aiming to increase power in finding their common effect. It would typically reject the null hypothesis of no effect if any one of the studies shows strong significance. The partial conjunction null …

Data denoising with transfer learning in single-cell transcriptomics

Single-cell RNA sequencing (scRNA-seq) data are noisy and sparse. Here, we show that transfer learning across datasets remarkably improves data quality. By coupling a deep autoencoder with a Bayesian model, SAVER-X extracts transferable gene−gene …

Two-Sample Instrumental Variable Analyses Using Heterogeneous Samples

Instrumental variable analysis is a widely used method to estimate causal effects in the presence of unmeasured confounding. When the instruments, exposure and outcome are not measured in the same sample, Angrist and Krueger (J. Amer. Statist. Assoc. …