Repository logo

Bayesian Methods Under Unknown Prior Distributions with Applications to The Analysis of Gene Expression Data

dc.contributor.authorRahal, Abbas
dc.contributor.supervisorBickel, David
dc.date.accessioned2021-07-14T19:03:49Z
dc.date.available2021-07-14T19:03:49Z
dc.date.issued2021-07-14en_US
dc.description.abstractThe local false discovery rate (LFDR) is one of many existing statistical methods that analyze multiple hypothesis testing. As a Bayesian quantity, the LFDR is based on the prior probability of the null hypothesis and a mixture distribution of null and non-null hypothesis. In practice, the LFDR is unknown and needs to be estimated. The empirical Bayes approach can be used to estimate that mixture distribution. Empirical Bayes does not require complete information about the prior and hyper prior distributions as in hierarchical Bayes. When we do not have enough information at the prior level, and instead of placing a distribution at the hyper prior level in the hierarchical Bayes model, empirical Bayes estimates the prior parameters using the data via, often, the marginal distribution. In this research, we developed new Bayesian methods under unknown prior distribution. A set of adequate prior distributions maybe defined using Bayesian model checking by setting a threshold on the posterior predictive p-value, prior predictive p-value, calibrated p-value, Bayes factor, or integrated likelihood. We derive a set of adequate posterior distributions from that set. In order to obtain a single posterior distribution instead of a set of adequate posterior distributions, we used a blended distribution, which minimizes the relative entropy of a set of adequate prior (or posterior) distributions to a "benchmark" prior (or posterior) distribution. We present two approaches to generate a blended posterior distribution, namely, updating-before-blending and blending-before-updating. The blended posterior distribution can be used to estimate the LFDR by considering the nonlocal false discovery rate as a benchmark and the different LFDR estimators as an adequate set. The likelihood ratio can often be misleading in multiple testing, unless it is supplemented by adjusted p-values or posterior probabilities based on sufficiently strong prior distributions. In case of unknown prior distributions, they can be estimated by empirical Bayes methods or blended distributions. We propose a general framework for applying the laws of likelihood to problems involving multiple hypotheses by bringing together multiple statistical models. We have applied the proposed framework to data sets from genomics, COVID-19 and other data.en_US
dc.identifier.urihttp://hdl.handle.net/10393/42408
dc.identifier.urihttp://dx.doi.org/10.20381/ruor-26628
dc.language.isoenen_US
dc.publisherUniversité d'Ottawa / University of Ottawaen_US
dc.subjectRobust Bayesian statisticsen_US
dc.subjectImprecise probabilityen_US
dc.subjectBayesian model checkingen_US
dc.subjectBlended inferenceen_US
dc.subjectPosterior predictive p-valueen_US
dc.subjectLocal false discovery rateen_US
dc.subjectEmpirical Bayesen_US
dc.subjectMultiple testingen_US
dc.subjectBayesian false discovery rateen_US
dc.subjectMeasure of evidenceen_US
dc.subjectDirect likelihood inferenceen_US
dc.subjectLikelihoodismen_US
dc.titleBayesian Methods Under Unknown Prior Distributions with Applications to The Analysis of Gene Expression Dataen_US
dc.typeThesisen_US
thesis.degree.disciplineSciences / Scienceen_US
thesis.degree.levelDoctoralen_US
thesis.degree.namePhDen_US
uottawa.departmentMathématiques et statistique / Mathematics and Statisticsen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail ImageThumbnail Image
Name:
Rahal_Abbas_2021_thesis.pdf
Size:
7.38 MB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail ImageThumbnail Image
Name:
license.txt
Size:
6.65 KB
Format:
Item-specific license agreed upon to submission
Description: