Department of Mathematics
 Search | Help | Login | pdf version | printable version

Math @ Duke



Publications [#359358] of Marc D. Ryser

Papers Published

  1. Murgas, KA; Ma, Y; Shahidi, LK; Mukherjee, S; Allen, AS; Shibata, D; Ryser, MD, A Bayesian hierarchical model to estimate DNA methylation conservation in colorectal tumors., Bioinformatics, vol. 38 no. 1 (December, 2021), pp. 22-29 [doi]
    (last updated on 2023/03/23)

    MOTIVATION: Conservation is broadly used to identify biologically important (epi)genomic regions. In the case of tumor growth, preferential conservation of DNA methylation can be used to identify areas of particular functional importance to the tumor. However, reliable assessment of methylation conservation based on multiple tissue samples per patient requires the decomposition of methylation variation at multiple levels. RESULTS: We developed a Bayesian hierarchical model that allows for variance decomposition of methylation on three levels: between-patient normal tissue variation, between-patient tumor-effect variation and within-patient tumor variation. We then defined a model-based conservation score to identify loci of reduced within-tumor methylation variation relative to between-patient variation. We fit the model to multi-sample methylation array data from 21 colorectal cancer (CRC) patients using a Monte Carlo Markov Chain algorithm (Stan). Sets of genes implicated in CRC tumorigenesis exhibited preferential conservation, demonstrating the model's ability to identify functionally relevant genes based on methylation conservation. A pathway analysis of preferentially conserved genes implicated several CRC relevant pathways and pathways related to neoantigen presentation and immune evasion. Our findings suggest that preferential methylation conservation may be used to identify novel gene targets that are not consistently mutated in CRC. The flexible structure makes the model amenable to the analysis of more complex multi-sample data structures. AVAILABILITY AND IMPLEMENTATION: The data underlying this article are available in the NCBI GEO Database, under accession code GSE166212. The R analysis code is available at SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
ph: 919.660.2800
fax: 919.660.2821

Mathematics Department
Duke University, Box 90320
Durham, NC 27708-0320