Representation learning of the epigenome

May 14, 2025 @ 12:00 am to 12:00 am

Nathan Sheffield, University of Virginia

501 Wartik Lab
University Park

Event Website

Video Conferencing Link

Abstract:
Assays of the human epigenome capture the regulatory state of cells in health and disease. With tens of thousands of experiments completed, it is now feasible to train large-scale representation models. In this talk, I will share how we approach this challenge by first abstracting all epigenome data into genomic intervals. I will describe several types of neural embedding models we use to investigate what biological questions can be addressed with epigenome region embeddings. I will also describe our efforts to curate, standardize, annotate, and share genomic interval data to make it more broadly useful for machine learning and beyond.

Contact

Donna McMinn
dlp18@psu.edu