CSE 590C, Sp '18: Reading & Research in Comp. Bio.

Date | Presenters/Participants | Topic | Details
03/26 | | ---- Organizational Meeting ---- |
04/02 | Johannes | Deep Learning of millions of random Alternative Polyadenylation variants | Details
04/09 | Xiaojie Qiu | Inferring developmental trajectories and causal regulations with single-cell genomics |
04/16 | Jacob | Multi-scale Deep Tensor Factorization Learns a Latent Representation of the Human Epigenome |
04/23 | | No Meeting |
04/30 | Alex + Ayse | Hypoxemia + DeepProfile | Details
05/07 | Erin + Yue | Two Short Talks on Single-Cell RNA-seq | Details
05/14 | Daniel | Building probabilistic models of RNA-seq experiments using approximate likelihood |
05/21 | Dr. Simon Kahan, Biocellion/Dr. Ilya Shmulevich, ISB | Biocellion: high-performance software for modeling, simulation and visualization of many-cell systems | Details
05/28 | | Holiday |

Abstract (04/02, Johannes): Alternative polyadenylation (APA) is a major driver of transcriptome diversity in human cells. Here, we use deep learning to predict APA from DNA sequence alone. We trained our model (APARENT, APA REgression NeT) on isoform expression data from over three million APA reporters, built by inserting random sequence into twelve distinct 3' UTR contexts. Predictions are highly accurate across both synthetic and genomic contexts; when tasked with inferring APA in human 3' UTRs, APARENT outperforms a model trained exclusively on endogenous data. Visualizing features learned across all network layers reveals that APARENT recognizes sequence motifs known to recruit APA regulators, discovers previously unknown sequence determinants of cleavage site selection, and integrates these features into a comprehensive, interpretable cis-regulatory code.

For background reading, Johannes recommends:

DeepBind (one of the first applications of deep learning to sequencing data and a good introduction to neural nets): https://www.nature.com/articles/nbt.3300
A nice article by Google on visualizing neural nets (the focus of the talk): https://distill.pub/2017/feature-visualization/

Abstract (05/21, Biocellion): For decades, 3D models have been reducing cost, accelerating progress and improving results in the automotive, aerospace, architecture and petroleum industries. Despite the continued failure of in vitro and animal testing to reliably demonstrate efficacy and establish safety of drug and consumer care products, the life science industries are only just beginning to embrace whole-system 3D modeling and simulation as an alternative.

Why? Because modeling complex living systems is hard; simulating these models at sufficient scale and duration demands purpose-built high-performance software; and interactive visualization of the highly dynamic simulation results poses new challenges for graphics engines.

We present the Biocellion and Biovision software solutions. Biocellion is a platform that supports development of living system models at cell resolution, integrating biological, chemical and mechanical rules of interaction. Biocellion simulates these models as they grow to tens of billions of cells. Biovision provides interactive exploration of the simulation results over time.

We illustrate results from the application of Biocellion at P&G to skin growth and response to toxic materials. We also show images from Pacific Northwest National Laboratory comparing simulations of intestinal response to a low- versus high-fiber diet.

Though only recently developed, our models can already recapitulate many aspects of tissue growth, homeostasis and response to some interventions. Using Biocellion, they can be incrementally extended and improved to become increasingly predictive under an ever-broadening spectrum of interventions.


	Computer Science & Engineering University of Washington Box 352350 Seattle, WA 98195-2350 (206) 543-1695 voice, (206) 543-2969 FAX
