Integrative network analysis of TCGA data for ovarian cancer

Background:

Tremendous efforts have been made to elucidate the molecular basis of the initiation and progression of ovarian cancer.However, most existing studies have been focused on individual genes or a single type of data, which may lack the power to detect the complex mechanism of cancer of formation by overlooking the interactions of different genetic and epigenetic factors.

These research propose an integrative framework to identify genetic and epigenetic features related to ovarian cancer and to quantify the casual among these features using probabilistic graphical model based on the Cacner Genome Atlas (TCGA) data. Identifies possible important genetic and epigenetic features that are related to complex cancer disease.This research constructed Bayesian Network that has identified some new genetic/epigenetic pathways, which may shed new light into the molecular mechanism of ovarian cancer.

  • First defined a set of seed genes by including 48 candidate tumor suppressor or oncogenes and an additional 20 ovarian cancer related genes reported .
  • Then the seed genes were then fed into stepwise correlation-based selector to identify 271 additional features including 177 genes, 82 copy number variation sites, 11 methylation sites and 1 somatinc mutation.
  • They built a Bayesian network model with a logit link function to quantify the casual relationships among these features and discovered a set of 13 hyb genes including ARID1A, C19orf53 and COLA52.
  • The directed graph revealed many potential genetic pathways, some of which confirmed the existing result .
  • Clustering analysis further suggested four gene cluster, three of which correspond to well-defines cellular process including cell division, tumor invasion and mitochondrial system.
  • In addition, two genes related to glycoprotein synthesis, PSG11 and GALNT10 were found highly predictive for the overall survival time of ovarian cancer patients.

Ovarian cancer, is one of the most malignant gynecologic cancers, is the fifth leading cause of cancer-related deaths among women in the United States. Studies have suggested that there are well-known oncogenes and tumor suppressors including TP53,PIK3C, BRCA1 and BRCA2

Systems Biology approach combines multiple genetic and epigenetic profeines for an integrative analysis provides a new direction to study the regulatory network associated with ovarian cancer.

Data for this research was taken from TCGA project.

The BN approach allows rigorous statistical inference of causality between genetic and epigenetic features. How to combine different types of complex data for casual inference in BN poses a big challange to identify a subset of the most relevant features and to remove irrelevant or redundant features.

Result

They consider four types of molecular data including gene expression, DNA copy number variation, promoter methylation and somatic mutation.

In this paper they assume that cancer phenotype is directly associated with gene expression, which can be potentially driven by genetic and epigenetic changes. They first identify a set of tumor suppressors and oncogenes by differential expression analysis between the cancer and control groups. This set of genes form the set of seed gens. Then these seeds genes are then fed into their proposed stepwise correlation based selector (SCBS) to select other features.

s12918-014-0136-9-2

Figure 2 illustrates the workflow of the proposed framework. They first identify a set of tumor suppressors and oncogenes by differential expression analysis between the cancer and control groups. The first step was to define a set of seed genes out of 12000 genes that have record of expression level and at least one of the three epigenetic factor. This procedure resulted in 48 potential tumor suppressor or oncogenes.

s12918-014-0136-9-3

Figure 3 shows the predicted network that contains 698 edges where the direction indicated the downstream feature is regulated by the upstream one. They found that copy number variation are the major factor that accounts for differential gene expression and this suggest that many amplified genes may act as cancer drivers, confirming finding from a breast cancer study. This network confirmed many previously reported gene-gene interactions.

These results suggested that the proposed pipeline is capable of revealing important genetic pathways that underline the complex cancer phenotype.

They identified 13 nodes with significantly larger outdegrees. These hub genes all have known function and have casual effect on at least seven neighboring genes suggesting that they play important roles in driving corresponding local sub-networks

s12918-014-0136-9-5

 

Figure 5.

13 hub genes that clearly distinguish the cancer samples from the normal samples. This is a multi-dimensional scaling plot based on the correlation dissimilarity. This suggest that 13 hub genes may present the major difference between cancer and normal samples.s12918-014-0136-9-7

For gene cluster, 245 genes were identified to fall into four major clusters corresponding to distinct function by k-means clustering methods.

  • Cluster 1 contains 18 genes, mainly related to cell division, mitosis, spindle formation etc.
  • Cluster 2 contains 23 genes, most of which are functionally related to growth factor, cell shape, cell motility, tumor invasion.
  • Cluster 3 contains 20 genes, mostly related to mitochondrial system, membrane process.
  • Cluster 4 is the largest and most complicated cluster harboring 184 genes. This large cluster communicates between the other three clusters with are independent from each other

These findings could be implicative of some important molecular pathways, which may or may not have been identified, that drive the development of ovarian cancer. Figure 7

The inferred Bayesian network identified two genes, PSG11 and GALNT10, that may be directly associated with the overall survival time of ovarian cancer patients. Both genes are functionally related to glycoprotein synthesis. This indicates the biological pathway related to glycoprotein synthesis may be implicative of death risk of ovarian cancer patients. Several tumor-associated glycoproteins were found on the surface of many cancer cells including ovary, breast, colon and pancreatinc cells and they may play protein roles in early detection of cancers. A well known protein is CA-125 which is the primary protein used to measure serous cancer tumor load. Some certain glycoproteins are closely associated with woman cancers such as ovarian cancer and breast cancer affecting the death risk, chemotherapy resistance and prognosis of ovarian cancer patients.

Discussion 

This research proposed an integrative approach int the Bayeasian network for casual inference between genetic and epigenetic features in complex cancer data.

First showed stepwise correlation-based selection approach is more effective than simple single-round selection method in identifying important features in the genetic/epigenetic pathways. The method they proposed relies on the correlation strength among connected nodes and may fail when the connections are weak.

Second a model was purposed for casual relationship between features of different types in a Bayesian network through a logit link function.

They also found that pathways related to glycoprotein synthesis, hematopoietic and immune systems correlate with the survival rate of ovarian cancer patients. In particular, that two genes related to glycoprotein synthesis, PSG11 and GALNT10 can significantly affect the overall survival time of ovarian cancer patients.

Conclusion 

Understanding the biological mechanism of ovarian cancer has significant practical importance for clinical diagnosed and treatment. In this research they propose a new integrative approach which present two innovations: a stepwise feature selection procedure and a Bayeasian network model that incorporates both continuous and discrete features for casual inference. Clustering analysis suggested four gene clusters corresponding to distinct biological process including cell division, tumor invasion and mitochondrial system. In addition, they found that genes related to glycoprotein synthesis, hematopoietic, immune system could be highly predictive of overall survival time of ovarian cancer patients.

 

The electronic version of this article is the complete one and can be found online at:http://www.biomedcentral.com/1752-0509/8/1338

© 2014 Zhang et al.; licensee BioMed Central.

Prediction of Signaling cross-talks contributing to acquired drug resistance in breast cancer cells by Bayesian statistical modeling.

Background

Targeted cancer therapies are drugs designed to interfere with specific molecules necessary for tumor growth and progression. Traditional cytotoxic chemo therapies usually kill rapidly dividing cells in the body by interfering with cell division. A primary goal of targeted therapies is to fight cancer cells with more precision and potentially fewer side effects.

The success of these inhibitors is limited because of the ability that cancer cells have to acquire drug resistance. Cancer cells may adopt several mechanisms against particular treatment by adjusting the signaling circuitry, activation of alternative pathways and cross-talks among various pathways to overcome the effects of inhibitors.

For example the resistance to Epidermal Growth Factor receptor tyrosine kinase inhibitors may occur to cross-talks among EGFR-mediated pathways and cross-talks with pathways triggered by other receptors. Therefore, targeting signaling cross-talks may have a potential to sensitize cancer cells to particular inhibitors. Many clinical studies have indicated that targeting EGFR could represent a significant contribution to cancer therapy.

Cross-talk among signaling pathways plays an important role in cancer drug resistance. It can occur at various levels. For example cross-talk at mediator level includes the activation of major components of mediator pathways by mutation/deletion of oncogenic driver genes.This resistance occurs when some critical effectors involved in cell survival and proliferation show an altered phenotype caused by other signaling pathways via RTK signaling cross-talk.

The identification and analyses of potential cross-talks among the signaling pathways may provide deeper insights into the mechanism of drug resistance, and can facilitate finding a range of compensatory pathways for overcoming resistance in targeted therapy.

Overview

For this research the authors collected the gene expression values of the ErB2-positive parental SKBR3 cell line and the lapatinib-resitant SKBR3-3 cell line in the presence and absence of laptinib

A gene-pair involved in cross-talk between two particular signaling pathways has high potential of being involved in acquired drug-resistance. The hypothesis was it should have high probability of appearing in the resistant network and low probability in the parental network. Breast cancer cell lines resistant to tamoxifen, a crFigure 1 Schematic Diagramoss-talk mechanism has previously been identified between EGFR and the IGF1R signaling pathway.

The stochastic nature of biological systems can be used to predict edge probabilities by formalizing them into a probabilistic model with other network properties.

Drug-resistance cross-talks can be informative to elucidate the complex mechanisms underlying drug-insensitivity and can help to develop novel therapeutics targeting signaling pathways.

 

Results 

  • Developing the Network

Used to find gene-gene relationship networks

  • Bayesian analysis

Modeling approach was required to identify and analyze characteristics drug-resistance cross-talks between EGFR/ErbB and other signaling pathways.Observed whether gene-pairs overlap with the list of potential cross-talks between EGFR/ErbB signaling and other signaling pathways.

Nodes are genes, and the edges are the cross talks.Note, all the cross-talks here possess posterior probabilities of that cross-talk for appearing in parental network is

The network view of all the cross-talk sets from the analyses of individual pathway sources

 

 

 

 

 

 

  • Netwalker Analyses

Observed the change in GE values for each gene in the identified list of potential cross-talks.Both genes involved in drug-resistant cross-talks should be up-regulated in resistant conditions compared to parental conditions. Which implies that the activation of other compensatory signaling pathways in resistant conditions can play a role in acquired resistance to inhibitors by activating the targeted pathways.

Discussion

This study proposed a computational framework that is able to predict putative cross-talks among signaling pathways that play a role in drug resistance in two breast cancer cell lines.The authors hypothesized that gene-pairs(cross-talks) that can potentially cause drug-resistance can have a high probability of occurring in the resistance conditions but a low probability in parental conditions. In this research they present literature evidence that the identified cross-talks of the compensatory signal pathways with EGFR/ErbB signaling may contribute to drug-resistance by maintaining key cell survival and/or proliferation signals in common down-stream pathways including PI3K/Akt signaling.

Komurov hypothesized that cross-talks between EGFR/ErbB signaling and metabolic pathways contribute to resistance to lapatinib. They identified that glucose deprivation reduces the inhibiting effects of lapatinib by up-regulating constituent genes and thus providing and EGFR/ErbB2 independent mechanism of glucose uptake and cell survival.

The authors believed that their method can be used to find a range of compensatory pathways that nullify/reduce the inhibiting effects of drugs via cross-talk with targeted pathways

 

 

 

 

 

 

 

 

 

AKM Azad1*, Alfons Lawen2 and Jonathan M Keith

Komurov K, Tseng JT, Muller M, Seviour EG, Moss TJ, Yang L, et al. The
glucose-deprivation network counteracts lapatinib-induced toxicity in
resistant ErbB2-positive breast cancer cells. Mol Syst Biol 2012;8(1):.
18. Knowlden JM, Hutcheson IR, Barrow D, Gee JM, Nicholson RI. Insulin-l

What is Systems in Biology ?

Systems Biology is the study of biological components, such as molecules, cells, organisms or entire species. To be able to understand biology at the System level it is necessary to examine the structure and dynamics of cellular and organism function.It can also be called as a computational and mathematical modeling of complex biological systems.According to Ann Neurol “Systems biology and its application to the understanding neurological diseases” Systems Biology is field that employs tools developed in physics and mathematics such as nonlinear dynamics, control theory and modeling of dynamic systems .

The purpose of systematic biology  is to answer questions related to complexity of living systems such as genomics. This is an engineering approach which is now applied to bio-medical and biological scientific research. Properties of systems emerge as central issues, and understanding these properties may have an impact of the future of medicine. Although systems biology is a relative new field, it is expected to revolutionize the understanding of complex biological regulatory systems and to provide major new opportunities for practical application of such knowledge.

The volume of information is overwhelming and the language of communication between the interactive parts of our organism is unknown. In order to better treat disease, we have to understand the principles that govern the design, function and interaction of these systems. Then we can ask what happens when there is a malfunction. Tracking networks of genes and proteins responsible for the processing of the information of cells which involves monitoring hundreds and thousands genes and cells communication channel simultaneously. So first collect the data and then analyze. SB takes a holistic approach.

A system understanding of biological systems can be derived from four different areas.

  1. Systematic structures: Includes the network of gene interactions and biochemical pathways
  2. Systems dynamics : Observes how a system behaves over time under various conditions
  3.  Control method: Mechanism which controls the behavior of the cell and can reduce malfunctions and provide potential therapeutic targets for treatment disease.
  4. Design method: Strategies to modify and construct biological systems having desired properties can be devised based on definite design principles and simulations.

Fig1. Hypothesis-driven research in systems biology.The model represents a computable set of assumptions and hypothesis that need to be tested or supported experimentally\

 Application of Systems Biology

The most common application of SB is to create detailed model of cell regulation focused on particular signal-transduction cascades and molecules to provide system-level insights into mechanism-based discovery.

“Systems biology is the in-between. It’s those new properties that arise when you go from the molecule to the system. It’s different from physiology or holism and it’s different from reductionist things like molecular biology. It’s the in-between.”

–Hans V. Westerhoff, PhD

http://sysbio.med.harvard.edu/welcome-to-the-department-of-systems-biology-hms

Systems Biology researches 

Prof Uri Alon investigates how cells make decision and process information to enable the creation of proteins and his goal is to develop a blue print of a living cell. He developed a tracking system that enables him to monitor in real time the simultaneously expression of multiple genes in living cells. He discovered that the biochemical circuit in the cell is much simpler because it is composed of repeated circuit patterns called network motifs and eventually each network motif performs a specific information process task. His maps are really helpful and revolutionary.

http://wws.weizmann.ac.il/mcb/UriAlon/research/network-motifs

http://sysbio.med.harvard.edu/slide-show-items/assembling-cancer-networks

 

Proudly powered by WordPress
Theme: Esquire by Matthew Buchanan.

Skip to toolbar