Mamatha Bhat, Elisa Pasini, Chiara Pastrello, Sara Rahmati, Marc Angeli, Max Kotlyar, Anand Ghanekar, Igor Jurisica
Mamatha Bhat, Elisa Pasini, Marc Angeli, Multi Organ transplant Program, University Health Network, Toronto M5G2N2, Canada
Chiara Pastrello, Sara Rahmati, Max Kotlyar, Igor Jurisica, Osteoarthritis Research Program, Division of Orthopedic Surgery, Schroeder Arthritis Institute, University Health Network and Krembil Research Institute, University Health Network, Toronto M5T 0S8, Canada
Anand Ghanekar, Surgery, University Health Network, Toronto M5G 2C4, Canada
Igor Jurisica, Departments of Medical Biophysics and Computer Science, University of Toronto, Toronto M5T 0S8, Canada
Abstract
BACKGROUND The broader use of high-throughput technologies has led to improved molecular characterization of hepatocellular carcinoma (HCC).
AIM To comprehensively analyze and characterize all publicly available genomic, gene expression, methylation, miRNA and proteomic data in HCC, covering 85 studies and 3355 patient sample profiles, to identify the key dysregulated genes and the pathways they affect.
METHODS We collected and curated all well-annotated and publicly available high-throughput datasets from PubMed and Gene Expression Omnibus derived from human HCC tissue. Comprehensive pathway enrichment analysis was performed using pathDIP for each data type (genomic, gene expression, methylation, miRNA and proteomic), and the overlap of pathways was assessed to elucidate pathway dependencies in HCC.
RESULTS The PubMed search on HCC retrieved a total of 8733 abstracts for the different layers of data on human HCC samples, published until December 2016. The common key dysregulated pathways in HCC tissue across different layers of data included the epidermal growth factor receptor (EGFR) and β1-integrin pathways. Genes along these pathways were significantly and consistently dysregulated across the different types of high-throughput data and had prognostic value with respect to overall survival. According to the CTD database, estradiol was the agent that would best modulate and revert these genes appropriately.
CONCLUSION By analyzing and integrating all available high-throughput genomic, transcriptomic, miRNA, methylation and proteomic data from human HCC tissue, we identified EGFR, β1-integrin and axon guidance as pathway dependencies in HCC. These are master regulators of key pathways in HCC, such as the mTOR, Ras/Raf/MAPK and p53 pathways. The genes implicated in these pathways had prognostic value in HCC, with Netrin and Slit3 being novel proteins of prognostic importance to HCC. Based on this integrative analysis, EGFR and β1-integrin are master regulators that could serve as potential therapeutic targets in HCC.
Key Words: Hepatocellular carcinoma; Gene expression; miRNA; Methylation; Proteomics; High throughput data
The molecular basis of hepatocellular carcinoma (HCC) has been elusive, given the significant heterogeneity of this tumor, which arises in the context of various chronic liver diseases[1]. HCC remains a high-fatality cancer, despite large-scale efforts to better characterize and therapeutically target this malignancy. Since the prevalence of cirrhosis due to hepatitis C and fatty liver disease is increasing in North America, the incidence of HCC continues to rise[2]. Five-year survival remains poor at 18% due to late diagnosis and the inability of patients with cirrhosis to tolerate chemotherapy[2]. Consequently, there is an urgent need to better understand the molecular basis of this highly fatal cancer.
Clinical management of HCC is optimized based on disease stage[3]. Curative treatment with resection, radiofrequency ablation or transplantation is possible in early-stage disease[4]. When HCC is diagnosed at a later stage, the first-line chemotherapy is sorafenib, which is directed against the Ras/Raf/MAPK pathway[4]. This is associated with a very modest improvement in overall survival of about 3 additional months as compared to placebo (10.7 mo vs 7.9 mo)[5].
The Cancer Genome Atlas (TCGA) is a large-scale project that has enabled improved characterization of cancers with several layers of data. The TCGA multi-platform analysis of 196 HCC tumors described this cancer as highly heterogeneous and difficult to characterize, although certain key pathways did emerge, including the Ras/Raf/MAPK, mTOR, Wnt/β-catenin, and Sonic Hedgehog pathways[1,6]. Integration of various types of data has previously been performed to map interaction networks. By integrating genomic, transcriptomic and proteomic data, one can understand potential interactions that contribute to a disease condition or process[7,8]. These interactions may otherwise not be uncovered on the basis of a single type of data. This systems biology approach has been especially important in cancer, given that alterations in one gene can have a ripple effect on proteins in the rest of a protein-protein interaction network. Therefore, elucidating the layers of data in a disease can provide additional insights into the pathways that drive cancer[9].
In the current study, we aim to characterize the landscape of high-throughput data profiling in HCC and determine the patterns in key dysregulated genes and pathways across these different layers of data. The patterns that emerge could help in better understanding the pathways that drive HCC and could be considered as therapeutic targets.
We downloaded all available high-throughput genomic, transcriptomic, microRNA, methylation, and proteomic datasets related to human HCC samples from published datasets (PubMed, http://www.ncbi.nlm.nih.gov/PubMed and Gene Expression Omnibus (GEO), https://www.ncbi.nlm.nih.gov/geo).
Using PubMed, the following search was performed for whole exome sequencing data on HCC: ("carcinoma, hepatocellular" [MeSH Terms] OR ("carcinoma" [All Fields] AND "hepatocellular" [All Fields]) OR "hepatocellular carcinoma" [All Fields] OR ("hepatocellular" [All Fields] AND "carcinoma" [All Fields])) AND (whole [All Fields] AND ("exome" [MeSH Terms] OR "exome" [All Fields]) AND sequencing [All Fields]).
The following MeSH terms were used to identify gene expression papers: ("carcinoma, hepatocellular" [MeSH Terms] OR ("carcinoma" [All Fields] AND "hepatocellular" [All Fields]) OR "hepatocellular carcinoma" [All Fields] OR ("hepatocellular" [All Fields] AND "carcinoma" [All Fields])) AND ("gene expression" [MeSH Terms] OR ("gene" [All Fields] AND "expression" [All Fields]) OR "gene expression" [All Fields]) AND ("humans" [MeSH Terms] OR "humans" [All Fields]) AND English [All Fields] NOT ("review" [Publication Type] OR "review literature as topic" [MeSH Terms] OR "reviews" [All Fields]).
To identify suitable papers regarding methylation in HCC, we used the following terms: ("methylation" [MeSH Terms] OR "methylation" [All Fields]) AND ("carcinoma, hepatocellular" [MeSH Terms] OR ("carcinoma" [All Fields] AND "hepatocellular" [All Fields]) OR "hepatocellular carcinoma" [All Fields] OR ("hepatocellular" [All Fields] AND "carcinoma" [All Fields]) AND ("humans" [MeSH Terms] AND English [lang]).
Proteomics papers were retrieved using the following search: [("proteomics" [MeSH Terms] OR "proteomics" [All Fields]) AND high [All Fields] AND throughput [All Fields]] AND ("carcinoma, hepatocellular" [MeSH Terms]) OR ("carcinoma" [All Fields] AND "hepatocellular" [All Fields]) OR "hepatocellular carcinoma" [All Fields] OR ("hepatocellular" [All Fields] AND "carcinoma" [All Fields]).
MicroRNAs reported in HCC were identified using these MeSH terms: ("micrornas" [MeSH Terms] OR "micrornas" [All Fields] OR "mirna" [All Fields]) AND profile [All Fields] AND ("carcinoma, hepatocellular" [MeSH Terms] OR ("carcinoma" [All Fields] AND "hepatocellular" [All Fields]) OR "hepatocellular carcinoma" [All Fields] OR ("hepatocellular" [All Fields] AND "carcinoma" [All Fields]).
We considered for inclusion all datasets available in PubMed.
The datasets publicly available on GEO, a public functional genomics data repository of high-throughput array data (https://www.ncbi.nlm.nih.gov/geo), were retrieved and analyzed using GEO2R (https://www.ncbi.nlm.nih.gov/geo/info/geo2r.html), a web tool available on the portal, to identify genes differentially expressed between samples of HCC and the non-tumoral liver portion. GEO2R compares original submitter-supplied processed data tables using the GEOquery and limma R packages from the Bioconductor project. Following the instructions available online (https://www.ncbi.nlm.nih.gov/geo/info/geo2r.html), we retrieved all dysregulated genes. Only those with an adjusted P value < 0.05 and an expression fold change ≤ 0.5 or ≥ 1.5 were considered for further analysis (Table 1, Supplementary Table 1). The genes included in our list from WES papers were those reported as affected by nonsynonymous mutations; synonymous mutations were not considered. Putative microRNA gene targets were identified using an online database, mirDIP 4.1[10] (http://ophid.utoronto.ca/mirDIP). The most stringent predictive search option (top 1%) was used to obtain the list of putative targets of all differentially expressed miRNAs.
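The gene-level selection rule described above can be illustrated with a minimal sketch. It assumes the fold change is expressed on a linear scale (a log2 fold change, as reported by limma, would first be converted with 2 ** logFC); the `GeneResult` record, the gene names and the values are hypothetical and are not taken from the study data.

```python
from dataclasses import dataclass


@dataclass
class GeneResult:
    symbol: str         # hypothetical gene label
    adj_p: float        # adjusted P value
    fold_change: float  # linear fold change, HCC vs non-tumoral liver (assumption)


def is_dysregulated(r, p_cut=0.05, fc_low=0.5, fc_high=1.5):
    """Keep a gene if adjusted P < 0.05 and fold change <= 0.5 or >= 1.5."""
    return r.adj_p < p_cut and (r.fold_change <= fc_low or r.fold_change >= fc_high)


# Toy rows for illustration only.
rows = [
    GeneResult("GENE_A", 0.001, 2.3),  # significant and upregulated: kept
    GeneResult("GENE_B", 0.20, 3.0),   # large change but not significant: dropped
    GeneResult("GENE_C", 0.01, 0.4),   # significant and downregulated: kept
    GeneResult("GENE_D", 0.01, 1.1),   # significant but small change: dropped
]

selected = [r.symbol for r in rows if is_dysregulated(r)]
print(selected)
```

Both conditions must hold at once, so a gene with a large fold change but a non-significant adjusted P value is excluded.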
From the selected 11 methylation datasets, raw data from eight studies were available on the GEO website (https://www.ncbi.nlm.nih.gov/geo/). We selected the CpG sites or genes reported to be hyper- or hypomethylated in these publications. A genomic region was considered differentially methylated between HCC tissue and the adjacent non-tumoral sample if the FDR-corrected P value was < 0.01. Furthermore, we filtered out everything that did not satisfy the criterion Δβ ≥ 0.20 or Δβ ≤ -0.20, where Δβ = βHCC - βadjacent is the difference in methylation between the two groups specified above. When CpG sites were considered, the Illumina HumanMethylation450K and 27K platforms were used for mapping to the genes. When multiple sites or genes were found to have the same direction of differential methylation, the mean value of Δβ was calculated. Only CpGs in the 5'UTR, 1st exon, TSS200, TSS1500 or in CpG islands were considered in our analysis. Proteomic results were retrieved and included only if protein abundance was reported as different in HCC liver samples compared to control samples.
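As a hedged illustration of the methylation criteria above, the sketch below computes the difference in β values (HCC minus adjacent tissue) per CpG site, keeps sites with an FDR-corrected P value below 0.01 and an absolute difference of at least 0.20, and averages sites that map to the same gene with the same direction of change. The tuple layout, function name, gene labels and values are assumptions made for the example, not the authors' code or data.

```python
from statistics import mean


def differentially_methylated(sites, fdr_cut=0.01, delta_cut=0.20):
    """sites: iterable of (gene, beta_hcc, beta_adjacent, fdr_p) tuples.

    Returns {(gene, direction): mean delta-beta} for sites passing both filters.
    """
    grouped = {}
    for gene, beta_hcc, beta_adjacent, fdr_p in sites:
        delta = beta_hcc - beta_adjacent  # delta-beta = beta_HCC - beta_adjacent
        if fdr_p < fdr_cut and abs(delta) >= delta_cut:
            direction = "hyper" if delta > 0 else "hypo"
            grouped.setdefault((gene, direction), []).append(delta)
    # Average sites of the same gene that share the same direction of change.
    return {key: mean(values) for key, values in grouped.items()}


# Toy CpG sites for illustration only.
sites = [
    ("GENE_A", 0.80, 0.40, 0.001),  # hypermethylated, passes
    ("GENE_A", 0.70, 0.40, 0.005),  # second site, same gene and direction
    ("GENE_B", 0.30, 0.25, 0.001),  # difference too small, dropped
    ("GENE_C", 0.10, 0.50, 0.02),   # FDR not below 0.01, dropped
    ("GENE_D", 0.10, 0.50, 0.001),  # hypomethylated, passes
]

result = differentially_methylated(sites)
print(result)
```

Grouping by both gene and direction keeps hyper- and hypomethylated sites of the same gene from cancelling each other in the mean.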

Table 1 List of the final 85 selected publications for each layer of data. For each publication, the number of hepatocellular carcinoma samples and controls and the platform used for the analysis are reported

No.  Year  PMID      HCC (n)  Controls (n)
18   2004  15221772  20       20
19   2003  14673798  21       21
20   2003  14654528  21       21
21   2002  12481271  11       11
22   2013  23462207  7        7
23   2005  16335951  8        8
24   2006  16342242  10       10
25   2011  22034872  3        3
26   2005  15852300  7        7
27   2011  21913717  3        3
28   2007  17203974  25       28
29   2007  17586277  10       10

Whole exome sequencing
No.  Year  PMID      HCC (n)  Controls (n)  GEO dataset
1    2013  23912677  3        3             N/A
2    2014  24055508  4        7             N/A
3    2017  28323123  5        5             N/A
4    2014  24798001  231      231           GSE54504
5    2012  22561517  24       24            N/A

Epigenetic: miRNAs
No.  Year  PMID      HCC (n)  Controls (n)  GEO dataset
1    2015  26190160  9        7             N/A
2    2014  24789420  10       9             GSE31383
3    2014  24564407  45       45            GSE10694
4    2011  21298008  73       73            GSE21362
5    2008  18649363  78       10            N/A
6    2012  22135159  20       20            N/A
7    2011  21319996  94       94            N/A
8    2009  19473441  20       20            N/A
9    2009  19173277  35       N/A
10   2007  18171346  10       10            N/A
11   2006  16331254  25       25            N/A
12   2015  26062888  30       30            N/A
13   2015  26046780  327      43            N/A
14   2015  25861255  66       66            GSE54751
15   2015  25500075  6        6             GSE54537
16   2014  24875649  24       24
17   2013  23812667  166      166           GSE31384
18   2013  23390000  9        17            GSE40744
19   2012  23082062  18       18            N/A
20   2014  24586785  29       29            N/A
21   2013  24417970  78       78            N/A

Epigenetic: methylation

HCC: Hepatocellular carcinoma; GEO: Gene Expression Omnibus; N/A: Not applicable.
Figure 1 outlines our study workflow. Papers were excluded from each specific search for the following reasons: data derived from cell lines or animal models; studies of drug efficacy or of the presence of long non-coding RNA; mechanistic studies that did not perform high-throughput profiling or that evaluated the role of a single molecule; papers focused on liver diseases but not on HCC or liver tissue; papers without original data, such as review articles, or studies using already selected datasets; papers not reporting the modulation of the molecules; and papers without available data.
Available patient data, including etiology of liver disease (hepatitis C, hepatitis B, alcohol, fatty liver disease) on the basis of which the HCC tumors developed, presence of cirrhosis, the Model for End-stage Liver Disease score (MELD score, an assessment of the severity of liver dysfunction), tumor histology, stage of cancer, alpha-fetoprotein level, overall and recurrence-free survival following treatment were also documented (Supplementary Table 2).
The key dysregulated genes from each type of data (genomic, miRNA, methylation, transcriptomic, and proteomic) were fed into the Integrated Interactions Database (IID, http://ophid.utoronto.ca/iid)[11] to obtain a list of protein-protein interactions. For the miRNA dataset, we determined the target genes of the differentially expressed miRNAs in tumors using the miRNA Data Integration Portal, mirDIP v4.1[10]. The individual lists derived from each type of data were then fed into the pathway Data Integration Portal, pathDIP v3.0 (http://ophid.utoronto.ca/pathDIP)[12], in order to determine the significantly dysregulated pathways in HCC. pathDIP integrates data from 20 major pathway databases and computationally predicts gene association with curated pathways using protein-protein interactions from IID and the significance of their connectivity[12]. We used this comprehensive pathway enrichment analysis portal to obtain a list of significantly enriched pathways, using literature-curated (core) pathway memberships and an adjusted P value (FDR: BH method) of less than 0.05.
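For readers unfamiliar with the Benjamini-Hochberg (BH) adjustment named above, the following is a minimal, self-contained sketch of the standard step-up procedure. It is not pathDIP's implementation; the pathway labels and raw P values are hypothetical.

```python
def benjamini_hochberg(p_values):
    """Return BH-adjusted P values in the same order as the input."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])  # indices by ascending P
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest P value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


# Hypothetical enrichment results (raw P values) for five pathways.
pathways = ["pathway_1", "pathway_2", "pathway_3", "pathway_4", "pathway_5"]
raw_p = [0.001, 0.008, 0.039, 0.041, 0.6]

adjusted = benjamini_hochberg(raw_p)
significant = [name for name, q in zip(pathways, adjusted) if q < 0.05]
print(significant)
```

In this toy input, two pathways with raw P values below 0.05 (0.039 and 0.041) no longer pass after adjustment, which is the behavior the FDR threshold is meant to enforce.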
The lists of pathways from each type of data were then assessed for overlap using Venny 2.1, an online tool for Venn diagram design (http://bioinfogp.cnb.csic.es/tools/venny/index.html).
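The overlap assessment amounts to a set intersection across the per-layer pathway lists. The sketch below uses placeholder pathway names and assumes, as an illustration, that the miRNA and methylation lists are combined by union into a single epigenetic layer before intersecting; it does not reproduce Venny or the study's actual lists.

```python
# Placeholder pathway lists per data layer (hypothetical names).
genomic = {"pathway_A", "pathway_B", "pathway_C"}
transcriptomic = {"pathway_A", "pathway_B", "pathway_D"}
proteomic = {"pathway_A", "pathway_B", "pathway_E"}
mirna = {"pathway_A", "pathway_F"}
methylation = {"pathway_B", "pathway_G"}

# Assumption: the two epigenetic layers are merged by union.
epigenetic = mirna | methylation

layers = {
    "genomic": genomic,
    "transcriptomic": transcriptomic,
    "proteomic": proteomic,
    "epigenetic": epigenetic,
}

# Pathways present in every layer.
common = set.intersection(*layers.values())
print(sorted(common))
```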

Figure 1 Flow chart showing the paper selection process and exclusion criteria for each data type: Gene expression, proteomics, whole exome sequencing, microRNAs and methylation.
In order to determine whether key differentially expressed genes along the overlapping pathways had prognostic value, we used KMplotter, a web-based tool that enables survival analysis across multiple cancers and datasets[13]. Patient samples were split into two groups using auto-selection of the best cutoff for each gene, in order to assess its prognostic value. We ran multivariate overall survival analysis based on high vs low expression of each gene in HCC tumors. The two groups were compared by a Kaplan-Meier survival plot, and the hazard ratio with 95% confidence intervals and the log-rank P value were calculated.
Putative therapeutic agents able to revert the modulation of the genes of interest, where that modulation was associated with a worse prognosis, were identified using the online Comparative Toxicogenomics Database (CTD, http://ctdbase.org)[14]. This database provides manually curated information about chemical–gene/protein interactions, chemical–disease and gene–disease relationships.
The PubMed search on HCC retrieved a total of 8733 abstracts for the different layers of data on human HCC samples, published until December 2016. The flow chart outlining the selection process is detailed in Figure 1.
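To make the Kaplan-Meier comparison more concrete, here is a minimal sketch of the product-limit estimator for a single group. It is a textbook implementation rather than KMplotter's code, it omits the hazard ratio and log-rank test, and the follow-up times and event flags are invented for illustration.

```python
def kaplan_meier(times, events):
    """Product-limit survival estimate.

    times:  follow-up time per patient
    events: 1 if death was observed at that time, 0 if censored
    Returns a list of (time, survival probability) at each distinct event time.
    """
    data = sorted(zip(times, events))
    n_at_risk = len(data)
    survival = 1.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        deaths = 0
        leaving = 0
        # Collect everyone whose follow-up ends at time t.
        while i < len(data) and data[i][0] == t:
            deaths += data[i][1]
            leaving += 1
            i += 1
        if deaths:
            survival *= 1 - deaths / n_at_risk
            curve.append((t, survival))
        n_at_risk -= leaving
    return curve


# Toy group: five patients, follow-up in months (illustrative values only).
times = [5, 8, 12, 12, 20]
events = [1, 0, 1, 1, 0]

curve = kaplan_meier(times, events)
print(curve)
```

In practice the same estimator would be applied separately to the high- and low-expression groups, and the two curves compared.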
The numbers of samples included in our analysis are as follows: (1) Whole exome sequencing: 267 HCC and 270 control samples; (2) Gene expression: 870 HCC and 814 control samples; (3) miRNA: 1172 HCC and 771 control samples; (4) Methylation: 354 HCC and 341 control samples; and (5) Proteomics: 421 HCC and 473 control samples. The methodologies and platforms used to obtain these high-throughput data are reported by type of data (genomic, transcriptomic, miRNA, methylation and proteomic) in Table 1. Clinical data regarding etiology of liver disease (hepatitis C, hepatitis B, alcohol, fatty liver disease) were frequently reported, whereas serum levels of the liver enzymes AST and ALT, which are frequently used to assess liver function, were not available. Pathological details regarding differentiation or stage were frequently absent, as were other crucial variables in the clinical setting, such as Child-Pugh/MELD score (Supplementary Table 2).
There were 188 overlapping dysregulated genes/proteins across the different types of data. Independently for each type of data, we obtained a list of pathways using pathDIP. We merged the lists of dysregulated pathways from miRNA and methylation data, given that these epigenetically regulate gene expression, in order to assess for overlapping pathways across the datasets.
This resulted in a list of 3 common, overlapping pathways among the different types of data: EGFR, β1-integrin, and axon guidance pathways, as depicted in Figure 2. From the previous list of 188 common dysregulated elements in all different layers of data (Figure 3), we were able to identify 35/188 genes that were involved in these 3 shared pathways across the layers of data (Supplementary Table 1).
We then examined the prognostic value of the deregulated genes associated with the pathways of interest in HCC using the TCGA RNA-seq dataset, as listed in Table 2. The median survival of the 364 patients in the TCGA cohort, which was used to validate prognostic value, is reported. KMplotter HR results from TCGA RNA-seq data reflected the altered modulation identified for these 9 genes in the 19 HCC gene expression papers (Table 2). Among the five upregulated genes with HR values above 1, CDK5, which is involved in the cell cycle, had the highest HR (1.85, P = 0.0035) (Table 3). The other four upregulated genes, COL2A1, LAMC1, RPS6KA3 and ITGB1, also had HR values above 1 in the KMplotter analysis and are involved in cellular migration (Table 2 and Table 3).

Table 2 Prognostic value of the 9 dysregulated genes associated with the 3 common dysregulated pathways (epidermal growth factor receptor [EGFR], β1-integrin and axon guidance) among the 4 types of data, obtained with KMplotter
Four of the 9 genes were reported as downmodulated in the 19 HCC gene expression papers. Of these four, FGA and FGG showed the most statistically significant association (P = 0.0009) with a protective role in HCC (HR 0.52 and 0.59, respectively). FGA and FGG were consistently reported as downmodulated in about 45% of the 19 selected gene expression papers (Table 3). The other two downmodulated genes, EPHB1 and EGFR, had HR values below 1 (Table 2) and are reported to be affected by missense mutations leading to loss of their protective role against cell migration.
Using CTD, we found that estradiol was able to appropriately down- or upmodulate 4 of the 9 cancer-related genes (Table 2). In particular, CTD reported that estradiol can upregulate FGA, FGG and EGFR, which were reported as downmodulated in HCC (Table 2), and can counteract the upregulation of RPS6KA3 in HCC, suggesting a possible role for this hormone in HCC treatment.
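The notion of a compound "reverting" a gene signature can be expressed as a simple direction check: a gene counts as reverted when the compound's reported effect is opposite to the gene's dysregulation in HCC. The sketch below encodes the directions stated in the text for the 9 genes and for estradiol; the dictionary layout and variable names are illustrative assumptions and do not reflect the CTD data format.

```python
# Direction of dysregulation in HCC, as stated in the text for the 9 genes.
hcc_direction = {
    "CDK5": "up", "COL2A1": "up", "LAMC1": "up", "RPS6KA3": "up", "ITGB1": "up",
    "FGA": "down", "FGG": "down", "EPHB1": "down", "EGFR": "down",
}

# Effect of estradiol on gene expression, as stated in the text.
estradiol_effect = {"FGA": "up", "FGG": "up", "EGFR": "up", "RPS6KA3": "down"}

# A gene is reverted when the compound acts in the opposite direction.
reverted = sorted(
    gene
    for gene, direction in hcc_direction.items()
    if gene in estradiol_effect and estradiol_effect[gene] != direction
)
print(reverted, f"{len(reverted)}/{len(hcc_direction)}")
```

The count reproduces the 4 of 9 genes reported in the text.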
In this study, we evaluate the molecular pathogenesis of HCC using a unique approach: combining all publicly available high-throughput data from patient HCC tumors. This encompasses all miRNA, methylation, genomic, transcriptomic and proteomic profiling data present in the literature, and represents the first effort to derive a consensus molecular model of HCC through analysis of these different types of data. Although these datasets originated from different patient cohorts, the presented integrative analysis offers the opportunity to explore common key pathway dependencies of HCC. Starting with the initial generation of genomics and whole exome sequencing data, previous high-throughput studies have brought forth different lists of dysregulated genes, depending on the type of data evaluated. Dysregulated genes may affect different parts of a pathway. Therefore, a pathway-based approach when evaluating different types of high-throughput data offers the ability to assess the pathways most commonly affected in a given cancer. Additionally, the integrative analysis in our study encompasses a large number of patient samples.
Using this integrative approach, we confirm the importance of EGFR, β1-integrin and axon guidance as pathways critical in hepatocarcinogenesis. EGFR activates the signaling cascades of the Ras/Raf/MAPK and mTOR pathways, two pathways that were identified as key to HCC pathogenesis in the TCGA study[6]. The identification of β1-integrin as being commonly dysregulated in HCC is novel, and its significance is confirmed through its consistent dysregulation across types of data. β1-integrin is a cell surface receptor that senses the extracellular matrix, thereby modulating the hallmarks of cancer such as proliferative signaling with continuous activated cell replication, evasion of growth suppressors, resistance to angiogenesis as well as cancer cell invasion and metastasis[14]. Ras/Raf/MAPK and mTOR are established pathways in hepatocarcinogenesis, and are integrin-dependent signaling pathways[15]. Additionally, β1-integrin is known to crosstalk with EGFR. In fact, the downregulation of β1-integrin was found to decrease phosphorylation of EGFR and c-Met in hepatocytes during liver regeneration[16]. A synergistic relationship between integrins and EGFR has also been demonstrated in tumor progression[17].
The finding that axon guidance pathway-related proteins are dysregulated across types of data, thereby establishing consistent dysregulation of this pathway in HCC, is also novel. Netrin-1 is the best-studied protein in the axon guidance pathway, and is known to be overexpressed in various cancers[13]. It is responsible for regulation of apoptosis, with increased presence of netrin-1 leading to inhibition of apoptosis. The tumor suppressor p53, frequently mutated in the TCGA HCC study, regulates the cell cycle through netrin-1. The axon guidance pathway has previously been identified as a pathway that is significantly mutated in HCC based on integration of all genomic data in HCC[18]. This analysis revealed mutations along the axon guidance pathway as being prognostic of a higher rate of HCC metastasis. We were additionally able to validate the prognostic importance of dysregulated proteins in these pathways using TCGA data.

Table 3 Modulation of the 9 dysregulated genes associated with the 3 common dysregulated pathways (epidermal growth factor receptor [EGFR], β1-integrin and axon guidance) identified in the 19 hepatocellular carcinoma gene expression papers. Their genetic alteration in hepatocellular carcinoma and their mechanism in cancer are reported
HCC is a cancer that develops in the context of various chronic liver diseases, which may influence the molecular characteristics of HCC. Additionally, the underlying cirrhosis and liver dysfunction that are often concurrent may influence HCC development and behavior[2]. Patients are often diagnosed at an advanced stage of disease, when it is too late for curative treatment. A unique consideration in HCC is the inability to tolerate hepatotoxic chemotherapy in patients with liver dysfunction, as it is often patients with cirrhosis who develop HCC[19,20]. Therefore, liver function must be considered prior to, during, and after any form of treatment for HCC.

Figure 2 Venn diagram showing the three common pathways (epidermal growth factor receptor [EGFR], β1-integrin, and axon guidance) across the four different types of data.
Thus, especially for HCC, it has been suggested that a multi-pronged approach to HCC therapy jointly targeting different pathways be adopted.
Omics technologies are essential in the progress towards elucidating the molecular basis of HCC. The current study represents the largest integration of all publicly available genomic, gene expression, methylation, miRNA and proteomic data in HCC, covering 85 studies and 3355 patient sample profiles. We identified consistently deregulated pathways associated with hepatocarcinogenesis across different types of data using integrative analysis tools, thereby confirming the importance of these genes in HCC pathogenesis. EGFR (an activator of Ras/Raf/MAPK and mTOR) and β1-integrin (also a modulator of the aforementioned pathways) were clearly identified as pivotal to HCC[5,21-23]. This is in keeping with the efficacy of the Ras/Raf/MAPK inhibitors sorafenib and regorafenib in HCC[24].
Beyond this, we found that these consistently deregulated genes across pathways are appropriately modulated by estradiol. HCC is less common in women, and clinical studies have demonstrated that hormone therapy and female sex are protective against HCC.
Other integrative multi-omics studies have recently been performed for other tumors with high mortality, such as breast and ovarian cancer[6,25]. Several breast cancer studies have emphasized how integration of genomic, transcriptomic and proteomic data has improved the molecular characterization of breast cancer subtypes and helped elucidate the heterogeneity of this cancer, its interaction with the microenvironment, and its aggressiveness[26,27]. A single source of data was used in the ovarian cancer multi-omics mathematical integration performed by Bhardwaj et al[25]. Copy number variation, gene expression and methylation data from the TCGA data portal were integrated using a mathematical algorithm, which identified 32 co-expressed genes and 6 pathways associated with survival.

Figure 3 From the previous list of 188 common dysregulated elements in all different layers of data.
The main limitation of our study is that the various types of data were derived from different patient samples. Nonetheless, there is a large amount of high-throughput data, which allowed us to detect pathway dependency patterns that are compatible with the current HCC literature. Additionally, HCC tumors arise in the setting of various chronic liver diseases. We could not assess for etiology-specific genes and pathways in this study, given that the clinical and genetic data needed to evaluate these differences were not fully available for all the studies. Therefore, we could only evaluate gene differences over whole datasets, rather than individual patients, due to incomplete individual annotation of the samples available on GEO for each specific dataset. The HCC samples in this integrative analysis all came from patients who had undergone hepatectomy. There were no specimens from patients who were candidates for ablation therapy (early stage), those who were undergoing liver transplantation, or those with advanced HCC. One might anticipate that the molecular features of such tumors differ, given the different stages of HCC captured, but there is unfortunately a scarcity of data in this regard.
In conclusion, our study represents the largest integrative analysis of all publicly available data in HCC, spanning different types of high-throughput data. Pathway enrichment analysis elucidated EGFR, β1-integrin and axon guidance as pathway dependencies in HCC. These are proteins known to serve as master regulators of key pathways in HCC such as Ras/Raf/MAPK, Wnt/β-catenin and mTOR[28], and may serve as potential overarching therapeutic targets in HCC. The axon guidance pathway was identified as being of potential importance to HCC for the first time, with prognostic value suggested in patient sample validation with TCGA. Estradiol appropriately modulates a large number of deregulated genes across types of data and may be a therapeutic agent that helps in HCC. A combined therapeutic approach jointly targeting different pathways may be preferable in the treatment of HCC, especially when underlying hepatic dysfunction compromises the ability to tolerate optimal chemotherapeutic doses.
Hepatocellular carcinoma (HCC) is highly heterogeneous, difficult to characterize and the molecular basis of HCC has been elusive.
The Cancer Genome Atlas is a large-scale project that has enabled improved characterization of cancers with several layers of data.Elucidating the layers of data in a disease can provide additional insights into the pathways that drive cancer.
A novel integrative approach of all publicly available high-throughput data from patient HCC tumors was used to delineate critical pathway dependencies in HCC.
A comprehensive analysis and characterization of all publicly available genomic, gene expression, methylation, miRNA and proteomic data in HCC covered 85 studies and 3355 patient sample profiles and identified the key overlapping dysregulated genes and pathways affected.
We identified the prognostic value of these genes in HCC, with Netrin and Slit3 being novel proteins of prognostic importance to HCC.
Our large integrative analysis of all publicly available data in HCC and our pathway enrichment analysis have elucidated epidermal growth factor receptor, β1-integrin, and axon guidance as pathway dependencies in HCC.
Based on our integrative analysis, epidermal growth factor receptor and β1-integrin are master regulators that could be considered as potential therapeutic targets in HCC.
The authors thank undergraduate students Sujitha Srinathan, Emily Chen, Bishoy Lawendy, Nangi Suo and Amira Abdallah for their help in data curation.
World Journal of Hepatology, 2021, Issue 1