Racial disparities in patient survival and tumor mutation burden, and the association between tumor mutation burden and cancer incidence rate

Zhang, Wensheng; Edwards, Andrea; Flemington, Erik K.; Zhang, Kun

doi:10.1038/s41598-017-13091-y

Download PDF

Article
Open access
Published: 20 October 2017

Racial disparities in patient survival and tumor mutation burden, and the association between tumor mutation burden and cancer incidence rate

Wensheng Zhang¹,
Andrea Edwards¹,
Erik K. Flemington² &
…
Kun Zhang¹

Scientific Reports volume 7, Article number: 13639 (2017) Cite this article

3129 Accesses
35 Citations
1 Altmetric
Metrics details

Subjects

Abstract

The causes underlying racial disparities in cancer are multifactorial. In addition to socioeconomic issues, biological factors may contribute to these inequities, especially in disease incidence and patient survival. To date, there have been few studies that relate the disparities in these aspects to genetic aberrations. In this work, we studied the impacts of race on the patient survival and tumor mutation burden using the data released by the Cancer Genome Atlas (TCGA). The potential relationship between mutation burden and disease incidence is further inferred by an integrative analysis of TCGA data and the data from the Surveillance, Epidemiology, and End Results (SEER) Program. The results show that disparities are present (p < 0.05) in patient survival of five cancers, such as head and neck squamous cell carcinoma. The numbers of tumor driver mutations are differentiated (p < 0.05) over the racial groups in five cancers, such as lung adenocarcinoma. By treating a specific cancer type and a racial group as an “experimental unit”, driver mutation numbers demonstrate a significant (r = 0.46, p < 0.002) positive correlation with cancer incidence rates, especially when the five cancers with mutational disparities are exclusively focused (r = 0.88, p < 0.00002). These results enrich our understanding of racial disparities in cancer and carcinogenic process.

Feasibility of functional precision medicine for guiding treatment of relapsed or refractory pediatric cancers

Article Open access 11 April 2024

Arlet M. Acanda De La Rocha, Noah E. Berlow, … Diana J. Azzam

Evolutionary trajectories of small cell lung cancer under therapy

Article Open access 13 March 2024

Julie George, Lukas Maas, … Roman K. Thomas

Deep whole-genome analysis of 494 hepatocellular carcinomas

Article 14 February 2024

Lei Chen, Chong Zhang, … Hongyang Wang

Introduction

Eliminating racial disparities in cancer screening, diagnosis, treatment and mortality is an essential step toward the improvement of health outcomes for all cancer patients in America¹. The promise of this objective depends on identifying and addressing the multifactorial reasons underlying the disparities. It is well recognized that socioeconomic issues, such as income and treatment delays, play a critical role in the high mortality of several cancers in minority populations². Meanwhile, some studies show that biological factors may contribute to these inequities, especially in disease incidence and patient survival³.

Previous studied have associated the race-related survival stratification of cancer patients to the differences of genetic alterations present in tumor cells. Carethers et al.⁴ showed that the frequency of microsatellite instability (MSI) among African American colon cancers is half of that of MSI for the Caucasian counterpart. The authors proposed that, because MSI is associated with good survival for colon cancer patients, the relative lack of MSI in African American patients could be related to the high morality. Keenan et al.⁵ reported that racial differences in TP53 mutation, PAM50 basal subtype and triple-negative tumor prevalence influence the magnitude and significance of racial disparity in tumor recurrence of breast cancer. Petrovics et al.⁶ observed distinct prevalence between African American (AA) and Caucasian American prostate cancer (CaP) genomes in three recurrent genomic alterations, which occurred in the genes (loci) PTEN, LSAMP region and ERG. They further found that a novel deletion of the LSAMP locus, as a prevalent genomic alteration in AA CaP, was associated with rapid disease progression.

In this study, we first used the data released by the Cancer Genome Atlas (TCGA) to estimate the effect of race on patient survival time and mutation burden of tumors in 16 cancer types (subtypes). Then, we extended the analysis to the determination of potential relationship between mutation burden and disease incidence, a less investigated issue, by integrating TCGA data and the data from the Surveillance, Epidemiology, and End Results (SEER) Program. The results obtained from this study enrich our knowledge in cancer disparities and the related carcinogenic process.

Material and Methods

TCGA data

We downloaded the clinical and somatic data from the TCGA database (http://cancergenome.nih.gov/) on April 24, 2015. Those data, contributed by different institutes, are generated using various sequencing platforms, somatic mutation calling algorithms and computational tools. Except for ovarian carcinomas (OV), we choose one representative dataset for each cancer type according to the following criteria. First, the selected dataset contains the largest number of tumor samples (or patients). Second, if two or more datasets are of the same size, we choose the one in which the mutations are measured by the IlluminaGA DNASeq platform and are called by the latest automated system. Lastly, if the decision cannot be reached by the previous two steps, we select the dataset provided by the UCSC Genome Browser. For OV, we employ the datasets from Massachusetts Institute of Technology and Washington University in St. Louis. The basic information of the used somatic and clinical datasets is summarized in Supplementary Table 1. Synonymous mutations and those under the categories of “intron” and “rna” are excluded from further analysis.

SEER data

Age-adjusted race-specific cancer incidence rates, based on the registries in 18 areas from 2008–2012 (or from 1992–2007 for glioblastoma multiforme (GBM)), are retrieved from the SEER website (http://seer.cancer.gov/). In the SEER review reports, cancers are categorized by tissue sites. For a TCGA cancer, if it is the absolutely-predominant subtype of a SEER cancer, the incidence rate (the number of new cancer cases per 100,000 individuals per year) in a racial group is estimated by the incidence rate of the SEER cancer. Otherwise, a race-specific incidence rate of the TCGA cancer (Cancer-A) is estimated by multiplying the incidence rate of the SEER cancer (Cancer-B) that covers Cancer-A with a weight that represents the proportion of the tumor cases of Cancer-A among the total cases of Cancer-B. When the SEER reports do not include the distribution of histological subtypes for a cancer, the weight information for estimating the incidence rates of a TCGA cancer is obtained from other literature. In particular, the data in Olshan et al.⁷ are used in estimating the incidence rates of KIRC and KIPC, and the data in Wright et al.⁸ and Dubrow & Darefsky⁹ are applied to the estimations for UCEC and GBM, respectively. The details regarding the adaptation of incidence rates from the SEER cancers to the TCGA cancers are described in Supplementary Table 2.

Data of stem cell divisions

The lifetime number of stem cell divisions for eight cancer tissues (out of the 16 TCGA cancers summarized in Table 1) are estimated by Tomasetti and Vogelstein¹⁰. We directly use their estimations in this study.

Table 1 The summary of sample profiles^‡.

Full size table

Racial groups

The TCGA patients (or tumors) are partitioned into three racial groups, “White”, “Black” and “Asian”. We exclude the patients that do not belong to these groups. These groups are aligned to the SEER populations “White”, “Black” and “Asian and Pacific Islands”, respectively.

Statistical analysis

We use R to perform all statistical analyses. The race-specific Kaplan-Meier survival curves are created by the function “survfit()” in the package “survival”. P-values for the difference between two races in patient survival time is calculated by the function coxph() in the package “survival”¹¹ and the function rmst2() in the package “survRM2”¹². In the implementations, patient-age at the initial clinical date is included as a covariate and the default arguments are used. The functions wilcox() and lm() in the package “stats” are used in the Mann Whitney test and linear regression analysis, respectively.

Results

Among the 33 cancer types with clinically-annotated multi-omic data available at the TCGA database by April 24, 2015, sixteen are studied in this work. Each of the selected cancer types has at least 14 patients from a minority population (i.e. black or Asian Americans) besides the dominant white Americans (Table 1). The studied cancer types include bladder urothelial carcinoma (BLCA), glioblastoma multiforme (GBM), head and neck squamous cell carcinoma (HNSC), kidney renal clear cell carcinoma (KIRC), lung adenocarcinoma (LUAD), lung squamous cell carcinoma (LUSC), breast invasive carcinoma (BRCA), ovarian serous cystadenocarcinoma (OV), uterine corpus endometrial carcinoma (UCEC), colon adenocarcinoma (COAD), thyroid carcinoma (THCA), cervical squamous cell carcinoma and endocervical adenocarcinoma (CESC), esophageal carcinoma (ESCA), kidney renal papillary cell carcinoma (KIRP) liver hepatocellular carcinoma (LIHC), and stomach adenocarcinoma (STAD). The sample sizes of those cancer types range from 171 to 967.

Racial disparity in cancer incidence rate

We use a naïve binomial test to estimate the p-value for the difference of cancer incidence rates between black (or Asian) and white groups for each cancer type (Results are presented in Supplementary Table 3 and the method is outlined in the table notes). We find that, except for Black versus White in three cancers (i.e. BRCA, CESC and ESCA) and Asian versus White in one cancer (i.e. THCA), all the other differences are significant (p < 0.01).

Racial disparity in patient survival

For each TCGA cancer type, the samples not belonging to the white, black or Asian racial groups are excluded from the survival analysis. Two statistical methods are employed. One is the conventional Cox proportional hazard (Cox-PH) regression, and the other is the Restricted Mean Survival Time (RMST)¹³. Compared to a Cox-PH model, RMST has an advantage in alleviating the potential low efficiency, which may happen when the Kaplan Meier survival curves of two groups substantially deviate from parallelism and/or cover different age domains. However, its implementation needs a cut-off for survival time, potentially leading to the loss of information and statistical power. In our analysis, the significance of a group comparison is determined by an aggregated p-value (p), which integrates p _COX ₋ _PH (the p-value obtained from the Cox-PH analysis) and p _RMST (the p-value obtained from the RMST method) by the conventional Bonferroni method¹⁴. The formula is p = K×min(p _COX−PH’ p _RMST), where K = 2.

Five cancer types demonstrate racial disparities in the overall survival time of patients (Fig. 1). The first is HNSC, in which the survival in black patients is significantly worse than that in white patients (p = 0.038). The second is LUAD in which Asian patients show a nearly perfect survival profile. Although the Asian group contains only eight samples, the comparison with white patients is extremely significant (p < 0.001). Black patients demonstrate a beyond-five-year survival advantage over white patients but the difference is not significant (p > 0.05). Nevertheless, the difference of ten-year survival rates between the black and white patients is significant (p−value = 0.002) if Fisher’s Exact Test is used. The third is LUSC, in which none of the nine Asian patients lived more than three years and their survival is significantly poorer compared to white patients (p = 0.024). For the last two cancer types, i.e. LIHC and STAD, the p-values of the comparisons between Asian and white groups are 0.035 and 0.017, respectively. The Asian group also demonstrates much desired survival rates (over 80%) until 40 months. In particular, the survival advantage of Asian patients over white and black STAD patients is still substantial after 90 months from the initial clinical dates.

Racial disparity in tumor mutation burdens

By a Mann Whitney test, in which the null hypothesis is that the mean ranks of the groups are the same, we evaluate the between-race differences of mutation burdens (i.e. the numbers of somatic mutations) in three gene sets (or catalogues). The first, pcDriver, contains 291 (high-confident) driver genes identified by a pan-cancer project¹⁵. The second consists of the 506 cancer genes collected by the Cancer Gene Census of COSMIC (Catalogue of Somatic Mutations in Cancer) database¹⁶. The third includes all the HUGO genes for which official symbols have been approved by the Human Genome Organization Nomenclature Committee. It is worth noting that, if a gene has two or multiple mutations in an individual sample, each of those mutations will be counted towards the calculation of mutation burden.

As shown in Table 2, racial disparities (p < 0.05) are observed in five cancer types regarding the mutations present in the pcDriver genes. Specifically, in BLCA, a median white patient has 11 driver mutations, 4 more than that of an average Asian patient. A similar but less significant pattern is found in KIRC. Among LUAD patients, black patients have heavier driver mutation burden compared to white patients. Their medians are 13 and 9, respectively. On the other hand, white patients suffer more mutations than black patients for UCEC and KIRP. In addition, the difference between black and Asian patients is significant in UCEC.

Table 2 The statistics of non-synonymous somatic mutations in the pan-cancer driver (pcDriver) genes^‡.

Full size table

We also observe the racial disparities in BLCA, KIRC and LUAD, but not UCEC and KIRP, regarding the mutations present in the COSMIC genes and HOGO genes (Supplementary Tables 4 and 5). The analysis of these two gene catalogues also shows some racial disparities that are not detected in the analysis of the pcDriver genes. Several cancers, including BRCA, CESC, OV and ESEA, are involved.

Relationship between tumor mutation burden and cancer incidence rate

We further investigate whether the observed mutational disparities can explain the variability of cancer incidence by a set of statistical analyses. In these analyses, we treat the combination of a racial group and a cancer type as an “experimental” unit, whose incidence and mutation quantities constitute an observation (or record) in the working dataset.

The first analysis (AS-1) focuses on the five cancers that demonstrate mutational disparities in driver genes (highlighted in Table 2). The association between cancer incidence rate and the number of mutations in the pan-cancer driver (pcDriver) genes or the log2 transformed number of mutations in the HOGO genes is estimated by the Pearson correlation (r). As showed in Fig. 2, the association is quite strong (r = 0.88 or 0.79, p < 0.00002 or 0.005) and the pattern approximately demonstrates a linear relationship.

The second analysis (AS-2) repeats the correlation tests using the information of 15 cancers (of the 16 cancers listed in Table 1). BRCA is excluded from the analysis because its extremely-high incidence rates could dominate the parameter estimation. The results (Fig. 3) largely confirm the positive association between cancer incidence and mutation burden observed in AS-1.

The third analysis (AS-3) is based on the information of 8 cancers, i.e. COAD, ESCA, GBM, HNSC, LIHC, LUAD, LUSC and THCA, which are a subset of the 31 cancers studied by Tomasetti and Vogelstein¹⁰. The effects of mutation burden and the lifetime number of stem cell divisions (SCD, Supplementary Table 6) on cancer incidence are evaluated by five regression models (Table 3). The results show that driver mutation burden (DM) can explain ~25% of the variability of cancer incidence across cancer types and racial groups, similar to the percentage explained by cell divisions. The model containing both DM and SCD as the explanatory variables is more predictive (R ² = 0.374) than the models with either DM or SCD as the only explanatory variable.

Table 3 The regression of cancer incidence on stem cell division and somatic mutation burden.

Full size table

Further analysis on the relationship between mutation burden and cancer incidence rate

AS-S1

Not all non-synonymous mutations occurring in pcDriver genes are driver mutations. DNA bases in which driver mutations occur tend to be, but not necessarily are, conservative in mammalian evolution^17,18. In this regard, the number of mutations in pcDriver genes in a tumor should be considered as an estimate (or a representative metric) of its driver mutation burden.

Of the total 33400 non-synonymous pcDriver gene mutations in all 4839 tumor samples analyzed in this study, 29623 (amounting to 88.7%) are single nucleotide variations (SNVs). The others include 11 double nucleotide polymorphisms (DNPs) and 3766 indels. We retrieve the PHRED-like deleteriousness scores (Scaled C-Scores) of these SNVs from Combined Annotation Dependent Depletion (CADD) (http://cadd.gs.washington.edu/)¹⁹. We find that 86.6% of the obtained scores are larger than 15 (a mutation with its C-Score over 15 is expected to be among the 3.2% of the most deleterious SNVs), the cutoff recommended by CADD for the identification of pathogenic variations. Among the 48 cancer-race groups, only KIRP-Asian group has the average Scaled C-Score (16.7) less than 20 (Supplementary Table 7).

By filtering the less deleterious (Scaled C-Score <15) SNVs from the mutation list of pcDriver genes, we generate an alternative estimate (or metric) of the driver mutation burden of a tumor sample. We find that the association pattern and strength (Supplementary Figure 1A) between this parsimoniously-measured mutation burden and cancer incidence rate are very similar to those shown Fig. 3A. This implies that the noise potentially introduced in measuring driver mutation burden do not seriously impact the validity of the findings presented in the previous subsection.

AS-S2

The mutations not occurring in cancer driver genes are typically known as passenger mutations. Passenger mutation burden is a proven, both empirically and theoretically, positive predictor for driver mutation burden²⁰. In the TCGA data, passenger mutations amount to ~93% of the total mutations. We calculate and test the correlation between the passenger mutation burden and cancer incidence rate (Supplementary Figure 1B). The result is similar to that between the total mutation burden and cancer incidence rate (Fig. 3B).

Discussion

In the literature, the mortality of a cancer and the variability across different racial groups are usually determined by epidemiological data^{7,8,9,21,22,23,24,25,26,27,28}. In this paper, we perform an integrative analysis of the clinical and genomic data of the TCGA tumor samples, finding racial disparities present in five cancer types with regard to the survival profile of patients. We also notice that, although some racial disparities observed from the analysis of epidemiological data are not identified due to the relatively small sample sizes of the minor racial groups, the Kaplan Meier curves still provide insight into the nature of these disparities. For example, it is well known that black lung cancer patients have a higher death rate compared to white patients²¹ and our result implies that the disparity is mainly due to the lower short-time survival chances of black LUSC patients. This is consistent with the opinion that the treatment of black patients has been more frequently delayed due to socioeconomic factors^21,26,27,29.

Personalized medicine is a new and exciting research field, being considered as the future of cancer patient management³⁰. The potential strength depends on the understanding of the biological and genetic characteristics of individual tumors³¹, for which the differences between racial populations may be an information source. In this study, we found that the numbers of tumor driver mutations are differentiated (p < 0.05) over the racial groups in five cancers. Theoretically, both genetic and environmental factors can contribute to these disparities. However, the detailed stories should vary, depending on cancer types. For example, the mutational disparity in LUAD is indicated by the small p-value for the White::Black comparison and is characterized by the high mutation burden in black patients. Since, among people of low socioeconomic status, black Americans have a higher smoking rate than the white³², it could not be too bold to attribute the mutational disparity to an environment factor. On the other hand, the racial disparity in BLCA is indicated by the small p-value for the White::Asian comparison and is characterized by the high mutation burden in white patients. Because there is no evidence showing that the lifestyles and diets of the black, whose mutational profile is similar to the Asian, are closer to the Asian than the white, the observed disparity in somatic mutations may be due to a genetic factor. These speculations warrant further validation with more relevant data.

The most remarkable finding of our work is that there is a significant positive correlation between the incidence rate and the race-specific median (driver) mutation burden of a cancer. This association seems to deviate from the well-known perception that relates cancer incidence rate to the total number of (driver) mutations that can be accumulated in a tissue during the lifespan of a person. The reason is that the measurement of mutation burden in a tumor is irrelevant to the size of stem cell populations (or the divisions) that varies substantially in an exponential scale across tissues. A potential explanation for the paradox is that: the requirement for driver mutations to develop cancer in a tissue with a large population of stem cells (and/or being readily subject to mutagens) could be relatively high but the precancerous cells meeting the threshold in such a tissue still outnumber the precancerous cells in a “smaller” (and/or “safe”) tissue. Similar hypotheses have been proposed to explain the famous Peto’s paradox, i.e. biological species of larger body mass and/or longer lifespan exhibit smaller than expected incidences of cancer³³. Different from the “bad luck” theory that attributes cancer to random mutations¹⁰, our results indicate the causal complexity of cancer. That is, besides tissue types, the race-related genetic and environmental factors are among the mediators for the association between the variabilities of mutation burden and disease incidence across tissues. Theoretically, mutation burden in a tumor is directly related to the number of somatic cells derived from a single stem cell. In this regard, there is a similarity between our result and that reported by Noble et al.³⁴. The publication shows that both components of the lifetime number of stem cell divisions, i.e. standing stem cell number and per stem cell lifetime replication rate, have a statistically significant and independent effect on explaining variation in cancer incidence over the 31 cancer types studied by Tomasetti and Vogelstein¹⁰.

References

Koh, H. K. Toward the elimination of cancer disparities: clinical and public health perspectives. (Springer, 2009).
Ashing-Giwa, K. T. et al. Diagnostic and therapeutic delays among a multiethnic sample of breast and cervical cancer survivors. Cancer 116, 3195–3204, https://doi.org/10.1002/cncr.25060 (2010).
Article PubMed Google Scholar
Anderson, N. B., Bulatao, R. A., Cohen, B. & National Research Council (U.S.). Panel on Race Ethnicity and Health in Later Life. Critical perspectives on racial and ethnic differences in health in late life. (National Academies Press, 2004).
Carethers, J. M. et al. Influence of race on microsatellite instability and CD8+T cell infiltration in colon cancer. PLoS One 9, e100461, https://doi.org/10.1371/journal.pone.0100461 (2014).
Article ADS PubMed PubMed Central Google Scholar
Keenan, T. et al. Comparison of the Genomic Landscape Between Primary Breast Cancer in African American Versus White Women and the Association of Racial Differences With Tumor Recurrence. J Clin Oncol 33, 3621–3627, https://doi.org/10.1200/JCO.2015.62.2126 (2015).
Article CAS PubMed PubMed Central Google Scholar
Petrovics, G. et al. A novel genomic alteration of LSAMP associates with aggressive prostate cancer in African American men. EBioMedicine 2, 1957–1964, https://doi.org/10.1016/j.ebiom.2015.10.028 (2015).
Article PubMed PubMed Central Google Scholar
Olshan, A. F. et al. Racial difference in histologic subtype of renal cell carcinoma. Cancer Med 2, 744–749, https://doi.org/10.1002/cam4.110 (2013).
PubMed PubMed Central Google Scholar
Wright, J. D. et al. Racial disparities for uterine corpus tumors: changes in clinical characteristics and treatment over time. Cancer 115, 1276–1285, https://doi.org/10.1002/cncr.24160 (2009).
Article PubMed Google Scholar
Dubrow, R. & Darefsky, A. S. Demographic variation in incidence of adult glioma by subtype, United States, 1992–2007. BMC Cancer 11, 325, https://doi.org/10.1186/1471-2407-11-325 (2011).
Article PubMed PubMed Central Google Scholar
Tomasetti, C. & Vogelstein, B. Cancer etiology. Variation in cancer risk among tissues can be explained by the number of stem cell divisions. Science 347, 78–81, https://doi.org/10.1126/science.1260825 (2015).
Article ADS CAS PubMed PubMed Central Google Scholar
Therneau, T. Package ‘survival’. https://cran.r-project.org/web/packages/survival/survival.pdf (2015).
Uno, H. Vignette for survRM2 package: Comparing two survival curves using the restricted mean survival time. https://cran.r-project.org/web/packages/survRM2/vignettes/survRM2-vignette3-1.pdf (2015).
Zucker, D. Restricted mean life with covariates: Modification and extension of a useful survival analysis method. J Am Stat Assoc 93, 702–709 (1998).
Article MathSciNet MATH Google Scholar
Vovk, V. Combining p-values via averaging. arXiv:1212.4966 [math.ST] (2012).
Tamborero, D. et al. Comprehensive identification of mutational cancer driver genes across 12 tumor types. Sci Rep 3, 2650, https://doi.org/10.1038/srep02650 (2013).
Article PubMed PubMed Central Google Scholar
Forbes, S. A. et al. COSMIC (the Catalogue of Somatic Mutations in Cancer): a resource to investigate acquired mutations in human cancer. Nucleic Acids Res 38, D652–657, https://doi.org/10.1093/nar/gkp995 (2010).
Article CAS PubMed Google Scholar
McFarland, C. D., Korolev, K. S., Kryukov, G. V., Sunyaev, S. R. & Mirny, L. A. Impact of deleterious passenger mutations on cancer progression. Proc Natl Acad Sci USA 110, 2910–2915, https://doi.org/10.1073/pnas.1213968110 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Zhang, W., Edwards, A., Flemington, E. K. & Zhang, K. Significant Prognostic Features and Patterns of Somatic TP53 Mutations in Human Cancers. Cancer Inform 16, 1176935117691267, https://doi.org/10.1177/1176935117691267 (2017).
PubMed PubMed Central Google Scholar
Kircher, M. et al. A general framework for estimating the relative pathogenicity of human genetic variants. Nat Genet 46, 310–315, https://doi.org/10.1038/ng.2892 (2014).
Article CAS PubMed PubMed Central Google Scholar
Bozic, I. et al. Accumulation of driver and passenger mutations during tumor progression. Proc Natl Acad Sci USA 107, 18545–18550, https://doi.org/10.1073/pnas.1010978107 (2010).
Article ADS CAS PubMed PubMed Central Google Scholar
Parker, S. L., Davis, K. J., Wingo, P. A., Ries, L. A. & Heath, C. W. Jr. Cancer statistics by race and ethnicity. CA Cancer J Clin 48, 31–48 (1998).
Article CAS PubMed Google Scholar
Adegoke, O., Kulasingam, S. & Virnig, B. Cervical cancer trends in the United States: a 35-year population-based analysis. J Womens Health (Larchmt) 21, 1031–1037, https://doi.org/10.1089/jwh.2011.3385 (2012).
Article Google Scholar
Long, B., Liu, F. W. & Bristow, R. E. Disparities in uterine cancer epidemiology, treatment, and survival among African Americans in the United States. Gynecol Oncol 130, 652–659, https://doi.org/10.1016/j.ygyno.2013.05.020 (2013).
Article CAS PubMed PubMed Central Google Scholar
Kim, J. et al. Race and ethnicity correlate with survival in patients with gastric adenocarcinoma. Ann Oncol 21, 152–160, https://doi.org/10.1093/annonc/mdp290 (2010).
Article CAS PubMed Google Scholar
Terplan, M., Smith, E. J. & Temkin, S. M. Race in ovarian cancer treatment and survival: a systematic review with meta-analysis. Cancer Causes Control 20, 1139–1150, https://doi.org/10.1007/s10552-009-9322-2 (2009).
Article PubMed Google Scholar
Shavers, V. L. & Brown, M. L. Racial and ethnic disparities in the receipt of cancer treatment. J Natl Cancer Inst 94, 334–357 (2002).
Article PubMed Google Scholar
Bach, P. B., Cramer, L. D., Warren, J. L. & Begg, C. B. Racial differences in the treatment of early-stage lung cancer. N Engl J Med 341, 1198–1205, https://doi.org/10.1056/NEJM199910143411606 (1999).
Article CAS PubMed Google Scholar
Setiawan, V. W. et al. Racial/ethnic differences in endometrial cancer risk: the multiethnic cohort study. Am J Epidemiol 165, 262–270 (2007).
Article PubMed Google Scholar
Lathan, C. Lung Cancer Disparities in the Era of Personalized Medicine. The American Journal of Hematology/Oncology 11, 5–8 (2015).
Google Scholar
Diamandis, M., White, N. M. & Yousef, G. M. Personalized medicine: marking a new epoch in cancer patient management. Mol Cancer Res 8, 1175–1187, https://doi.org/10.1158/1541-7786.MCR-10-0264 (2010).
Article CAS PubMed Google Scholar
Cho, S. H., Jeon, J. & Kim, S. I. Personalized medicine in breast cancer: a systematic review. J Breast Cancer 15, 265–272, https://doi.org/10.4048/jbc.2012.15.3.265 (2012).
Article PubMed PubMed Central Google Scholar
Delva, J. et al. Cigarette smoking among low-income African Americans: a serious public health problem. Am J Prev Med 29, 218–220 (2005).
Article PubMed PubMed Central Google Scholar
Noble, R., Kaltz, O. & Hochberg, M. E. Peto’s paradox and human cancers. Philos Trans R Soc Lond B Biol Sci 370 (2015).
Noble, R., Kaltz, O. & Hochberg, M. E. Statistical interpretations and new findings on Variation in Cancer Risk Among Tissues. arXiv:1502.01061[q-bio.PE] (2015).

Download references

Acknowledgements

This publication was made possible by funding from the NIH grants 2G12MD007595 and P01 CA214091, the DOD ARO grant #W911NF-15-1-0510 and the Louisiana Cancer Research Consortium (LCRC). The contents are solely the responsibility of the authors and do not necessarily represent the official views of the NIH, DOD or LCRC. The analyses presented here are based upon the data published by The Cancer Genome Atlas (TCGA). We downloaded the data sets via The TCGA Data Portal, which are managed by the NCI and NHGRI. At present, all TCGA data resides at the Genomic Data Commons (https://gdc-portal.nci.nih.gov/legacy-archive/search/f). The authors thank the two reviewers for their constructive comments.

Author information

Authors and Affiliations

Department of Computer Science, Bioinformatics facility of Xavier RCMI Center of Cancer Research, Xavier University of Louisiana, 1 Drexel Drive, New Orleans, LA, 70125, USA
Wensheng Zhang, Andrea Edwards & Kun Zhang
Tulane School of Medicine, Tulane Cancer Center, Tulane University, 1700 Tulane Ave, New Orleans, LA, 70112, USA
Erik K. Flemington

Authors

Wensheng Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Andrea Edwards
View author publications
You can also search for this author in PubMed Google Scholar
Erik K. Flemington
View author publications
You can also search for this author in PubMed Google Scholar
Kun Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Conceived and designed the experiments: W.Z., K.Z. Performed the experiments: W.Z. Analyzed the data: W.Z., K.Z. Wrote the paper: W.Z., A.E., E.F., K.Z. Helped with experiment design: E.F., A.E. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Kun Zhang.

Ethics declarations

Competing Interests

The authors declare that they have no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Zhang, W., Edwards, A., Flemington, E.K. et al. Racial disparities in patient survival and tumor mutation burden, and the association between tumor mutation burden and cancer incidence rate. Sci Rep 7, 13639 (2017). https://doi.org/10.1038/s41598-017-13091-y

Download citation

Received: 10 March 2017
Accepted: 19 September 2017
Published: 20 October 2017
DOI: https://doi.org/10.1038/s41598-017-13091-y

This article is cited by

Algorithmic fairness in artificial intelligence for medicine and healthcare
- Richard J. Chen
- Judy J. Wang
- Faisal Mahmood
Nature Biomedical Engineering (2023)
The prognosis and treatment of newly diagnosed bone metastasis of head and neck squamous cell cancer: an analysis of racial disparity
- Huimei Huang
- Shiying Zeng
- Gangcai Zhu
Clinical and Translational Oncology (2023)
Age influences on the molecular presentation of tumours
- Constance H. Li
- Syed Haider
- Paul C. Boutros
Nature Communications (2022)
Demographic reporting across a decade of neuroimaging: a systematic review
- Elijah Sterling
- Hannah Pearl
- Candace C. Fleischer
Brain Imaging and Behavior (2022)
JCGA: the Japanese version of the Cancer Genome Atlas and its contribution to the interpretation of gene alterations detected in clinical cancer genome sequencing
- Masakuni Serizawa
- Maki Mizuguchi
- Ken Yamaguchi
Human Genome Variation (2021)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Material and Methods

TCGA data

SEER data

Data of stem cell divisions

Racial groups

Statistical analysis

Results

Racial disparity in cancer incidence rate

Racial disparity in patient survival

Racial disparity in tumor mutation burdens

Relationship between tumor mutation burden and cancer incidence rate

Further analysis on the relationship between mutation burden and cancer incidence rate

AS-S1

AS-S2

Discussion

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing Interests

Additional information

Electronic supplementary material

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links