Toward early detection of Helicobacter pylori-associated gastric cancer

Walker, Rachel; Poleszczuk, Jan; Mejia, Jaime; Lee, Jae K.; Pimiento, Jose M.; Malafa, Mokenge; Giuliano, Anna R.; Enderling, Heiko; Coppola, Domenico

doi:10.1007/s10120-017-0748-z

Original Article
Published: 19 July 2017

Volume 21, pages 196–203, (2018)

Gastric Cancer


Rachel Walker¹,
Jan Poleszczuk²,
Jaime Mejia³,
Jae K. Lee⁴,
Jose M. Pimiento⁵,
Mokenge Malafa⁵,
Anna R. Giuliano⁸,
Heiko Enderling¹,⁹ &
Domenico Coppola⁶,⁷

Abstract

Background

Gastric cancer is typically diagnosed at a late stage, leading to poor prognoses. Helicobacter pylori is responsible for 70% of gastric cancers globally, and patients with this bacterial infection often present with early stages of the carcinogenic pathway such as inflammation or gastritis. Although many patients continue to progress to advanced-stage disease after antibacterial treatment, there are no follow-up screening protocols for patients with a history of H. pylori.

Methods

Several biomarkers (Lgr5, CD133, CD44) become upregulated during gastric carcinogenesis. A logistic regression model is developed using clinical data from 59 patients at different stages of the carcinogenic pathway to identify the likelihood of being at an advanced stage of disease for all combinations of age, sex, and marker positivity. Using these likelihood distributions and the observed rate of marker positivity increase, time to high likelihood (probability >0.8) of advanced disease for individual patients is predicted.

Results

A strong correlation between marker positivity and disease stage was found for all three markers. Disease stage was accurately classified by the respective regression models for more than 86% of retrospective patients. Highly patient-specific predictions of time to onset of dysplasia were made, allowing the classification of 17 patients initially diagnosed with intestinal metaplasia into high-, intermediate-, or low-risk categories.

Conclusions

We present an approach designed to integrate pathology, mathematics, and statistics for detection of the earliest precancerous, treatable lesion. Given the simplicity and robustness of the framework, such a technique has the potential to guide personalized screening schedules to minimize the risk of undetected malignant transformation.

Introduction

At present, as many as 80% of gastric cancer patients reach stage IV before clinical diagnosis [1]. To reduce the particularly high mortality rate of these cancers, early intervention is essential. Although it is known that several biomarkers become upregulated during gastric carcinogenesis [2,3,4], a concerted effort is still needed to thoroughly evaluate their clinical applicability. Here, we show how these biomarkers may facilitate the development of a screening methodology for the early detection of preneoplastic lesions.

Helicobacter pylori is responsible for up to 70% of gastric cancers worldwide, initiating a carcinogenic cascade from chronic active gastritis to intestinal metaplasia, dysplasia, and ultimately carcinoma: the Correa pathway [5]. Symptoms of the early stages of this pathway are common because of the extensive inflammation and tissue damage induced by bacterial colonization, and often lead patients to the clinic at the chronic gastritis or metaplasia stage. However, despite a continued risk of progression through the carcinogenic pathway (because of damage that is not reversed following bacterial eradication [6,7,8]), no follow-up screening is routinely provided. Patients who continue to progress predominantly return to the clinic when the disease is already invasive or metastatic, at which point curative treatment is a near impossibility. If patients with a history of H. pylori infection underwent periodic follow-up, they could benefit from the early detection of progression to low-grade dysplasia. This program would allow endoscopic monitoring and early surgical intervention, with the promise of significantly improved outcomes. As such, the aim of this study is to demonstrate how the quantitative tools of mathematics and statistics may complement known biomarkers of disease progression and contribute to the development of effective screening protocols.

Lgr5, CD44, and CD133 are upregulated in gastric cancer tissue [3, 4], and their expression levels have several implications for metastasis, therapy resistance, and overall prognosis [9,10,11]. Expression of these markers also increases incrementally between each respective stage of the Correa pathway [2]. This change suggests these markers could be used as indicators of patient-specific disease progression and could be incorporated into predictive tools designed to optimize follow-up screening schedules for patients with a history of H. pylori infection.

Here, we investigated if statistical models could determine the likelihood of a patient being either early in the Correa pathway (gastritis or metaplasia) or late in the pathway (low- or high-grade dysplasia or carcinoma) based on patient age, sex, and biomarker-positive cell fraction (obtained from immunohistochemical staining of tissue samples). These models are calibrated using an initial cohort of patients at different stages of disease, and model ability to accurately classify tissue samples is demonstrated by comparing model predictions to disease stages determined by pathology. If the rate of increase of the marker-positive cell population during disease progression can be derived from existing clinical data, such a model can be used to identify the time at which the risk of progressing to low-grade dysplasia will be above a certain threshold for that patient. This time can be used to classify an individual patient’s current risk status dependent on their patient-specific input parameters (marker positivity, age, sex), and recommend follow-up screening at an optimal stage in the pathway: late enough to avoid frequent and costly overscreening, but early enough for treatment to have a high likelihood of success.

Utilizing candidate biomarkers of disease progression to improve actual clinical outcomes is a necessarily multidisciplinary challenge. The purpose of the present work is to demonstrate the potential for a quantitative framework to contribute to bridging this gap, toward early detection and intervention in cancers with typically late diagnoses.

Materials and methods

Patients and sample

Retrospective gastric biopsy samples were collected from 59 H. pylori-positive patients from the Instituto de Patologia Mejia Jimenez in Cali, Colombia during 2014 and shipped as formalin-fixed paraffin-embedded (FFPE) tissue blocks to Moffitt Cancer Center, Tampa, FL (USA). H. pylori infection prevalence is approximately 70% in Colombia, reaching even higher levels in populations residing in the south of the country and in mountainous regions [12]. This locale provides an optimal site for the analysis of H. pylori-associated disease. The prevalent bacterial strain in this region is CagA+/VacA+, the strain typically associated with carcinogenesis; because of the history of the bacterial infection, gastric cancer (GC) patients primarily presented with intestinal-type gastric cancers of the antrum and corpus. Disease stage was assessed by hematoxylin and eosin (H&E) staining, and H. pylori status was evaluated by immunohistochemical staining of endoscopy samples from each patient. Patients were selected from four different stages of the Correa pathway based on pathological analysis of histological lesions: normal gastric mucosa (NM), complete intestinal metaplasia (IM), low-grade dysplasia (DS), and adenocarcinoma (GC). Baseline patient characteristics are summarized in Table 1.

Table 1 Baseline characteristics of the studied cohort

Immunohistochemistry

All samples were stained for three putative carcinogenesis biomarkers (Lgr5, CD44, CD133). A 4-µm section of all selected blocks was stained using a Ventana Discovery XT automated system (Ventana Medical Systems, Tucson, AZ, USA) as per manufacturer’s protocol with proprietary reagents. The antibodies used were the rabbit anti-human LGR5 primary antibody (ab75850; Abcam, Cambridge, MA, USA; 1:100 dilution), rabbit anti-human CD44 primary antibody (#HPA005785; Sigma Aldrich, St. Louis, MO, USA; 1:1000 dilution), and mouse anti-human CD133 monoclonal antibody (MAB4399; Millipore, Billerica, MA, USA; 1:100 dilution). Stained slides were read by two independent pathologists. Marker positivity was quantified by fraction of epithelial cells staining. Inflammatory cell staining was not included in marker positivity scores.

Regression modeling

Logistic regression models based on several predictors (age, sex, and biomarker-positive cell fraction) were developed for estimating the probability of an individual being at either early stage (gastritis or metaplasia, stage <DS) or late stage (low-grade dysplasia or carcinoma, stage ≥DS). Note that high-grade dysplasia was observed in the surrounding tissue of several samples of gastric carcinoma; however, the most advanced histological lesion visible on the sample was used to classify stage for model development. The transition from metaplasia to low-grade dysplasia is a clinically relevant and clearly histologically defined timepoint, and if identified early could allow further clinical action in the form of endoscopic monitoring or surgical intervention.
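The model form described above can be written out explicitly. The following is a sketch assuming the standard logistic link, with one model fitted per marker and per sex (as reported in the Results); the symbols a (age), m (marker-positive cell fraction), and the coefficients β are notation introduced here for illustration and are not taken from the original tables.

```latex
\[
  P(\text{stage} < \text{DS} \mid a, m)
  = \frac{1}{1 + \exp\!\bigl(-(\beta_0 + \beta_a\, a + \beta_m\, m)\bigr)},
  \qquad
  P(\text{stage} \ge \text{DS} \mid a, m)
  = 1 - P(\text{stage} < \text{DS} \mid a, m).
\]
```

Under this form, a negative β_m means that a larger marker-positive fraction lowers the probability of being at an early stage.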

Model fitting was conducted using the binomial family of the inbuilt generalized linear model (GLM) fitting function of statistical software R; a series of regression coefficients was generated from which it was possible to evaluate the contribution of each of the respective predictors to outcome. The Wilcoxon rank-sum test with a normal approximation was used to compare marker positivity between males and females and age between males and females. The independence of continuous variables age and marker-positive fraction was tested using Pearson’s product-moment correlation test. The significance level was set at p value = 0.05. Given the relatively small sample size, jackknife resampling was conducted to evaluate the generalizability of the observed regression coefficients and subsequent model predictions. All regression modeling and statistical analyses were performed using R software.
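As an illustration of the fitting step, the sketch below fits a binomial logistic regression by Newton-Raphson in plain Python. It is not the authors' R code: the study used R's built-in GLM function, whereas this example swaps in a hand-written solver so that it is self-contained. The cohort values, the function names, and the choice to fit a single-sex model on age and marker fraction are all assumptions made for demonstration.

```python
import math

def sigmoid(z):
    # Logistic function, written to avoid overflow for large |z|.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)

def solve(A, b):
    # Gaussian elimination with partial pivoting for a small dense system.
    n = len(b)
    M = [A[i][:] + [b[i]] for i in range(n)]
    for c in range(n):
        pivot = max(range(c, n), key=lambda r: abs(M[r][c]))
        M[c], M[pivot] = M[pivot], M[c]
        for r in range(c + 1, n):
            f = M[r][c] / M[c][c]
            for k in range(c, n + 1):
                M[r][k] -= f * M[c][k]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(M[r][k] * x[k] for k in range(r + 1, n))
        x[r] = (M[r][n] - s) / M[r][r]
    return x

def fit_logistic(rows, y, iters=25):
    # Newton-Raphson maximum-likelihood fit; each row is [1, age, marker].
    k = len(rows[0])
    beta = [0.0] * k
    for _ in range(iters):
        grad = [0.0] * k
        hess = [[0.0] * k for _ in range(k)]
        for x, yi in zip(rows, y):
            p = sigmoid(sum(b * v for b, v in zip(beta, x)))
            w = p * (1.0 - p)
            for i in range(k):
                grad[i] += (yi - p) * x[i]
                for j in range(k):
                    hess[i][j] += w * x[i] * x[j]
        step = solve(hess, grad)
        beta = [b + s for b, s in zip(beta, step)]
    return beta

# Hypothetical single-sex cohort (NOT study data):
# (age in years, marker-positive fraction, 1 if stage < DS else 0)
cohort = [
    (40, 0.05, 1), (45, 0.10, 1), (50, 0.10, 1), (50, 0.30, 0),
    (50, 0.50, 1), (50, 0.70, 0), (55, 0.20, 1), (60, 0.60, 0),
    (65, 0.80, 0), (40, 0.75, 0), (65, 0.15, 1), (45, 0.40, 0),
]
rows = [[1.0, float(age), marker] for age, marker, _ in cohort]
early = [label for _, _, label in cohort]
beta = fit_logistic(rows, early)  # [intercept, age coefficient, marker coefficient]

def p_early(age, marker):
    # Fitted probability of being at stage < DS.
    return sigmoid(beta[0] + beta[1] * age + beta[2] * marker)
```

With data of this shape, the marker coefficient comes out negative, matching the sign convention discussed for Table 2. In R, the equivalent call would use `glm` with `family = binomial`.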

Classification performance

Given patient-specific input information (age, sex, marker positivity), the model generated the likelihood that the patient is at either stage <DS or stage ≥DS. For our initial cohort of 59 patients, the ability of the model to correctly classify patients into the category matching their pathology report was evaluated, and model under- and over-predictions were scored.
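The scoring of correct classifications, over-predictions, and under-predictions could be sketched as follows. The 0.5 probability cutoff and the function name are assumptions for illustration; the text does not state which cutoff was used to assign a predicted stage.

```python
def score_classification(p_late, truly_late, cutoff=0.5):
    """Count correct, over-staged, and under-staged cases.

    p_late:     model probabilities of stage >= DS
    truly_late: pathology result, True if stage >= DS
    cutoff:     assumed decision threshold (not given in the source)
    """
    correct = over = under = 0
    for p, truth in zip(p_late, truly_late):
        predicted_late = p >= cutoff
        if predicted_late == truth:
            correct += 1
        elif predicted_late:
            over += 1   # model says late, pathology says early
        else:
            under += 1  # model says early, pathology says late
    return correct, over, under

# Hypothetical probabilities and pathology labels for four patients.
result = score_classification([0.1, 0.9, 0.6, 0.3], [False, True, False, True])
```

Here `result` is `(2, 1, 1)`: two correct calls, one over-prediction, and one under-prediction.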

Follow-up screening times

Based on this regression modeling, we calculate the probability of a patient being at an advanced stage of disease (stage ≥DS) based on their marker-positive fraction and age for both females and males. From the derived probability distributions and initial patient-specific characteristics, a mathematical model of marker-positive fraction increase is used to predict the time until the risk of progressing to low-grade dysplasia is above a specific threshold (here set to 80%). For demonstrative purposes, the growth rates of the three markers were approximated based on simple first- and second-order curve fitting to the average marker-positivity data. A progression time of 2 years from clinically symptomatic gastritis to gastric cancer was selected, based on the average progression time of 12 patients in an independent cohort from the same institution.
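The time-to-threshold calculation can be illustrated with a short simulation. This is a minimal sketch assuming linear growth of the marker-positive fraction; the coefficients, the growth rate, the daily time step, and the 10-year search horizon are hypothetical values chosen for demonstration, not fitted values from the study.

```python
import math

def p_late(age_years, marker, coef):
    # Probability of stage >= DS from a logistic model; coef = (b0, b_age, b_marker).
    b0, b_age, b_marker = coef
    z = b0 + b_age * age_years + b_marker * marker
    return 1.0 / (1.0 + math.exp(-z))

def days_to_threshold(age_years, marker0, coef, rate_per_day,
                      threshold=0.8, max_days=3650):
    """First day on which P(stage >= DS) reaches the threshold.

    Assumes the marker-positive fraction grows linearly from marker0 at
    rate_per_day (capped at 1.0). Returns None if the threshold is not
    reached within max_days.
    """
    for day in range(max_days + 1):
        marker = min(1.0, marker0 + rate_per_day * day)
        if p_late(age_years + day / 365.0, marker, coef) >= threshold:
            return day
    return None

# Hypothetical coefficients: no age effect, strong marker effect.
coef = (-4.0, 0.0, 10.0)
days = days_to_threshold(age_years=50, marker0=0.30, coef=coef, rate_per_day=0.001)
```

With these illustrative numbers, the 80% threshold is crossed once the marker fraction exceeds about 0.539, which happens on day 239. A quadratic growth law could be substituted by changing the single line that updates `marker`.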

We derived model-predicted times until the likelihood of low-grade dysplasia reaches 80% for all realistic combinations of patient ages and marker-positive fractions at clinical presentation. From these times, patients could be classified as “high risk” (predicted time to progression <100 days), “intermediate risk” (predicted time to progression between 100 days and 1 year), or “low risk” (predicted time to progression >1 year). These classifications were compared for all three markers to evaluate prediction robustness and verify independent biomarker choice.
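The three risk categories translate directly into a small lookup. In this sketch the handling of the exact boundary values (100 days and 365 days both fall into "intermediate") is an assumption, since the text gives only open-ended inequalities for the high and low categories.

```python
def risk_category(days_to_progression):
    """Map a predicted time to progression (in days) to a risk category."""
    if days_to_progression < 100:
        return "high"
    if days_to_progression <= 365:
        return "intermediate"
    return "low"

# Hypothetical predicted times for three patients.
categories = [risk_category(d) for d in (45, 239, 800)]
```

For the three example patients, `categories` is `["high", "intermediate", "low"]`.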

Results

Clinical data

Immunohistochemical analysis identified a significant stepwise increase in marker-positive fraction between each stage of disease for all three markers (Fig. 1). The intensity of immunoreactivity also increased (from weak to strong) during the progression from normal tissue to metaplasia, dysplasia, and cancer. Immunoreactivity for CD44 and CD133 was localized to the cytoplasm, and the LGR5 stain had membranous localization. For Lgr5 and CD44, higher marker positivity was observed in males (Supplementary Fig. 1). As sex and marker positivity are not independent predictors, all further analysis was conducted independently for male and female cohorts. There was no statistically significant difference in age between males and females (Supplementary Fig. 2), or correlation between age and marker positivity for any of the three markers (Supplementary Fig. 3). Correlations between the three respective markers at each stage of disease and corresponding standard deviations are provided in Supplementary Table 2A, B.

Regression modeling

Regression models were fitted to patient data to evaluate the association of patient-specific characteristics with outcome (stage <DS). Table 2 shows model coefficients and intercepts for each of the three respective markers. In five of the six logistic regressions (all except CD133 in males), biomarker positivity demonstrated a strong and statistically significant association with the probability of being at stage <DS (Table 2). Negative coefficients imply that greater marker positivity decreases the likelihood of being in an early stage. As increasing marker positivity positively correlates with advancing disease stage, this was to be expected. No statistically significant association with probability of outcome was found for age.

Table 2 Estimated coefficients from three respective regression models

Resampling the cohort by systematically omitting each observation and recalculating the regression coefficients and corresponding probabilities demonstrated that the majority of model outputs remained approximately constant (within 10% of the initial calculation) (Supplementary Fig. 8). However, for as many as three patients in each group (LGR5, CD133, and CD44 for both males and females), omission led to coefficients significantly (>10%) different from the initial calculation. In all cases omission of these “outliers” generated a higher correlation of marker expression with disease stage (Supplementary Table 1); in no case did correlation decrease. This result suggests that the observed positive correlation between marker positivity and disease stage is unlikely to be an artifact of the sample size.
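The leave-one-out procedure can be sketched as follows. To keep the example short, it assumes a model with the marker fraction as the only predictor, and the data are hypothetical; the study's models also included age and were fitted separately by sex. The 10% relative-change criterion follows the text.

```python
import math

def fit_marker_model(markers, late, iters=30):
    """Newton-Raphson fit of logit P(stage >= DS) = b0 + b1 * marker."""
    b0 = b1 = 0.0
    for _ in range(iters):
        g0 = g1 = h00 = h01 = h11 = 0.0
        for x, y in zip(markers, late):
            p = 1.0 / (1.0 + math.exp(-(b0 + b1 * x)))
            w = p * (1.0 - p)
            g0 += y - p
            g1 += (y - p) * x
            h00 += w
            h01 += w * x
            h11 += w * x * x
        det = h00 * h11 - h01 * h01
        # Newton step: solve the 2x2 system (Hessian) * step = gradient.
        b0 += (h11 * g0 - h01 * g1) / det
        b1 += (h00 * g1 - h01 * g0) / det
    return b0, b1

def jackknife_slopes(markers, late, tolerance=0.10):
    """Refit with each patient omitted; flag omissions that shift the slope by more than the tolerance."""
    full_slope = fit_marker_model(markers, late)[1]
    slopes = []
    for i in range(len(markers)):
        kept_markers = markers[:i] + markers[i + 1:]
        kept_late = late[:i] + late[i + 1:]
        slopes.append(fit_marker_model(kept_markers, kept_late)[1])
    flagged = [i for i, s in enumerate(slopes)
               if abs(s - full_slope) / abs(full_slope) > tolerance]
    return full_slope, slopes, flagged

# Hypothetical marker fractions and stage labels (1 = stage >= DS).
markers = [0.05, 0.10, 0.15, 0.20, 0.30, 0.35, 0.40, 0.50, 0.55, 0.60, 0.70, 0.80]
late = [0, 0, 1, 0, 0, 1, 0, 1, 0, 1, 1, 1]
full_slope, slopes, flagged = jackknife_slopes(markers, late)
```

The indices in `flagged` correspond to the "outlier" patients in the sense used above: those whose omission changes the fitted coefficient by more than 10%.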

Classification performance

Based on marker positivity, sex, and age, the Lgr5 model correctly classified 49 of 57 cases (86%), with overestimation and underestimation in 4 cases each. For CD44, the model correctly classified 52 of 59 cases (88%), with overestimation and underestimation in 3 and 4 cases, respectively. The CD133 model correctly classified 53 of 59 cases (91%), with stage overestimation and underestimation in 1 and 4 cases, respectively. Note that not every marker was assessed in the same number of cases, as insufficient tissue was available for some patients.

Follow-up screening times

The predicted probability of a patient being at an advanced stage of disease (low- or high-grade dysplasia or carcinoma, stage ≥DS) based on Lgr5-positive fraction and age for both females and males is shown in Fig. 2, and for CD44 and CD133 in Supplementary Figs. 4 and 5. Based on patient-specific averages, the increase in Lgr5 positivity was linear in males and quadratic in females (Fig. 3). For all combinations of age, sex, and initial marker positivity, iterative increases in the continuous variables were simulated based on these increase rates. The statistical model was used to predict the likelihood of the patient having reached a stage ≥DS. At forecasted likelihoods greater than 80%, a follow-up screen may be suggested. A map of suggested follow-up screening times for all potential combinations of patient input parameters is shown in Fig. 3. Comparable analyses for markers CD44 and CD133 are included in Supplementary Figs. 6 and 7. From these suggestions, patients can be classified into high (predicted time to progression <100 days), intermediate (predicted time to progression between 100 days and 1 year), or low (predicted time to progression >1 year) risk categories. Figure 4 demonstrates that for 8 of the 17 patients from our initial cohort diagnosed with intestinal metaplasia, all three models independently apply the same classification. For a further 8 patients, classifications are in successive categories, in which case the earlier of the two would be used for follow-up recommendation. In only 1 of 17 cases are both low- and high-risk classifications made for the same patient depending on the marker used for evaluation.
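The comparison of classifications across the three markers could be sketched as below. Following the text, the earlier (more urgent) category is used when two successive categories are returned; applying the same rule to fully discordant cases is an assumption of this sketch, as are the function and label names.

```python
URGENCY = {"high": 0, "intermediate": 1, "low": 2}

def combine_markers(categories):
    """Combine per-marker risk categories into one recommendation.

    categories: dict mapping marker name to "high", "intermediate", or "low".
    Returns (recommended category, agreement label).
    """
    ranks = sorted(URGENCY[c] for c in categories.values())
    spread = ranks[-1] - ranks[0]
    agreement = {0: "concordant", 1: "successive", 2: "discordant"}[spread]
    # Use the most urgent category so that no marker's warning is ignored.
    recommended = min(categories.values(), key=URGENCY.get)
    return recommended, agreement

# Hypothetical per-marker classifications for one patient.
decision = combine_markers(
    {"Lgr5": "high", "CD44": "intermediate", "CD133": "intermediate"}
)
```

For this example patient, `decision` is `("high", "successive")`.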

Discussion

Although the high mortality in gastric cancer patients is primarily attributable to late diagnosis, no candidate biomarkers for disease progression are utilized clinically to guide screening for early detection. Several markers have been identified, including Lgr5, CD44, and CD133, that are known to not only be upregulated in gastric cancer but also to increase in a stepwise manner throughout the carcinogenic pathway. This sequential increase introduces the potential for predicting progression through a continuous pathway as opposed to only a binary outcome such as the onset of neoplasia. However, further effort is needed to find means of utilizing these markers in the clinical setting. The tools of mathematics and statistics have the power to analyze and optimize the use of these markers as predictors of progression, if sufficient data become available with which to calibrate and validate quantitative models.

The statistical tool described here was calibrated with an initial cohort of 59 patients to generate, given a set of patient-specific characteristics, the likelihood that the patient is at each respective stage of disease in the Correa pathway. For more than 88% of patient samples, all three of the predefined biomarkers of gastric carcinogenesis, when combined with clinical information on patient age and sex, accurately predicted the disease stage of the initial cohort. Only 4% of samples were underscored and 8% of samples were overscored. Without sufficient high-resolution longitudinal data, the growth of the marker-positive cell fraction was fitted to average expression levels across different patients and average disease progression times. As more data on intermediate stages in individual patients become available, more accurate, patient-specific predictive models of marker-positive cell population increase can be derived for higher accuracy predictions.

We demonstrated that increased marker-positive cell fraction during carcinogenesis may be used to identify whether a patient is at high risk of progression to an advanced stage of disease (low-grade dysplasia or later). These classifications can aid clinicians in determining a more cost-effective follow-up screening protocol on a personalized level. Based on these screening recommendations, patients can return to the clinic for a follow-up at a time that minimizes the risk of undetected progression to low-grade dysplasia but also does not require excessive and costly overscreening. If low-grade dysplasia or a later stage in the pathway is evident from pathological analysis, the patient can be submitted for closer monitoring or surgical intervention where necessary. If late-stage disease is not yet evident but progression is suggested, either histologically or by a noticeable biomarker increase, a new follow-up screening time may be suggested according to the observed growth rate in marker positivity between the patient’s initial presentation and first follow-up. Alternatively, if the patient is experiencing minimal to no persistent inflammation and demonstrates no evidence of progression by increase in marker positivity (marker positivity growth rate approximately zero), it can be assumed that antibacterial triple therapy has been successful and the patient can be removed from the screening protocol at the discretion of the physician.

Although the proposed model provides only approximate screening interval recommendations, under the current paradigm these patients would not be required to undergo any follow-up screening despite the current understanding that eradicating H. pylori does not necessarily eradicate gastric cancer risk. The current work presents a proof of concept for an integrated framework to help bridge the gap between candidate biomarkers of disease progression and currently devastating clinical outcomes of late-stage disease. With almost 1 million new cases of gastric cancer each year, a paradigm shift toward early detection and intervention should be a high priority. Importantly, the current approach is not biomarker- or cancer-specific; with thorough validation, such frameworks could be applied in the setting of other cancers for which precancerous histological lesions can be monitored and progression-associated biomarkers are established. This approach could have far-reaching implications for the early detection of cancers with typically late diagnoses resulting from currently absent or insufficient, cost-ineffective screening.

References

1. Layke JC, Lopez PP. Gastric cancer: diagnosis and treatment options. Am Fam Physician. 2004;69(5):1133–41.
2. Wang T, et al. Sequential expression of putative stem cell markers in gastric carcinogenesis. Br J Cancer. 2011;105:658–65.
3. Nosrati A, Naghshvar F, Khanari S. Cancer stem cell markers CD44, CD133 in primary gastric adenocarcinoma. Int J Mol Cell Med. 2014;3(4):279–86.
4. Zheng ZX, et al. Intestinal stem cell marker LGR5 expression during gastric carcinogenesis. World J Gastroenterol. 2013;19(46):8714–21.
5. Correa P, Haenszel W, Cuello C, Tannenbaum S, Archer M. A model for gastric cancer epidemiology. Lancet. 1975;2(7924):58–60.
6. Asaka M, Kato M, Graham DY. Prevention of gastric cancer by Helicobacter pylori eradication. Intern Med. 2010;49:633–6.
7. Graham DY, Shiotani A. The time to eradicate gastric cancer is now. Gut. 2005;54:735–8.
8. Rugge M, Cassaro M, Leo G, Farinati F, Graham DY. Helicobacter pylori and gastric cancer: both primary and secondary preventive measures are required. Arch Intern Med. 1999;159:2483–4.
9. Yiming L, et al. CD133 overexpression correlates with clinicopathological features of gastric cancer patients and its impact on survival: a systematic review and meta-analysis. Oncotarget. 2015;6(39):42019.
10. Cho SH, et al. CD44 enhances the epithelial-mesenchymal transition in association with colon cancer invasion. Int J Oncol. 2012;41:211–8.
11. Xi HQ, et al. Leucine-rich repeat-containing G protein-coupled receptor 5 is associated with invasion, metastasis, and could be a potential therapeutic target in human gastric cancer. Br J Cancer. 2014;110:2011–20.
12. Bravo LE, Cortes A, Carrascal E, Jaramillo R, Garcia LS, Bravo PE, Badel A, Bravo PA. Helicobacter pylori: patologia y prevalencia en biopsias gastricas en Colombia. Colombia Med. 2003;34:124–31.

Author information

Authors and Affiliations

Department of Integrated Mathematical Oncology, H. Lee Moffitt Cancer Center and Research Institute, Tampa, FL, 33612, USA
Rachel Walker & Heiko Enderling
Nalecz Institute of Biocybernetics and Biomedical Engineering, Polish Academy of Sciences, Warsaw, Poland
Jan Poleszczuk
Instituto de Patología Mejía Jiménez, Cali, Colombia
Jaime Mejia
Department of Biostatistics and Bioinformatics, H. Lee Moffitt Cancer Center and Research Institute, Tampa, FL, 33612, USA
Jae K. Lee
Department of Gastro Intestinal Oncology, H. Lee Moffitt Cancer Center and Research Institute, Tampa, FL, 33612, USA
Jose M. Pimiento & Mokenge Malafa
Department of Anatomic Pathology, H. Lee Moffitt Cancer Center and Research Institute, Tampa, FL, 33612, USA
Domenico Coppola
Department of Tumor Biology, H. Lee Moffitt Cancer Center and Research Institute, Tampa, FL, 33612, USA
Domenico Coppola
Department of Cancer Epidemiology, H. Lee Moffitt Cancer Center and Research Institute, Tampa, FL, 33612, USA
Anna R. Giuliano
Department of Radiation Oncology, H. Lee Moffitt Cancer Center and Research Institute, Tampa, FL, 33612, USA
Heiko Enderling

Corresponding author

Correspondence to Domenico Coppola.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Human participants

All procedures followed were in accordance with the ethical standards of the responsible committee on human experimentation (institutional and national) and with the Helsinki Declaration of 1964 and later versions. Informed consent or a substitute for it was obtained from all patients for being included in the study. All information linked to the patients is protected and de-identified, and all tissue samples are collected from retrospective cases, thus posing no risks to the patients. Identification was removed from all specimens by the IRB at the Instituto de Patología Mejía Jiménez in Cali, Colombia, both for the initial selection of the samples and for the subsequent data collection.

Electronic supplementary material

Below is the link to the electronic supplementary material.

10120_2017_748_MOESM1_ESM.tif

Supplementary Figure 1. Correlation between sex and biomarker. CD133: pval = 0.09; CD44: pval = 0.02; Lgr5: pval = 0.02 (TIFF 2260 kb)

Supplementary Figure 2. Comparison of age distribution for males and females. pval = 0.18 (TIFF 986 kb)

10120_2017_748_MOESM3_ESM.tif

Supplementary Figure 3. Correlation between age and biomarker. CD133: R² = 0.012, pval = 0.41; CD44: R² = 0.013, pval = 0.40; Lgr5: R² = 0.002, pval = 0.723 (TIFF 2324 kb)

10120_2017_748_MOESM4_ESM.tif

Supplementary Figure 4. Model-predicted probability of patient being in advanced stage (Stage ≥ DS) depending on CD44-positive fraction and age for males and females. Green circles represent patients at advanced stage (DS or GC); Blue circles represent patients at early stage (NM or IM) (TIFF 1796 kb)

10120_2017_748_MOESM5_ESM.tif

Supplementary Figure 5. Model-predicted probability of patient being in advanced stage (Stage ≥ DS) depending on CD133-positive fraction and age for males and females. Green circles represent patients at advanced stage (DS or GC); blue circles represent patients at early stage (NM or IM). Note that the significant difference in gradient of the surface plots shown in Fig. 2 is attributable to the significant difference in the clinically observed range of marker-positivity values; for CD133 a marker-positivity fraction above 40% is rarely observed, whereas for Lgr5 the distribution of marker-positivity fractions is more evenly distributed, from 0% to more than 80% (TIFF 1878 kb)

10120_2017_748_MOESM6_ESM.tif

Supplementary Figure 6a, b. Equations approximately governing the increase in positivity of CD44 in males (a) and females (b), respectively, based on patient-specific averages (indicated by blue diamonds) obtained in preliminary data (Fig. 1). Figure 3(c,d) model-suggested screening times for males (c) and females (d), respectively, based on the combined statistical regression tool (Fig. 2) and approximate growth models for the CD44-positive cell population (a, b) (TIFF 1333 kb)

10120_2017_748_MOESM7_ESM.tif

Supplementary Figure 7a, b. Equations approximately governing the increase in positivity of CD133 in males (a) and females (b), respectively, based on patient-specific averages (indicated by blue diamonds) obtained in preliminary data (Fig. 1). Figure 3(c,d) model-suggested screening times for males (c) and females (d), respectively, based on the combined statistical regression tool (Fig. 2) and approximate growth models for the CD133-positive cell population (a, b) (TIFF 1563 kb)

10120_2017_748_MOESM8_ESM.tif

Supplementary Figure 8. Upper panels of a–f demonstrate the probability of a patient being at stage >DS based on marker positivity of each of our respective markers (CD44, CD133, LGR5). For clarity, these probabilities are shown only at the midpoint of our age range (52.5 years). Each line (n = 23 for males; n = 32 for females) represents model-calculated probabilities based on regression modeling with 1 respective patient omitted. Patients for whom omission led to significant variance from the initial regression coefficient (“outliers”) have correspondingly different probability profiles. Lower panels of a–f demonstrate the probabilities generated from an all-inclusive model (black line) versus with all “outliers” excluded (red line). This exclusion leads to a significant increase in correlation of marker positivity with disease stage (Supplementary Table 1) (TIFF 9818 kb)

10120_2017_748_MOESM9_ESM.pdf

Supplementary Table 1. Change in model coefficients upon removal of all patients for whom omission significantly (>10%) alters probability distribution (PDF 29 kb)

10120_2017_748_MOESM10_ESM.tif

Supplementary Table 2. Strong correlations are observed between CD133 and CD44 (NM: p = 0.049; IM: p = 1.43E-05; DS: p = 5.59E-05; GC: p = 2.65E-05), and between CD133 and LGR5 (NM: p = 0.032; IM: p = 0.0018; DS: p = 0.00044; GC: p = 1.40E-06) (A). Note that although the pattern of behavior is qualitatively similar for CD44 and LGR5, the correlations are not statistically significant because of the large standard deviation in both these respective subgroups (B). CD133 has significantly lower standard deviation than CD44 and LGR5, making observed correlations more likely to meet quantitative requirements of significance (TIFF 129 kb)

About this article

Cite this article

Walker, R., Poleszczuk, J., Mejia, J. et al. Toward early detection of Helicobacter pylori-associated gastric cancer. Gastric Cancer 21, 196–203 (2018). https://doi.org/10.1007/s10120-017-0748-z

Received: 19 March 2017
Accepted: 08 July 2017
Published: 19 July 2017
Issue Date: March 2018
DOI: https://doi.org/10.1007/s10120-017-0748-z
