ABSTRACT
Pulmonary neuroendocrine cells (PNECs) are sensory epithelial cells that transmit airway status to the brain via sensory neurons and locally via CGRP and GABA. Several other neuropeptides and neurotransmitters have been detected in various species, but the number, targets, functions, and conservation of PNEC signals are largely unknown. We used scRNAseq to profile hundreds of the rare mouse and human PNECs. This revealed >40 PNEC neuropeptide and peptide hormone genes, most cells expressing unique combinations of 5-18 genes. Peptides are packaged in separate vesicles, release presumably regulated by the distinct, multimodal combinations of sensors expressed by each PNEC. Expression of cognate receptors predicts an array of local targets, and we show the new PNEC signal angiotensin directly activates innervating sensory neurons. Many signals lack lung targets so may have endocrine activity like those of PNEC- derived carcinoid tumors. PNECs are an extraordinarily rich and diverse signaling hub rivaling the enteroendocrine system.
INTRODUCTION
Pulmonary neuroendocrine cells (PNECs) are neuroepithelial cells scattered throughout the epithelium lining the airways, many as solitary cells and others in clusters of ∼30 cells (called neuroepithelial bodies or NEBs) at airway branchpoints (Scheuermann, 1997). PNECs are thought to monitor airway oxygen and respiratory status, and rapidly signal this internal sensory information via afferent sensory neurons to the brain to regulate breathing and related functions (Adriaensen et al., 2009; Cutz et al., 2013; Nonomura et al., 2017; Xu et al., 2020). Such signaling may be mediated by the classical neurotransmitter serotonin (Lauweryns et al., 1973), but PNECs also produce GABA (Schnorbusch et al., 2013) and several neuropeptides including CGRP, which can signal locally to lung goblet (GABA) and immune cells (CGRP) and may contribute to asthma (Barrios et al., 2019; Sui et al., 2018). PNECs also function as reserve stem cells that repair the surrounding epithelium after injury (Ouadah et al., 2019; Song et al., 2012; Stevens et al., 1997), and they can be transformed by loss of tumor suppressor genes into the deadliest human lung cancer, small cell lung cancer (SCLC) (Ouadah et al., 2019; Park et al., 2011; Song et al., 2012; Sutherland et al., 2011), and likely other neuroendocrine tumors such as lung carcinoids accompanied by systemic symptoms from signals secreted by the tumor (Davila et al., 1993).
Despite the physiological and clinical significance of PNECs, molecular interrogation and understanding of PNEC function and diversity (Mou et al., 2021) has lagged because the cells are so rare, comprising just ∼0.01% of human lung cells (Boers et al., 1996; Travaglini et al., 2020). Here we describe the isolation, expression profiling by single cell RNA sequencing (scRNA-seq), and analysis of hundreds of mouse and human PNECs. The results reveal an extraordinary molecular diversity of these cells, including the expressed sensors along with dozens of neuropeptides and peptide hormones whose predicted targets indicate they transmit airway sensory information throughout the lung, to the brain, and potentially to the rest of the body.
RESULTS
Enrichment and single cell RNA sequencing of mouse pulmonary neuroendocrine cells
Because PNECs are among the rarest of lung cell types, they are not found, or only poorly represented, in lung scRNA-seq studies (Han et al., 2018; Tabula Muris Consortium, 2018; Travaglini et al., 2020), www.lungmap.net). We therefore genetically labeled PNECs using a Cre recombinase-dependent fluorescent reporter (ZsGreen) by tamoxifen induction of Ascl1CreER/+; Rosa26ZsGreen/+ mice, and then depleted other, abundant lung cell populations and enriched for the labeled PNECs prior to scRNA-seq (Figs. 1A, S1A and B). Poly-adenylated mRNA from each sorted ZsGreen+ or control cell was reverse transcribed and PCR-amplified using Smart-seq2 protocol (Picelli et al., 2014), and the obtained cDNA was used to generate libraries sequenced to a depth of 105 - 106 reads/cell and quantified to determine expression levels of each gene in each cell. Cells with similar expression profiles were computationally clustered (Butler et al., 2018), and the cell type identity of each cluster was assigned based on expression of canonical lung cell type markers including Ascl1, Calca, and Chga for PNECs (Fig. S1C). After filtering low quality cells and cell doublets, we obtained high quality transcriptomes (2025 +450 genes (mean +SD) detected per cell) of 176 PNECs. We also obtained 358 other pulmonary cells (Fig. S1D) including Ascl1 lineage-labeled glial cells.
We identified dozens of mouse PNEC markers (Fig. 1B, C, Table S1) beyond the canonical four (Calca (encoding calcitonin gene-related peptide CGRP), Ascl1, Syp (synaptophysin), and Chga (chromogranin A)) (Table S2) using Wilcoxon rank sum (Seurat v2.3.4) to find differentially-expressed genes selectively expressed by PNECs, then prioritizing those with high sensitivity (expressed in >85% of PNECs), specificity (<5% of the other lung cell types), and fold-difference in mean expression over other lung cell types. Many of the identified markers were expressed at higher levels (Resp18, Pcsk1, Scg5) and were more sensitive (Resp18, Pcsk1) and specific (Chgb, Sez6l2) than the canonical PNEC markers (Fig. 1C). Among the top reported tracheal neuroendocrine cell (TNEC) markers, just over half (54% (19/35) (Montoro et al., 2018); 63% (31/49) (Plasschaert et al., 2018) were also top PNEC markers, defining a core set of airway neuroendocrine markers (Table S1). But there were also notable differences between TNECs and PNECs (e.g. TNEC-selective Cib3, Cxcl13, Scna, Mthfd2, Slc38a5, Slc26a9, St8sai, Kcnj1, and Sox1), consistent with their different development, distribution, innervation, and functions (Kuo and Krasnow, 2015; Montoro et al., 2018; Mou et al., 2021; Ouadah et al., 2019; Song et al., 2012). Selective expression of the most robust PNEC markers was validated by single molecule in situ hybridization (smFISH) (Resp18, Scg5, Fig. 1D) and immunohistochemistry (Pcsk1, see below).
PNECs express dozens of neuropeptide and peptide hormone genes
PNECS were discovered by their hallmark secretory vesicles (Scheuermann, 1997) and later shown to secrete serotonin (Lauweryns et al., 1972), GABA (Barrios et al., 2017), and several neuropeptides including CGRP (Johnson et al., 1988; Scheuermann et al., 1987) and chromogranin A (Lauweryns et al., 1987) (Table S2). To determine the full set of PNEC neurotransmitters and neuropeptides, we searched our scRNA-seq dataset for neurotransmitter biogenesis genes (Table S3) and neuropeptide and peptide hormone genes (“peptidergic” genes, Table S4) expressed in PNECs. This confirmed that PNECs are GABAergic because many (28%) expressed biosynthetic gene Gad1. Although none expressed serotonin biosynthetic genes Tph1 or Tph2, many (24%) expressed the reuptake transporter Slc6a4, suggesting they can signal through serotonin imported from other sources. Some PNECs may also use other neurotransmitters because rare cells expressed key dopaminergic (Th, Ddc), glutamatergic (Slc17a6), cholinergic (Slc18a3/VAChT), or histaminergic (Hdc) genes (Table S3).
PNECs expressed dozens of genes that encode neuropeptides and/or peptide hormones.
Thirty-one peptidergic genes were expressed in our dataset, and 30 of those plus an additional 12 were found in our second, independent dataset (see below), totaling 43 (47%) of the 91 peptidergic genes in mice (Fig. 1E, Table S4). The full set includes previously known Calca (encodes CGRP) and Chga but also some of the most biologically and medically significant neuropeptide and peptide hormone genes (e.g., Pomc (pro-opiomelanocortin), Hcrt (hypocretin/orexin), Agt (angiotensinogen), Gnrh1 (gonadotropin releasing hormone), Oxt (oxytocin), Lhb (luteinizing hormone), Crh (corticotropin-releasing hormone), Adm (adrenomedullin), Ghrl (ghrelin), Igf2 (insulin-like growth factor), Inha, Inhba and Inhbb (inhibins), Edn1 and Edn3 (endothelins), Nppa (atrial natriuretic peptide)). The seven granin genes (Chga, Chgb, Scg2, Scg3, Scg5, Pcsk1n, Vgf) and 16 others (Calca, Calcb, Cartpt, Agt, Iapp, Adcyap1, Igf1, Igf2, Lhb, Inha, Inhba, Inhbb, Adm, Oxt, Pomc, Crh) have also been detected in neuroendocrine (NE) cells outside the lung, but this is the first description of NE cell expression for 17 PNEC peptidergic genes (Dbi, Nppa, Nppb, Nppc, Nmb, Npb, Npff, Npw, Pnoc, Prok2, Gal, Gnrh1, Hcrt, Ucn2, Agrp, Thpo, Uts2).
Some of the peptidergic genes including Calca, Igf2, and granins (Chga, Chgb, Scg2, Scg3, Scg5, Pcsk1n) were expressed in most PNECs analyzed, but others were detected in minor subpopulations (Fig. 1E, Table S4). We analyzed peptidergic gene expression in our second scRNA-seq dataset of 92 PNECs that used a different strategy for PNEC labeling and capture (Ouadah et al., 2019) and obtained a similar distribution of peptidergic gene expression, with the exception of Pcsk1n (expressed in 15% of PNECs vs. 60% in original dataset) and Ucn2 (35% vs 0%) (Fig. 1E, Table S4). We also validated expression of eight peptidergic genes and determined the distribution and abundance of the expressing cells in vivo by immunostaining and multiplex smFISH. The canonical mouse PNEC marker and neuropeptide CGRP (Calca) was detected in 95% of PNECs by scRNA-seq, 95% of PNECs by immunostaining (n=237 scored cells in 3 mice) (Fig. 1F), and 94% by smFISH (n=100 scored cells in 2 mice) (Fig. 1D). Cartpt was detected in 34% of PNECs by scRNA-seq and twice that by immunostaining (68%) (Fig. 1F) and smFISH (67%) (Fig. 1G). smFISH also confirmed PNEC expression of the six other peptidergic genes examined (Scg5, Chga, Agt, Pomc, Nmb, Adcyap1) but generally gave higher percentages of expressed PNECs especially for the less abundantly expressed genes, implying smFISH is more sensitive than scRNA-seq in detecting gene expression (Figs. 1G, 2B). Indeed, when PNECs with very low expression (1-4 mRNA puncta detected per cell) were not included in the smFISH quantification, there was excellent agreement between smFISH and scRNA-seq values (Fig. 1G). The broadly-expressed peptidergic genes Calca, Scg5, Chga, and Cartpt were detected in both solitary and clustered PNECs, and the four genes detected in smaller subpopulations (Agt, Pomc, Nmb, Adcyap1) were most commonly detected in NEBs at bronchial branchpoints, with local clustering observed within a NEB of the Pomc-expressing cells (Fig. S2A).
We conclude that PNECs express dozens of peptidergic genes, nearly half (47%) of all annotated peptidergic genes and over an order of magnitude more than previously known, with most expressed in rare PNEC subpopulations. Statistical modeling of our PNEC sampling in scRNA-seq indicated we likely achieved saturation of PNEC peptidergic genes (Fig. S2B), however the value of 43 expressed genes is a lower limit because even more sensitive methods of detecting gene expression in individual cells such as smFISH could identify additional PNEC peptidergic genes.
PNECs express myriad combinations of peptidergic genes
The scRNA-seq analysis showed that individual mouse PNECs expressed 7.2 ±1.9 (mean ±S.D.) peptidergic genes, with some cells expressing up to 13 (range 2-13). Remarkably, almost every PNEC expressed a different combination of peptidergic genes: 154 peptidergic patterns were identified among the 176 PNECs analyzed (Fig. 2A). Because the number of combinations detected by scRNA-seq could be artificially inflated by “technical dropout” (failure to detect an expressed gene), we also scored peptidergic combinations by the more sensitive technique of multiplex smFISH (Fig. 2B). For the eight peptidergic genes probed in 100 PNECs, we identified 30 different cellular patterns of expression, similar to the 24 patterns detected by scRNA-seq for the same eight genes (Fig. 2C, D). Thus, each PNEC expresses multiple peptidergic genes and in an extraordinary number of combinations.
The number and diversity of neuropeptides and peptide hormones expressed by PNECs is further expanded by post-transcriptional processing. Some of the expressed genes are alternatively spliced to produce transcripts encoding different neuropeptides or hormones with distinct expression patterns and physiological functions. For example, Calca, which encodes the classical PNEC neuropeptide CGRP, can be alternatively spliced to generate transcripts encoding the thyroid hormone calcitonin that regulates calcium homeostasis (Amara et al., 1980, 1982).
Mapping of PNEC scRNA-seq reads at the Calca genomic locus revealed that Calca transcripts are alternatively spliced in PNECs (Fig. 3A); most cells expressed both calcitonin and CGRP transcripts, although the ratio varied greatly across individual PNECs and some cells exclusively expressed CGRP (20% of cells) or calcitonin (10%) (Fig. 3B). Similar results obtained by co- staining for CGRP and calcitonin proteins (Fig. 3C).
Many neuropeptide and peptide hormone mRNAs are translated as larger pre-pro- peptides that are proteolytically processed and modified to generate up to eight different neuropeptides and/or peptide hormones, typically with different expression patterns and functions (Table S5), which would further increase the PNEC signaling repertoire. A classic example is POMC, cleaved by proproprotein convertases PCSK1 and PCSK2 to generate ACTH (adrenocorticotrophic hormone), MSH (melanocyte stimulating hormone), β-endorphin as well as others by additional processing events (Fig. 3D). We surveyed expression of the processing enzyme genes in our scRNA-seq dataset (Fig. 3E) and found that nearly all PNECs expressed Pcsk1 (Figs. 1C, 3F) and a subset of those expressed Pcsk2, predicting that POMC is processed in PNECs into multiple neuropeptides and peptide hormones including ACTH, α3-MSH, and β- endorphin. Confocal imaging of PNEC immunostains for POMC, CGRP, and calcitonin showed that each co-expressed neuropeptide or peptide hormone localized largely to its own secretory vesicles (Fig. 3G), even ones expressed from the same gene (CGRP and calcitonin, Fig. 3C), implying distinct vesicular targeting and packaging pathways. Thus, post-transcriptional and post-translational processing expands the number and diversity of peptidergic signals expressed by each PNEC, and their separate vesicular packaging raises the possibility of differentially- regulated release and impact on targets.
Diverse targets of PNEC signals
To predict the direct targets of the PNEC signals, we searched our molecular cell atlas of the lung, comprising the full expression profiles of nearly all lung cell types (Travaglini et al., 2020) and pulmonary sensory neurons (Liu et al., 2021), for cells that express the cognate receptors (Table S5). The only previously defined targets of PNEC signals are a subpopulation of immune cells (IL5 lineage-positive) proposed to be attracted to PNECs by secreted CGRP in a mouse model of asthma (Sui et al., 2018), and goblet cells, which are increased in macaque and mouse models of inflammation through neurotransmitter GABA from PNECs (Barrios et al., 2019; Sui et al., 2018). Recently, CGRP from tracheal TNECs has been found to support tracheal epithelial cells following hypoxic injury (Shivaraju et al., 2021).
Receptors for serotonin and GABA, the two major PNEC neurotransmitters, are expressed by the two pulmonary sensory neuron (PSN) subtypes that innervate NEBs (PSN4 (Olfr78+) and PSN7 (Calb1+)) (Fig. 4A), identifying the first signals from PNECs to these afferent fibers that communicate pulmonary sensory information to the brain. GABA receptor genes were also expressed in goblet cells, supporting the conclusion from macaque and mouse models and identifying the specific GABA receptor subunit as GABRP, as well as in club cells, epithelial cells that neighbor PNECs (Fig. S4). Receptors for the neurotransmitters glutamate, dopamine and histamine predicted by our transcriptomic data to be produced by rare PNECs (see above) were also expressed in innervating PSNs (Fig. 4A) as well as in plasmacytoid dendritic cells (glutamate receptor Grm8) and basophils (histamine receptor Hrh4) (Fig. S4B).
Of the more than 90 neuropeptides and peptide hormones encoded by the 43 PNEC peptidergic genes, 36 have known receptors (Table S5). The lung expression patterns of these receptors are shown in Fig. 4B, identifying dozens of lung cell types that can directly receive PNEC peptidergic signals. Indeed, every lung cell type expressed a receptor for at least one PNEC peptidergic signal, and most expressed receptors for multiple signals. This suggests that PNECs can function as a signaling hub broadcasting airway sensory information to cells throughout the lung. The richest targets by far were the innervating pulmonary sensory neurons PSN4 and PSN7, which expressed receptors for 17 and 19 PNEC peptidergic signals, respectively. Other rich targets included goblet cells (4 signals) and club cells (3 signals). PNECs themselves expressed IGF receptors, implying autocrine signaling. The results also suggest a broad neuroendocrine-immune signaling axis, with CGRP potentially targeting at least five different types of immune cells (monocytes, dendritic cells, T cells, and alveolar and interstitial macrophages) plus airway smooth muscle and goblet cells (Fig. 4B), and other signals targeting NK cells, B cells, T cells, and dendritic cells (PCSK1N), basophils and neutrophils (IGFs), and dendritic cells (adrenomedullin).
We experimentally validated one of the inferred signaling interactions -- the predicted angiotensin signal from PNECs to pulmonary sensory neurons. Angiotensin is among the most medically important hormones because of its key role in vasoconstriction and blood pressure regulation (Fig. S5), and because one of its processing enzymes (angiotensin converting enzyme 2, ACE2) also serves as the entry receptor for SARS and Covid-19 coronaviruses. Indeed, the lung plays an essential role in this hormone pathway by providing angiotensin converting enzyme (ACE), the target of a ubiquitous class of anti-hypertensive drugs (ACE inhibitors), that proteolytically processes circulating angiotensin I peptide (Agt I) into the potent vasoconstrictor Agt II. Our discovery that PNECs express angiotensinogen (Agt), the preprohormone for Agt II, reveals a pulmonary source of the hormone, and our molecular cell atlas points to three potential lung targets: pericytes, and the PSN4 and PSN7 pulmonary sensory neurons that innervate NEBs, each of which selectively expressed Agt II receptor gene Agtr1a (Fig. 4B).
We confirmed Agt expression in PNECs by smFISH, which showed expressing cells localized within NEBs (Fig. 4C). We also confirmed expression of its receptor Agtr1a in NEB- innervating sensory neurons by injecting Agtr1a-2A-Cre mice (Leib et al., 2017) with a Cre- responsive reporter virus (AAV-flex-tdTomato) in the vagal nodose ganglion where cell bodies of pulmonary sensory neurons reside, and found their tdTomato-labeled termini ramifying on NEBs at airway branchpoints (Fig. 4D). To determine if angiotensin can activate pulmonary sensory neurons, we labeled pulmonary sensory neurons by introducing a fluorescently-labeled wheat germ agglutinin (WGA-A647) into the lung, and allowing 4-5 days for label uptake at sensory neuron termini and retrograde transport to their cell bodies in the vagal ganglia (Fig. 4E). We then isolated and cultured vagal ganglion cells and visualized their neuronal activity by Fluo- 4 calcium imaging during perfusion of Agt II into the imaging chamber. Agt II activated 1-15% of cultured pulmonary sensory neurons, as well as rare sensory neurons from other organs (Fig. 4F). We identified responding pulmonary sensory neurons as the PSN4 subtype: they expressed Agtr1a-2A-Cre lineage-label and stained positive for PSN4-specific marker neurotrophic receptor tyrosine kinase 1 (TRKA). PSN7 (Calb1+) neurons, which are larger but express Agtr1a at lower levels and do not express TrkA (Liu et al., 2021), were not activated under these conditions. These results suggest that in addition to its classical role as a circulating vasopressor, Agt II can function as a local neuromodulator from PNECs directly to PSN4 sensory neurons, transmitting airway sensory information to the brain.
For one-third (36%) of the PNEC peptidergic signals (calcitonin/Calca, amylin/Iapp, ACTH/Pomc, neuropeptide W/Npw, galanin/Gal, urotensin 2/Uts2, gonadotropin-releasing hormone/Gnrh1, hypocretin/Hcrt, ghrelin/Grhl, oxytocin/Oxt, prokinectin 2/Prok2, leutinizing hormone, subunit B/Lhb,thrombopoietin/Thpo), we did not detect appreciable expression of their receptors in any lung cell type or PSN (Fig. 4B). All of these classically function as circulating hormones with targets throughout the body, such as calcitonin (bones, kidneys), ACTH (adrenal cortex, adipocytes), amylin (brain stem), and inhibin (pituitary), so PNECs too may secrete these hormones into circulation. However, we cannot exclude that some have local targets but their receptors were expressed below detection or in rare or fragile cells not captured in our lung cell atlas.
PNECs are diverse, multimodal sensors
Classical physiological studies of PNECs indicate that signal secretion is triggered by a variety of stimuli including hypoxia, hypercapnia, mechanical stimuli, and allergens (Lembrechts et al., 2012; Livermore et al., 2015; Sui et al., 2018; Youngson et al., 1993). However, the full diversity of PNEC sensory functions are unknown, and the molecules that mediate these functions have only recently begun to be identified (Nonomura et al., 2017). To more fully elucidate PNEC sensory functions and the molecules that mediate them, and to determine how sensors are paired with the myriad PNEC signals described above, we curated a list of over 1500 mouse genes encoding extant mammalian sensory receptors and their homologues (Table S6) including ones previously implicated in PNEC sensory function, then searched our scRNA-seq dataset for ones selectively expressed in PNECs.
The mechanically-activated channel PIEZO2 (Fam38b) gene was specifically and broadly (>90%) expressed by PNECs (Fig. 5A, Fig. S6A), as previously described (Nonomura et al., 2017). In contrast, PIEZO1 (Fam38a) was expressed across all major lung compartments but excluded from PNECs (Fig. S6A). Mechanically-activated two-pore potassium channel TREK-2 (Kcnk10) gene (Fig. 5A, S6B) (Bang et al., 2000) and family member Kcnk16 (Fig. S6B) were selectively expressed in PNECs, whereas TREK-1 (Kcnk2) was expressed in other airway epithelial cells and almost completely excluded from PNECs (Fig. S6B). Three TRP family cation channel genes also showed selective or enriched expression in PNECs: auditory hair cell stereocilium channel Trpml3 (Mcoln3), Trpc4, and Trpv2, a noxious heat sensor also implicated in mechano- and osmoregulation (Fig. 5A, Fig. S6A). Casr, a G-protein coupled receptor implicated in ciliary mechanosensing and previously proposed to integrate NEB signals (Lembrechts et al., 2013), was also selectively expressed in PNECs (Fig. 5A, Fig. S6A), as was Lhfp15, an integral membrane protein of the auditory hair cell mechanotransduction complex.
These results support the role of PNECs as specialized airway mechanosensors and suggest potentially new mechano- or thermosensory functions mediated by proteins encoded by Kcnk10, Kcnk16, Trpml3 (Mcoln3), Trpc4, Trpv2, and Lhfp15.
PNECs are proposed to function in CO2 sensing because they can be activated by hypercapnic challenge (Lauweryns et al., 1977), and by bicarbonate and acid in vitro (Ebina et al., 1997; Livermore et al., 2015), a response dependent on carbonic anhydrase, but the proteins that mediate this function are unknown. PNECs selectively expressed the classic acid-sensing potassium channel TREK (Kcnk3), and rare PNECs expressed acid-sensing sodium channels ASIC3 (Accn3) and ASIC4 (Accn4) (Fig. 5A, S6A). Expression of the widely-distributed cytoplasmic carbonic anhydrase Car2 gene was not detected in PNECs, but some expressed membrane-bound Car12 and the carbonic anhydrase-related gene Car11 (Fig. 5A, S6A).
One of the first and still the most prominent proposed function of PNECs is as airway oxygen sensors because they can be activated by hypoxic challenge in vivo (Lauweryns et al., 1978) and in cultured lung slices or as isolated PNECs (Youngson et al., 1993). The oxygen sensing mechanism is still uncertain but the dominant hypothesis proposes that low oxygen reduces H2O2 generation by a membrane-bound NADPH oxidase (heterodimer of gp91phox/Cybb and p22phox/Cyba, plus regulatory subunits p47phox/Ncf1, p67phox/Ncf2), which inhibits an oxygen-sensitive potassium channel (Kv3.3/Kcnc3 and Kv4.3/Kcnd3) that activates L-type voltage-gated calcium channels (Fu and Cutz, 2002), triggering neurosecretion that may act locally or be propagated to the brainstem breathing center. We did not detect PNEC expression of NADPH oxidase subunits p91phox/Cybb, p47-phox/Ncf1, or p67-phox/Ncf2, and p22phox/Cyba was broadly expressed in all cells (Fig. S6A). Kv3.3/Kcnc3 and Kv4.3/Kcnd3 were detected only at low levels or in rare cells, however we found robust and selective PNEC expression of many other potassium channel genes (Fig. S6B; e.g., voltage-gated: Kcnc1, Kcnc2, Kcnb1, Kcnv1, Kcnf1, Kcnq2, Kcnq5, Kcnh2, Kcnh6, Kcnh7, Kcnh8; cyclic-nucleotide gated Na/K channel: Hcn1, Hcn2, Hcn3, Hcn4; calcium-activated: Kcnn3; 2-pore: Kcnk1, Kcnk3) that could contribute to the hypoxia-sensitive potassium current required for PNEC secretion. We also did not detect selective expression in PNECs of any of the genes (mitochondrial respiratory complexes) implicated in the mitochondrial hypothesis of oxygen sensing (Mulligan et al., 1981; Quintana et al., 2012; Stettner et al., 2011) (Fig. S6A). Thus, the identity of the acute oxygen sensor in PNECs remains uncertain, although several of the newly identified PNEC potassium channels are appealing candidates. Chronic hypoxia also influences PNECs, and hypoxia inducible factor Hif1a is expressed though not selectively in PNECs, whereas Hif3a is a PNEC- selective family member (Fig. S6A).
Primate PNECs have been proposed as volatile chemical sensors based on expression of olfactory receptors OR2W1 and OR2F1 in some solitary human PNECs, and the response of PNECs in tracheobronchial cultures to nonanal and other chemicals (Gu et al., 2014). We identified 19 olfactory and two pheromone receptor superfamily genes expressed in rare PNECs (Fig. 5A, S6A). We curiously also detected expression of photoreceptor opsin Opn1sw and non-visual opsin Opn3 (Fig. S6C) as well as two TNF receptor family genes (Tnfrsf12a, Tnfrsf21; Fig. S7A), suggesting possible light and immune sensing functions for PNECs.
Individual PNECs express multiple sensors and are predicted to sense multiple modalities. For example, one PNEC (combination 55) expressed mechanoreceptor/transducer genes Piezo2, Casr, and Lhfpl5, acid-sensitive channel Kcnk3, and chemoreceptors Olfr90, Olfr92, and Vmn2r29 (Fig. 5A,B). Individual PNECs expressed different combinations of sensory genes, indicating diversity in their sensory roles (Fig. 5A,B). Comparison of the sensors and signals expressed in each PNEC did not identify any strong correlations, suggesting that specific sensory inputs are coupled to different output signals in different PNECs.
Human PNECs also show diverse sensory, signaling and target profiles
To explore the generality and biomedical significance of the properties of mouse PNECs uncovered by scRNA-seq, we performed a similar analysis of human PNECs. Although human PNECs are also extremely rare, in our scRNA-seq study of ∼75,000 human lung cells (Travaglini et al., 2020) we obtained expression profiles of 55 PNECs. We analyzed these PNEC profiles as we did above for mouse PNECs and found that, even with this more limited sample, all the features uncovered for mouse PNECs are also apparent for human PNECs, although in more extreme form for some features and with species-specific specializations.
Human PNEC markers are largely conserved with mouse and include 26 that are more sensitive (e.g., GRP, SCGN, SCG5, BEX1), specific (SLC35D3, CPLX2), and/or highly expressed (SCG2) than the four common clinical markers (CHGA, SYP, INSM1, ASCL1) (Fig. S8). Some of the best markers are species-specific (human-specific: GRP, SCGN; mouse- specific: Resp18), and transcript isoform mapping of the human ortholog (CALCA) of the classic mouse PNEC marker CGRP (Calca) revealed alternative splicing such that only 60% of human PNECs expressed CGRP isoforms whereas all expressed calcitonin isoforms (Fig. S9), explaining why calcitonin but not CGRP is a good human PNEC marker (Weichselbaum et al., 2005) whereas both are valuable for mouse (Figs. 1B, 3C, S3).
Like mouse, human PNECs have a large and diverse signaling output. Human PNECs express biosynthetic genes for neurotransmitters serotonin (TPH2) and GABA (GAD1) (Table S8), the major neurotransmitters of mouse PNECs. Some human PNECs are also likely glutamatergic because 14% expressed glutamate vesicular transporter SLC17A6, and some may be catecholaminergic or glycinergic because rare PNECs expressed key catecholamine synthetic enzymes (DBH, PNMT) or glycine re-uptake transporter SLC6A5 (Table S8). Expression of dopaminergic genes was detected in rare mouse PNECs but none of the analyzed human PNECs.
Human PNECs expressed 40 different peptidergic genes (Fig. 6A), 45% of the 93 annotated human peptidergic genes (Table S4), with individual PNECs expressing 12.2 ±2.7 (mean ±SD, range 6-18), twice as many as mouse PNECs (Fig. 6A). Like mouse, almost every PNEC (54 of 55 cells, 98%) expressed a different combination (Fig. 6B). Two-thirds (26/40, 65%) of the expressed peptidergic genes are also expressed in mouse PNECs, although CARTPT was expressed in many fewer human PNECs (4% vs 18% in mouse) and POMC in many more (57% vs 4% in mouse) (Fig. 6A, Table S4). Remarkably, 13 of the 14 human-specific peptidergic genes encode hormones (Fig. 6A, Table S4), including some of the most biomedically significant: erythropoietin/EPO, renin/REN, five hypthothalamic releasing/inhibitory hormones and pituitary regulators (thyrotropin-releasing hormone/TRH, prolactin-releasing hormone/PRLH, gonadotropin-releasing hormone 2/GNRH2, corticotropin releasing hormone urocortin/UCN and inhibitory hormone somatostatin/SST), the common subunit of multiple pituitary hormones (glycoprotein hormone subunit A/CGA), reproductive organ developmental regulator anti-mullerian hormone/AMH, digestive hormones gastrin- releasing peptide/GRP and cholecystokinin/CCK, and the potent vasoregulators urotensin 2B/UTS2B and kininogen/KNG1. Of the 17 mouse-specific PNEC peptidergic genes (Table S4), all except three (Igf2, 72% of PNECs; Ucn2, 13%; Iapp, 7%) were detected only in rare PNECs (1-3%) so could also be rare in human PNECs and found on further profiling.
The human lung cell expression patterns of receptors for the 32 PNEC peptidergic signals with known receptors are shown in Fig. S10A, identifying potential direct targets in lung. As for mouse, expression patterns were diverse and almost all lung cell types expressed receptors for one or more signals, implying human PNECs can also transmit pulmonary sensory information throughout the lung. The predicted targets of the conserved PNEC signals were also largely conserved, for example broad stromal and vascular targeting by inhibin, immune cell targeting by CGRP and VGF, and pericyte targeting by angiotensin. Autocrine signaling appears prominent, as PNECs express receptors for ghrelin and erythropoietin, one of the human-specific signals, and almost every PNEC neurotransmitter (GABA, glutamate, dopamine, and epinephrine/norepinephrine) (Fig. S10B). Curiously, the sole PNEC autocrine signal identified in mouse, IGF2, was not detected in any profiled human PNEC (Fig. 6A, Table S4). No receptor expression was detected in lung for nearly half (45%, 15 of 33) the human PNEC peptidergic signals, including 10 of the 14 human-specific signals. While some of these signals may target pulmonary sensory neurons or rare pulmonary cells not captured in our human lung atlas, others may enter circulation and target sites beyond the lung.
Expression of sensory genes (Fig. S11, S12) indicates that, as in mouse, human PNECs are multimodal sensors with almost all cells expressing different combinations of sensor genes for diverse stimuli. These include orthologues of mouse PNEC sensors such as mechanically- activated channels PIEZO2 and KCNK10, thermosensor TRPV1, carbonic anhydrase CA11, and acid-sensitive channel TASK-1/KCNK3, plus human-specific TASK-3/KCNK9 with a more acidic range (pK 6.0-6.7) (Duprat et al., 1997). Like mouse, human PNECs express 19 different olfactory receptor genes but in a greater proportion of cells (33% vs. 11% for mouse), with individual cells expressing up to five different OLFR genes. As in mouse, NADPH-oxidase complex genes proposed as PNEC oxygen sensors were not detected or not specifically expressed by PNECs (Fig. S12), but two human PNEC OLFR genes (OR51E1/Olfr558, the most widely-expressed, in 16% of PNECs, and OR51E2/Olfr78) are a close family member and orthologue of OLFR78, which is activated by lactate and implicated in acute hypoxia-sensing in the mouse carotid body (Chang et al., 2015). OR51E1 and OR51E2 were co-expressed along with acid-sensitive channel KCNK3 in a single human PNEC (Fig. S11, combination 25), so this cell may be specialized for hypoxia or chemosensing. As in mice, rare human PNECs also expressed a pheromone receptor (VN1R1) and opsins (OPN1SW, OPN3) (Figs. S11, S12), but human PNECs also expressed bitter (TAS2R10, TAS2R31, TAS2R5) and sour taste receptors (PKD2L1, PKD1L3) and the trace amino acid receptor TAAR1.
Thus, as in mouse, almost every human PNEC is equipped to perceive multiple diverse stimuli and expresses a large number of peptidergic signals and neurotransmitters that can be received by a variety of cells within the lung and potentially throughout the body, like the signals from PNEC-derived tumors described below.
scRNA-seq profile of a human lung carcinoid: amplification of a rare PNEC
PNECs are the presumed origin of a variety of human lung neuroendocrine tumors (Travis et al., 2015) that can cause diverse ectopic hormone syndromes such as classic carcinoid syndrome (wheezing, flushing, diarrhea, increased heart rate), Cushing’s syndrome (Arioglu et al., 1998), and acromegaly (Athanassiadi et al., 2004). To determine the full sensory and signaling potential of a PNEC tumor and their relationship to those of normal PNECs, we obtained scRNA-seq profiles of NE tumor cells of a lung carcinoid from a 51 year-old female patient with onset of idiopathic hypertension in the year preceding therapeutic lung resection.
The profiled tumor cells expressed most general PNEC markers (e.g., SCG5, CHGB, SCG2, PCSK1N, CHGA, SCG3) indicating retention of PNEC identity. However, they did not recapitulate the full spectrum of PNEC diversity in their sensory and signaling gene profiles.
Some peptidergic genes such as CALCA and GRP that are expressed in almost all normal PNECs were expressed in few if any tumor cells, whereas NPW, NMB, and CARTPT that are expressed in only rare PNECs were expressed in most (NPW) or many (NMB, CARTPT) of the tumor cells (Fig. 6D). Likewise, the tumor cells lacked expression of the common PNEC mechanosensor PIEZO2 but showed broad expression of the acid-sensitive channel KCNK3 and opsin OPN1SW, which is expressed only by rare normal PNECs (Fig. S11A,B). This suggests that proliferating tumor cells retain a “memory” (albeit imperfect) of the expression profile of the PNEC from which they originated. And, because this tumor’s unusual expression profile nearly matched that of a rare normal PNEC (#50, Fig. 6B,C), which like the tumor cells expressed NPW, NMB, CARTPT, KCNK3, and OPN1SW and lacked expression of common PNEC genes CALCA, GRP, and PIEZO2 (Fig. 6D, E, Fig. S11), we suggest that this carcinoid arose by transformation of a cell similar to PNEC #50. Tumor cell expression of NPW may have caused the patient’s hypertension because the active peptide (NPW-23) is a circulating hormone that regulates vascular tone and has been proposed to play a role in the pathophysiology of hypertension (Ji et al., 2015; Yu et al., 2007).
To determine the peptidergic signaling profiles of other lung carcinoids, we analyzed peptidergic gene expression in 111 human lung carcinoids profiled by bulk RNA-seq (Alcala et al., 2019). This revealed prominent expression of 70% (28 of 40) of the normal human PNEC peptidergic genes, including new genes we identified in rare subpopulations of normal PNECs, in at least some of the tumors (Fig. S13). Ten peptidergic genes expressed by lung carcinoids, including four previously detected in clinical samples (GHRH, PENK, TAC1, VIP), were not found in our normal PNECs; however, all of these were expressed in only rare carcinoids so may be expressed in rare PNECs and if so should be identified on further PNEC profiling.
Interestingly, some of the peptidergic genes expressed in only minor subpopulations of normal PNECs (e.g., NPW, NPPA, SST, CARTPT) were detected in many carcinoids, whereas the nearly ubiquitous PNEC peptidergic gene CALCA was absent from most carcinoids. This suggests that not all normal PNECs are equally susceptible to carcinoid transformation.
By single cell expression profiling of hundreds of the exceedingly rare PNECs in mouse and human, we discovered they express over 40 peptidergic genes, nearly half of all such genes including many classic hormones. Individual PNECs express up to 18 peptidergic genes, with almost every cell expressing a distinct combination. The diversity of expressed signals is further increased by alternative splicing, and by post-translational processing as inferred from expression of prohormone processing genes. These diverse signals can directly target a wide arra y of cell types in the lung, predicted by expression of the cognate receptors, including almost every cell type across all five tissue compartments: epithelial (including putative autocrine PNEC signals), endothelial, stromal, immune, and neural. The richest targets are pulmonary sensory neurons (PSNs) that innervate PNECs. We confirmed one predicted signal to PSNs, angiotensin (Agt II, mature product of Agt), can directly activate a recently identified PSN subtype (PSN4) that innervates PNECs, expresses its receptor (Agtr1a), and projects to the brainstem to regulate respiratory rate (Liu et al., 2021); Diaz de Arce, et al., unpublished data. Hence in addition to its classical role as a circulating vasopressor whose receptor is targeted by major anti-hypertensive drugs, Agt II may serve as a neuromodulator in the breathing circuit. Eighteen other PNEC signals are also classical hormones, but unlike Agt II their receptors are not expressed in any cell types in the lung cell atlases, suggesting that PNECs could contribute to the circulating pool of these hormones. PNECs are thus extraordinarily rich and diverse signaling hubs that produce scores of neuropeptides and peptide hormones that can signal directly to many cells in the lung, to the brain through pulmonary sensory neurons, and potentially to cells throughout the body through the circulation.
PNECs are scattered throughout the airway epithelium and form large clusters at bronchial branchpoints, so they are ideally positioned to serve as sentinels that monitor inhaled air and airway status. Our single cell data imply that each PNEC is in fact a multimodal sensor, expressing a distinct combination of mechanical, thermal, and acid sensors along with carbonic anhydrases important in CO2 sensing, diverse chemosensors including olfactory receptors, vomeronasal receptors and taste receptors, and even light-sensing opsins. Different combinations of sensors are co-expressed in individual PNECs along with different combinations of signals (Figs. 5B, 6C). Because the peptidergic signals are packaged in separate vesicles, secretion of each signal could be independently regulated, presumably in response to activation of a different sensor (or sensor combination). In this way PNECs can simultaneously monitor many aspects of airway status at many positions along the airway, and selectively transmit this information to target cells in the lung, the brain, and the rest of the body.
In many ways PNECs resemble enteroendocrine cells (EECs), the sentinels scattered along the gut epithelium to monitor nutrients, microbial products, and other luminal contents and signal that information locally in the gut to coordinate ingestion, absorption, metabolism, and disposal, and throughout the body and brain to regulate mood and appetite (Bai et al., 2019; Bellono et al., 2017; Kaelberer et al., 2018). The enteroendocrine system is commonly called the “gut-brain axis” and is considered the largest endocrine organ because of its many endocrine cells and signals. Our data suggest it may be rivaled or even surpassed by the pulmonary neuroendocrine system, which expresses more than double the ∼20 signals produced by EECs (Beumer et al., 2020). Each EEC apparently expresses only one or a few peptidergic signals and neurotransmitters, which define 12 classical EEC subtypes (Worthington et al., 2018), whereas individual PNECs express 5-10 times more and their expression patterns define at least an order of magnitude more molecular subtypes. While PNECs have long been speculated to serve as local signaling centers in the lung and fast conduits of sensory information to the brain through afferent sensory neurons, our data suggest that like EECs they also serve a more global endocrine function. This would explain why the basal surface of some PNECs, where secretory vesicles are densely packed, are apposed to fenestrated capillaries (Lauweryns et al., 1973). Although PNEC contribution to circulating hormone pools under normal physiological conditions is yet to be demonstrated, they contribute at least under pathological conditions. Indeed, we found that the extensive PNEC signaling repertoire described here including most of newly identified peptidergic genes, were collectively expressed in the 111 available cases of profiled human lung carcinoids. An individual human carcinoid, however, expresses a discrete set of signals resembling that of a normal PNEC, suggesting that each such tumor amplifies the set of signals expressed by the tumor-initiating PNEC, thereby explaining the diversity of carcinoid syndromes (Limper et al., 1992; Pernow and Waldenstrom, Jan, 1957; Shalet et al., 1979). PNECs may comprise a second global signaling axis we dub “the lung-brain axis”.
This new understanding of PNEC function and their extraordinary diversity, including many sensor and signaling genes detected in only a single profiled cell, required pre-enrichment (mouse) or massive profiling (human) to obtain just the first few hundred expression profiles of these exceedingly rare cells. The neuroendocrinology of the lung our study reveals has broad implications for medicine even beyond NE cell tumors (Rudin et al., 2019; Travis et al., 2015; Young et al., 2011), including the many other pulmonary diseases such as asthma (Sui et al., 2018), SIDS (Cutz et al., 2007; Mou et al., 2021), and bronchopulmonary dysplasia (Gillan and Cutz, 1993) that have been associated with PNEC abnormalities and perhaps now extending to diseases outside the lung. The results already have implications for the Covid-19 pandemic because SARS-CoV-2 virions use an angiotensin pathway regulator (ACE2) to enter and destroy lung cells and cripple gas exchange along with the ability of the patient to sense the deficit.
Although expression of angiotensinogen and many other PNEC sensory and signaling genes are conserved so their functions can be explored in mice, we also uncovered 13 human-specific PNEC signaling genes encoding classical hormones (e.g., ACTH (POMC), GRP, TRH, AMH, CCK). Hence the lung-brain axis may be especially prominent in humans.
AUTHOR CONTRIBUTIONS
C.S.K and M.A.K conceived the project. C.S.K and S.D. designed the mouse PNEC scRNA-seq isolation experiments. C.S.K processed tissue and C.S.K and S.D. sorted cells and prepared sequencing libraries. S.D. processed and aligned sequencing data, and performed quality control.
C.S.K. and M.A.K. designed bioinformatic analyses, implemented methods, and interpreted results. Y.L. performed scRNA-seq and analysis of pulmonary sensory neurons. A.D.d.A. and C.S.K. designed and performed pulmonary sensory neuron functional imaging experiments.
T.W. performed rarefaction analysis. C.S.K. and N.A. performed immunostaining and in situ hybridization. C.S.K. created clinical protocols for human lung tumor and normal tissue acquisition and assessed clinical data. C.S.K, S.R.Q., and M.A.K. provided resources and supervised the project. C.S.K, and M.A.K. wrote the manuscript, and all authors reviewed and edited the manuscript.
DECLARATION OF INTERESTS
The authors declare no competing interests.
RESOURCE AVAILABILITY
Data and code availability
Further information and requests for resources and reagents should be directed to and will be fulfilled by the lead author Christin Kuo (ckuo{at}stanford.edu) or lead contact Mark Krasnow (krasnow{at}stanford.edu). All raw and processed data with accompanying metadata from scRNA-seq of mouse and human PNECs will be submitted to the Gene Expression Omnibus (GEO) database www.ncbi.nlm.nih.gov/geo. Raw data consists of fastq files corresponding to paired- end reads for each cell, and processed data is in the form of a raw gene counts matrix. All code used to generate data objects and plots is available at github: https://github.com/sdarmanis/Neuroendocrine_scRNA-seq
EXPERIMENTAL MODEL AND SUBJECT DETAILS
Animals
Mouse lines used were: wild-type mouse strains CD-1 and C57BL/6NJ, aged 2-4 months, tamoxifen-inducible, PNEC-specific knock-in Cre recombinase driver Ascl1CreERT2 (Kim et al., 2011a) and Cre-dependent fluorescent reporter Rosa26Zsgreen1 (Madisen et al., 2009), and knock- in Cre recombinase drivers Agtr1aCre (de Kloet et al., 2017), and Agtr1a-2A-Cre (Leib et al., 2017). Mice were maintained in 12h light/dark cycle with food and water provide ad libitum.
Genotyping was performed on tail clips with oligonucleotide primers as described (Kim et al., 2011a). All animal husbandry, maintenance, and experiments were performed in accordance with Stanford University’s IACUC-approved protocols (APLAC 9780, 32092).
Mouse PNEC labeling, enrichment, and isolation
Mouse PNECs were permanently labeled with ZsGreen by intaperitoneal injection of PN15 (n=4 mice, gender-balanced), PN85 (n=2), and PN115 (n=2) Ascl1CreERT2/+;Rosa26Zsgreen1 mice with 3 mg tamoxifen (MilliporeSigma T5648; prepared at 20 mg/ml in corn oil, stored at -20°C) to induce Cre-ERT2, and 3-7 days later (PN21, PN90, and PN120) euthanized by CO2 inhalation and cervical dislocation. For the PN21 time point, a second dose of tamoxifen (3 mg) was provided to the dam at E13.5 when PNEC progenitors robustly express Ascl1 (Kuo and Krasnow, 2015). Immediately after euthanasia, lungs (unperfused) were micro-dissected en bloc at room temperature, and the trachea and peripheral regions of each lobe removed with a razor blade. The remaining bronchiolar regions (where PNECs reside) of individual lobes were minced with a razor blade and then digested in DMEM/F12 media containing dispase at 1 U/ml (StemCell Technologies, 07923), type 4 collagenase at 10 U/ml (Worthington Biochemical, CLS-4), and elastase at 3 U/ml (Worthington Biochemical) at 37°C for 30-60 minutes. Samples were manually triturated with a 1000 ul micropipet tip every 10 minutes during the incubation to generate a uniform cell suspension. An equal volume of cold (4°C) FACS buffer [phosphate buffered saline (PBS, 137 mM NaCl, 2.7 mM KCl, 2 mM ethylenediaminetetraacetic acid (EDTA), 8 mM Na2HPO4, 2 mM KH2PO4, pH 7.4) with 10% fetal bovine serum (Thermo Fisher 10082147)] was added to quench the enzymatic reactions. All subsequent steps were carried out at 4°C. DnaseI (Stem Cell Technologies, 07900) was added to a final concentration of 5 ug/ml with mixing by tube inversion ten times during the 5 minute incubation. Cell suspensions were then filtered sequentially through 100 um (Corning 431752) and 40 um mesh filters (Corning 352340) to remove cell aggregates, then centrifuged at 400xg for 5 minutes. Cell pellets were resuspended in 2 ml of 1x RBC lysis buffer (BD Biosciences 555899) and incubated for 5 minutes to deplete red blood cells (RBCs). FACS buffer (8 ml) was added to terminate the reaction, and the suspension centrifuged at 400xg for 5 minutes, and cell pellets were resuspended in FACS buffer to 105 - 106 cells/ml. To deplete endothelial cells and monocytes, 10 ul anti-CD31 MicroBeads (Miltenyi Biotec, 130-097-418) and 10 ul anti-CD45 MicroBeads (Miltenyi Biotec, 130-052-301) were added to 1 ml of cell suspension, incubated for 15 min, and loaded on an LD MACS column (Miltenyi Biotec, 130-042-901) pre-equilibrated with FACS buffer, according to manufacturer’s instructions. The column flow-through was centrifuged at 400xg for 5 minutes, and the cell pellet was resuspended in 0.8 - 1 ml of FACS buffer to 106 -107 cells/ml.
The cell suspensions were incubated with the following antibodies: allophycocyanin (APC)–conjugated anti-mouse CD31 (BioLegend, 102409, 1:800 dilution), APC-anti-mouse CD45 (BioLegend, 103111, 1:800), APC-anti-mouse F4/80 (BioLegend 123115, clone BM8, 1:800 dilution), phycoerythrin and cyanin 7 (PE-Cy7)-conjugated anti-mouse CD326 (anti- EpCam) (Thermo Fisher Scientific, 25-5791-80, clone G8.8, 1:400 dilution). After 15 min, cells were centrifuged at 300xg x 5 minutes, the cell pellet was resuspended in 1 ml FACS buffer, and this wash step was repeated twice. The final cell suspension was flow-sorted in a FACSAriaTMII (Becton-Dickinson) using the indicated sorting gates (Fig. S1). Cells were collected in 96-well plates (BioRad, HSP9631) containing 4 ul per well of cell lysis buffer containing 4U RNase Inhibitor (Takara Bio, 2313A), 0.05% Triton X100 (Thermo Fisher), 2.5mM dNTPs (Thermo Scientific) and 2.5uM oligo-dT30VN (AAGCAGTGGTATCAACGCAGAGTACT30 VN-3′ ) (IDT) as previously described for Smart-seq2 (Picelli et al., 2014), where ‘V’ represents A,C,G, ‘N’ represents A,T,C, or G, and the synthesized product contains a mix of all possible combinations in approximately equal proportions (variance up to 10%). Plates with sorted cells were sealed with microplate sealing film, vortexed 3-5 seconds, centrifuged at 1000xg for 1 minute, and immediately placed on dry ice and stored at -80°C until complementary DNA (cDNA) generation and sequencing.
Single cell mRNA sequencing
RNA from individual sorted cells was reverse transcribed to cDNA amplified, and Illumina sequencing libraries prepared as previously described (Darmanis et al., 2017). Briefly, 96-well plates containing single-cell lysates were thawed on ice, heated to 72°C for 3 minutes and immediately put back on ice. For cDNA synthesis, 6 ul of reverse transcriptase mix (1X First- Strand Buffer (Takara Bio, 639538) with 100U SMARTScribeTM Reverse Transcriptase (Takara Bio, 639538), 10U Recombinant RNase Inhibitor (Takara, 2313A) , 8.5mM DTT (Invitrogen, P2325), 0.4mM Betaine (Sigma, B0300-5VL), 10mM MgCl2 (Invitrogen, AM9530G) and 1.6uM template switching oligonucleotide containing one locked nucleic acid-modified guanosine (+G) at 3’end (5′-AAGCAGTGGTATCAACGCAGAGTACATrGrG+G) (Exiqon)) were added to each well, and the reactions were incubated at 42°C for 90 minutes followed by 70°C for 5 minutes. For PCR amplification of cDNA, 15 ul of PCR mix (1x KAPA HiFi HotStart ReadyMix (Kapa Biosystems, KK2602) with 0.16uM 1SPCR (one step PCR) oligonucleotide (5′-AAGCAGTGGTATCAACGCAGAGT) (IDT) and 0.56 U of Lambda Exonuclease (New England Biolabs, M0262L)) was added to each well, followed by thermal-cycling at: (i) 37°C for 30 min, (ii) 95°C for 3 min, (iii) 21 cycles of 98°C for 20s, 67°C for 15s and 72°C for 4 min, and (iv) 72°C for 5 min. Amplified cDNA was purified using 0.7x AMPure XP beads (Beckman Coulter, A63880) then analyzed by capillary electrophoresis on a Fragment Analyzer (Advanced Analytical Technologies) and the concentration of cDNA (in fragment size range 500 to 5000 bp) adjusted and Nextera DNA sequencing libraries prepared as described (Darmanis et al., 2015). Libraries from wells on each plate were pooled using a Mosquito liquid handler (SPT Labtech), purified twice using 0.7x AMPure XP beads, and library pool quality assessed on a Fragment Analyzer. Libraries from 679 single cells were sequenced (75 bp paired-end reads) on a NextSeq 500 (Illumina) using High-output v2 kits (Illumina). Raw sequence reads were demultiplexed using bcl2fastq (v1.8.4, Illumina), and remaining sequences aligned to the mouse reference genome (GRCm38-mm10, UCSC, supplemented with Zsgreen1 sequence) with STAR (v2.5.2b, default parameters except Stranded set to false and Mode set to intersection-nonempty), and the number of reads that mapped to each annotated gene (gene counts) determined with HTSEQ (v0.6.1p1, default parameters except Stranded set to false and Mode set to intersection-nonempty) (Anders et al., 2015). Cells with less than 50,000 mapped reads or less than 1000 detected genes were excluded as a quality metric, leaving 534 cell expression profiles for further analysis.
Analysis of CALCA alternative RNA splicing in PNECs
Sequence reads from both mouse and human PNEC scRNA-seq datasets were aligned to mouse mm10 and human gh38 reference genomes, respectively, using STAR, and the BAM (binary compressed version of sequence alignment map) output visualized by sashimi plots of the Calca (mouse) and CALCA (human) genomic loci using Integrative Genomics Viewer (IGV v2.4.14) (Katz et al., 2010; Robinson et al., 2011) and the mouse CGRP (RefSeq ID: NM_001289444) and calcitonin (NM_001305616), and human CGRP (NM_001033953) and calcitonin (NM_001741) reference mRNA sequences.
Validation set of mouse PNEC scRNA-seq profiles
A second set of adult mouse PNECs that were lineage-labeled, purified, and profiled by scRNA-seq as described (Ouadah et al., 2019) was used as validation set. Briefly, PNECs were lineage-labeled by tamoxifen administration to adult (age 2-3 months) Ascl1CreERT2;Rosa26LSL- ZsGreen or CGRPCreERT2;Rosa26LSL-ZsGreen mice, and whole lungs (excluding trachea) were processed into a single cell suspension. Red blood cells were lysed, and endothelial and immune cells depleted using MACS. Lineage-labeled epithelial cells (ZsGreen+ EpCam+) were sorted by FACS into a single collection tube, and individual cells captured and cDNA generated using an integrated microfluidic platform (Fluidigm C1). cDNA sequencing libraries were prepared in 96 well format and sequenced on a NextSeq 500 (Illumina) device, and obtained sequences were demultiplexed, processed, aligned to individual genes, and quantified to define gene expression levels in each cell. PCA analysis was performed on cell expression patterns using highly-variably expressed genes, and identities of cell clusters with related expression patterns assigned based on enriched expression of canonical lung cell type markers. Of the 100 PNEC expression profiles obtained, the eight with “transitional” profiles were excluded from our analysis. A threshold of 5 transcripts/million (TPM) was used for determining if a gene was expressed.
Human PNECs and carcinoid scRNA-seq analysis
The expression profiles of human PNECs characterized here are from our scRNA-seq analysis of cells from histologically normal lung tissue obtained from therapeutic lobectomies and matched blood from three patients with focal lung tumors; these profiles were used to construct our comprehensive molecular cell atlas of the human lung (Travaglini et al., 2020). Among the profiled cells, a cluster of 66 PNECs was identified by their selective expression of classical markers CALCA and ASCL1. For our PNEC analysis, we excluded all 11 human PNEC profiles obtained by droplet-based 10X scRNA-seq, which had less extensive expression profiles than the ones profiled by SS2, the plate-based method used here to profile mouse PNECSs. We also excluded one SS2-profiled cell that was designated a PNEC (cell ID: C7_B002464.gencode.vH29) but was an outlier in the original PNEC cluster (Travaglini et al., 2020); we found it expressed only one PNEC neuropeptide (DBI) but not any classic PNEC markers (CALCA, CHGA, ASCL1, GRP) or our newly identified PNEC markers (SCGN, PCKS1N, SCG3, SCG5), so it is likely a related but distinct and extremely rare lung cell type.
We included in our analysis one SS2-profiled cell that was not originally designated a PNEC (Cell ID: H4.B002460.gencode.vH29), which we found expressed both classic and new PNEC markers (ASCL1, GRP, CHGB, SCGN, SCG2, SCG3, SCG5). In total, our analysis included 55 PNECS, 50 from patient 1 and 5 from patient 3. The SS2 scRNA-seq sequencing reads from these 55 PNECs were re-aligned to the primary assembly of human reference genome GRCH38 (and further analyzed as above), to exclude an alternative contig at the CHGA locus (contig KI270847.1) in the reference genome used in the original analysis (GRCH38.p12) that caused vast undercount of CHGA expression.
The scRNA-seq expression profiles of human carcinoid cells characterized here are from a parallel analysis of one of the tumors in the above study, a typical carcinoid (1.3 x 0.9 cm) in the left bronchus of the resected left lower lung lobe of patient 3, a 51 year-old female mild adult-onset asthma and recent worsening hypertension. The tumor sample was processed and profiled in parallel with the accompanying normal tissue from this patient, and the 330 cells described here are sorted cells from the epithelial compartment of the tumor sample that were analyzed by SS2 and identified as carcinoid cells by their abundance in the tumor sample and expression of many classic PNEC markers and peptidergic genes, consistent with the underlying clinical and pathological diagnosis. A full description of the carcinoid tumor expression data will be provided elsewhere.
Peptidergic and sensory genes
The comprehensive list of mouse and human neuropeptide and peptide hormones and their genes (“peptidergic genes”) (Table S4) and receptors were compiled from the literature (Kim et al., 2011b; Secher et al., 2016) and an online database (www.neuropeptides.nl), then verified and updated with newly identified receptors by PubMed literature searches (through July 2020) for each included neuropeptide and peptide hormone.
The comprehensive list of sensory genes (Tables S6, S7) was curated from literature reviews of each sensory modality including mechanosensors (Clapham, 2003; Ranade et al., 2015), thermosensors (Caterina et al., 1999; McKemy et al., 2002; Peier et al., 2002; Vandewauw et al., 2018), acid sensors (Lin et al., 2004; Tominaga et al., 1998; Waldmann et al., 1997), hypoxia sensors (Chang et al., 2015; Kumar and Prabhakar, 2012), olfactory receptors (Buck and Axel, 1991), pheromone receptors (Dulac and Axel, 1995), trace amine-associated receptors (Zucchi et al., 2009), taste receptors (Chandrashekar et al., 2006), and opsins/light sensors (Blackshaw and Snyder, 1999; Haltaufderhyde et al., 2014; Terakita, 2005), and includes all genes with biochemical, genetic, or functional data to support their role as sensors plus related members of the gene family including full families of ion channels (Yu and Catterall, 2004). We included all genes previously implicated in PNEC sensory functions including hypoxia-sensing (Buttigieg et al., 2012; Fu et al., 2000) and mechanosensory (Lembrechts et al., 2012, 2013) pathways and genes.
Mouse lung immunohistochemistry and in situ hybridization
For immunohistochemistry, adult wild type CD-1 or C57BL/6NJ mice as indicated were euthanized as above, and lungs were perfused with room temperature PBS and then inflated with 2% low-melting point agarose (ThermoFisher, UltraPure 16520050). Individual lobes were isolated, fixed at 4°C for 18 - 24 hrs in 4% paraformaldehyde (PFA) in PBS, cryoprotected in 30% sucrose/PBS solution, transferred to cryomold blocks (22 x 40 x 20 mm, VWR) and embedded as entire lobes in Optimal Cutting Temperature (O.C.T.) Compound (Tissue Tek), and stored at -80°C until sectioning. Frozen tissue blocks were sectioned with a cryostat (Leica Biosystems) and the sections (20-50 μm) washed in PBS with 0.1% Tween-20 (PBST) and then incubated with blocking solution (5% goat serum, 0.3% Triton X-100 in PBS) for 1 - 5 hrs, then washed in PBST. Washed sections were incubated with primary antibodies overnight at 4°C, washed with PBST, and then incubated with secondary antibodies at room temperature for 45 minutes followed by counterstaining with DAPI at 0.1 μg/ml in PBS to mark nuclei. Primary antibodies were: anti-PC1/Pcsk1 (rabbit, Abcam ab3532, used at 1:750 dilution), anti-CGRP (rabbit, Sigma C8198, 1:500), anti-POMC (rabbit, Phoenix Pharmaceuticals H-029-30, 1:500), anti-Cartpt (rabbit, Phoenix Pharmaceuticals H-003-62, 1:2000). Secondary antibodies were directly conjugated to Alexa-488, -555, or 633 (Invitrogen) or to Alexa 647 (donkey anti-rat, Jackson ImmunoResearch) and used at 1:250 dilution.
For multiplex single molecule FISH (smFISH), wild type mouse lungs were perfused, inflated, fixed, imbedded in O.C.T. Compound, and stored as above. Cryosections (12 μm) were probed by RNAscope Hiplex12 technology (Advanced Cell Diagnostics, 324140) according to manufacturer’s instructions. The proprietary RNAscope probes were: Mm-Agt-T1 (426941-T1), Mm-Nmb-T2 (459931-T2), Mm-Adcyap1-T3 (405911-T3), Mm-Cartpt-T5 (432001-T5), Mm- Pomc-T6 (314081-T6), Mm-Chga-T9 (447851-T9), Mm-Resp18-T11 (493871-T11), Mm-Calca-T7 (custom probe), and Mm-Scg5-T10 (custom probe). Probed sections were imaged by confocal fluorescence microscopy (Zeiss LSM 880, Airyscan mode), and images were aligned using RNAscope HiPlex Registration software and processed with Zen software (Zeiss). To resolve secretory vesicles immunostained for peptides (Figs. 3C, G), confocal images were acquired in super-resolution mode.
In vitro imaging of mouse pulmonary sensory neuron response to angiotensin
Pulmonary sensory neurons (PSNs) were prepared from adult (PN120) Agtr1aCre/+;Rosa26LSL- tdTom/+ mice that selectively label the two types of PSNs that innervate NEBs (Liu et al, unpublished data). Three to five days prior to PSN isolation, 50 ul (1 mg/mL) of a fluorescent wheat germ agglutinin (WGA-647, Thermo Fisher, W32466) was instilled into the trachea to retrograde label PSNs. After 3-5 days to allow for WGA uptake by PSNs and retrograde transport to their cell bodies in the tenth (vagus) cranial nerve ganglia, mice were euthanized as above and the vagal ganglia were dissected and immediately placed in cold-buffered Hanks Balanced Salt Solution without calcium or magnesium (HBSS, ThermoFisher, 14190144).
Ganglia were digested with 60 U papain (Worthington Biochemical, LS003126) in 1 ml HBSS (with 10 mM HEPES pH 7.4, 0.5 mM EDTA, and 0.4 mg/mL L-cysteine) for 10 minutes at 37°C. The papain solution was then replaced with 3 ml of a second enzymatic digestion solution (1.5 mg/ml collagenase IV (Worthington, LS004186) and 1 mg/ml dispase (Worthington, LS02109) in HBSS with 10 mM HEPES) and the incubation continued at 37°C for 30 minutes, with tube inversion 5 times every 10 minutes. The sample was centrifuged for 4 minutes at 400g, and the pelleted cells were resuspended in 1 ml L-15 medium (Gibco 11415) with 10 mM HEPES (pH 7.4) by 3 sequential rounds of manual trituration using custom pulled glass micropipettes (Sutter Instrument Company, Model P-87) of successively finer tip diameter (to final range 0.1-0.12 mm). Tips were pre-coated with complete L-15 medium with 10% FBS (Gemini Bio Products, 100-50, diluted in 10 mM HEPES, pH 7.4) to limit neuronal loss. The cell suspension was gently layered on 5 ml of 20% Percoll (Sigma, P4937) in L-15 medium (Gibco, 11415) and centrifuged for 9 min at 400xg to separate dissociated neurons from lower density connective tissue and smaller cells. The cell pellet was resuspended in 2 ml of L15 medium with 10 mM HEPES (pH 7.4), giving a typical yield of ∼1000 cells. Cells were transported to the imaging facility at room temperature and centrifuged for 3 minutes at 750xg. Cells were re- suspended in 100 ul of warm CO2-equilbrated DMEM/F12 medium (Gibco, 10565018), and 30 - 40 ul of the cell suspension were plated in the center of a laminin-coated inset of a poly-lysine- coated 12 mm circular Nunc glass bottom culture dish (Thermo Fisher, 150680), prepared as described below. Cells were incubated at 37°C for 60-90 minutes to initiate cell adherence to the inset; although only some neurons adhere during this period, incubations beyond 2 hours caused evaporation of the small volume of medium and decreased neuron viability. DMEM/F12 medium (500 ul) was added to each well, and the cultures were incubated at 37°C (with 5% CO2) for 12-16 hours to increase cell adherence and equilibrate cells to the culture environment prior to functional imaging. In healthy preparations, typically ∼100 - 150 neurons adhered to the inset and ∼10% formed extended projections. [We found that coating of the culture insets with fresh reagents as follows was critical for cell adherence and viability: Poly-lysine coating of clean Nunc glass bottom dishes was done by incubating the dish in a solution of 50 ug/ml poly-D- lysine (Millipore, A-003-E) in HBSS at 37°C overnight, then washing the dish with HBSS three times and removing residual solution by aspiration; laminin coating of the insets was done by covering insets with a solution of 20 ug/ml laminin (Sigma L2020) in HBSS at 4°C for at least 45 min (typically 3-4 hours), then carefully removing the laminin solution by aspiration, washing the inset 3 times with HBSS at 4°C, and leaving the inset covered in HBSS until cell plating.] For functional imaging, cells were loaded with fluorescent calcium indicator Fluo-4 by incubating cells with 10 um Fluo-4 (Invitrogen) in HBSS buffered with 10 mM HEPES (pH 7.4) for 15-20 min. Nunc glass bottom dishes with insets containing cultured neurons were placed on a perfusion chamber platform (Warner Instruments, RC-37W) on a Zeiss 880 LSM confocal microscope stage housed within an incubation chamber adjusted to 37°C and 5% CO2; platform perfusion was by gravity-dependent flow controlled with a stopcock. Cells were continuously perfused with HBSS buffered with 10 mM HEPES (pH 7.4), and fields containing retrograde- labeled pulmonary sensory neurons were identified by WGA-647 fluorescence. Calcium imaging data (Fluo-4 flourescence) were collected every second with 488 nm wavelength excitation and 500-550 nm emission. After baseline recording for 60 seconds, 500 nM Angiotensin II peptide (Sigma-Aldrich, A9525) in HBSS with 10 mM HEPES (pH 7.4) was perfused for 60 seconds, followed by a wash and re-equilibration with HBSS buffered with 10 mM HEPES (pH 7.4) for 3 minutes, and finally a 15 second infusion of 50 mM KCl in HBSS buffered with 10 mM HEPES (pH 7.4) to assess cell excitability/viability. Following the infusions, the dish was removed from the perfusion chamber and cells immediately fixed with 4% PFA at 4°C for 30-60 min for subsequent immunohistochemistry as above to confirm identity of the monitored neurons. For analysis of the time-lapse recordings, Image J software (v2.0) was used to define cell boundaries and determine the mean fluorescence intensity value for each cell in the imaging field at each time point; the obtained fluorescence values were normalized to the average baseline fluorescence value (prior to Angiotensin II exposure) for the cell.
QUANTIFICATION AND STATISTICAL ANALYSIS
Computational clustering and identification of mouse PNEC scRNA-seq profiles
Counts for each gene were normalized across cells, scaled per million and converted to logarithmic scale. Dimensionality reduction was used to compare and cluster the obtained cell expression profiles using Seurat v2.3.4 (Butler et al., 2018). First, genes with highly variable expression across the sample population were identified (‘FindVariableGenes’), selecting genes with >1 standard deviation dispersion in mean expression values. Second, the dimensionality of the expression matrix data for the highly variable genes was reduced using principal component analysis (PCA), and the significant principal components (PCs) that captured the majority of variation in the dataset were selected by their standard deviations (PCElbowPlot function) and by examining the top gene loadings in each component as heatmaps. We selected the first fifteen PCs, and the five genes with the highest PC scores along each PC were inspected for biological relevance and for canonical markers of known lung cell types. Then, the relatedness of cell expression profiles was visualized in two dimensional tSNE plots (’RunTSNE for R’ with perplexity=30). Third, genes enriched in each cluster of cells with similar expression profiles were identified using Wilcoxon rank sum test with multiple testing correction (‘FindAllMarkers’). Cell doublets were identified and removed using Scrublet (Wolock et al., 2019).
The identities of the PNEC cell cluster and the 9 other obtained cell clusters were assigned based on enriched expression of canonical lung cell type markers: PNECs (Calca, Ascl1, Chga), multiciliated cells (Foxj1, Ccdc153, Cdhr3), basal cells (Krt5, Trp63, Krt15), AT1 cells (Ager, Rtkn2), AT2 cells (Sftpc, Sftpb), club cells (Scgb3a2, Scgb1a1), endothelial cells (Pecam, Tie1), stromal cell populations 1 and 2 (Col1a1, Col1a2), and glial cells (Gfap, S100b, Plp1). The expression profiles of the 176 obtained high quality PNECs were used for all analyses.
Analysis of peptidergic gene expression in human carcinoids profiled by bulk RNA-seq
Bulk RNA-seq datasets of 239 lung neuroendocrine neoplasms (Alcala et al., 2019; Fernandez- Cuesta et al., 2014; Laddha et al., 2019) were obtained from <https://nextjournal.com/rarecancersgenomics/a-molecular-map-of-lung-neuroendocrine-neoplasms/>, and the histopathologic classification in the attributes metadata file was used to identify the 111 typical and atypical lung carcinoids whose expression profiles of peptidergic genes and PNEC markers were analyzed here. Log-transformed gene read counts were normalized by quantile normalization (Dillies et al., 2013), and the obtained values were represented as heatmaps (Fig. S13).
Rarefaction analysis to estimate saturation of PNEC neuropeptide diversity
We modeled each PNEC peptidergic expression profile as an incidence sampling of all total possible peptidergic genes expressed by the PNEC population, and estimated the peptidergic diversity using rarefaction and extrapolation analysis, a technique used in ecology to assess species richness (Chao et al., 2014). In this analogy, each ‘species’ is a peptidergic gene, and a given PNEC that may express any number of distinct peptidergic genes is analogous to a sampling of the total assemblage of species (peptidergic genes). Using the iNEXT package to estimate species richness (Hsieh et al., 2016), we constructed an integrated curve to smoothly link rarefaction (interpolation) and prediction (extrapolation), and the associated 95% confidence intervals, by bootstrapping (N=200). Only the incidence (presence or absence), and not the abundance, of each peptidergic gene RNA was used to estimate the underlying neuropeptide accumulation curve (Chao et al., 2014; Garcia-Ortega and Martinez, 2015).
SUPPLEMENTARY FIGURE LEGENDS
ACKNOWLEDGMENTS
We thank Kyle Travaglini and Ahmad Nabhan (preparation of carcinoid tumor from patient 3), Y. Ouadah (mouse Fluidigm scRNA-seq data), and all members of Krasnow lab and Jeffrey Wine (discussions and comments on manuscript); Norma Neff and members of the Quake laboratory (single cell expertise and resources); Cathy Crumpton, Brandon Carter, and Stanford FACS facility (flow sorting); Joseph Shrager and Jalen Benson (IRB protocol and tissue procurement); the patient in this study; Rong Lu (normalization of bulk RNAseq data), and Maria Peterson (figure preparation), the Pediatric Pulmonary Division, and Department of Pediatrics for space and resources. This work was supported by grants from the NIH/NHLBI (K08HL129081, C.S.K), Doris Duke Charitable Foundation (2018105, C.S.K), Howard Hughes Medical Institute (M.A.K.), Ludwig Foundation (M.A.K.), NIH/NCI 5U01CA231851 (M.A.K, C.S.K.) and the Chan Zuckerberg Initiative (M.A.K.). M.A.K. is an investigator of the Howard Hughes Medical Institute.
REFERENCES
- 1.↵
- 2.↵
- 3.↵
- 4.↵
- 5.↵
- 6.↵
- 7.↵
- 8.↵
- 9.↵
- 10.↵
- 11.↵
- 12.↵
- 13.↵
- 14.↵
- 15.↵
- 16.↵
- 17.↵
- 18.↵
- 19.↵
- 20.↵
- 21.↵
- 22.↵
- 23.↵
- 24.↵
- 25.↵
- 26.↵
- 27.↵
- 28.↵
- 29.↵
- 30.↵
- 31.↵
- 32.↵
- 33.↵
- 34.↵
- 35.↵
- 36.↵
- 37.↵
- 38.↵
- 39.↵
- 40.↵
- 41.↵
- 42.↵
- 43.↵
- 44.↵
- 45.↵
- 46.↵
- 47.↵
- 48.↵
- 49.↵
- 50.↵
- 51.↵
- 52.↵
- 53.↵
- 54.↵
- 55.↵
- 56.↵
- 57.↵
- 58.↵
- 59.↵
- 60.↵
- 61.↵
- 62.↵
- 63.↵
- 64.↵
- 65.↵
- 66.↵
- 67.↵
- 68.↵
- 69.↵
- 70.↵
- 71.↵
- 72.↵
- 73.↵
- 74.↵
- 75.
- 76.↵
- 77.↵
- 78.↵
- 79.↵
- 80.↵
- 81.↵
- 82.↵
- 83.↵
- 84.↵
- 85.↵
- 86.↵
- 87.↵
- 88.↵
- 89.↵
- 90.↵
- 91.↵
- 92.↵
- 93.↵
- 94.↵
- 95.↵
- 96.↵
- 97.↵
- 98.↵
- 99.↵
- 100.↵
- 101.↵
- 102.↵
- 103.↵
- 104.↵
- 105.↵
- 106.↵
- 107.↵
- 108.↵