Abstract
Time-series scRNA-seq data have opened a door to elucidate cell differentiation, and in this context, the optimal transport theory has been attracting much attention. However, there remain critical issues in interpretability and computational cost. We present scEGOT, a comprehensive framework for single-cell trajectory inference, as a generative model with high interpretability and low computational cost. Applied to the human primordial germ cell-like cell (PGCLC) induction system, scEGOT identified the PGCLC progenitor population and bifurcation time of segregation. Our analysis shows TFAP2A is insufficient for identifying PGCLC progenitors, requiring NKX1-2. Additionally, MESP1 and GATA6 are also crucial for PGCLC/somatic cell segregation. These findings shed light on the mechanism that segregates PGCLC from somatic lineages. Notably, not limited to scRNA-seq, scEGOT’s versatility can extend to general single-cell data like scATAC-seq, and hence has the potential to revolutionize our understanding of such datasets and, thereby also, developmental biology.
Competing Interest Statement
The authors have declared no competing interest.
Footnotes
↵1 toshiaki.yachimura.a4{at}tohoku.ac.jp
↵2 wang.hanbo.48z{at}st.kyoto-u.ac.jp
↵3 imoto.yusuke.4e{at}kyoto-u.ac.jp
↵4 yoshida.momoko.34f{at}st.kyoto-u.ac.jp
↵5 tasaki{at}math.sci.hokudai.ac.jp
↵6 kojima.yoji.i97{at}kyoto-u.jp
↵7 yabuta.yukihiro.4w{at}kyoto-u.ac.jp
↵8 saitou{at}anat2.med.kyoto-u.ac.jp
↵9 hiraoka.yasuaki.6z{at}kyoto-u.ac.jp
We have extensively revised both the main manuscript and the supplemental materials.