User profiles for Z. Gan

Zhe Gan

Research Scientist, Apple
Verified email at apple.com
Cited by 18615

[HTML][HTML] Software for the frontiers of quantum chemistry: An overview of developments in the Q-Chem 5 package

…, AF White, MP Coons, AL Dempwolff, Z Gan… - The Journal of …, 2021 - pubs.aip.org
This article summarizes technical advances contained in the fifth major release of the Q-Chem
quantum chemistry program package, covering developments since 2015. A …

Vision-language pre-training: Basics, recent advances, and future trends

Z Gan, L Li, C Li, L Wang, Z Liu… - Foundations and Trends …, 2022 - nowpublishers.com
… Zhe Gan and Jianfeng Gao initiated the project. Zhe Gan and Linjie Li took lead in the
writing of Section 1. Linjie Li and Jianfeng Gao took lead in the writing of Section 2. Zhe Gan

[PDF][PDF] Mini review: gas cell stabilisation and gas retention in wheat bread dough

Z Gan, PR Ellis, JD Schofield - Journal of Cereal Science, 1995 - academia.edu
Gas cell stabilisation and gas retention are of considerable interest because of their technological
significance in bread making. We review recent studies in relation to the stabilisation …

Advances in molecular quantum chemistry contained in the Q-Chem 4 program package

Y Shao, Z Gan, E Epifanovsky, ATB Gilbert… - Molecular …, 2015 - Taylor & Francis
A summary of the technical advances that are incorporated in the fourth major release of the
Q-Chem quantum chemistry program is provided, covering approximately the last seven …

Modelling one‐and two‐dimensional solid‐state NMR spectra

…, B Alonso, JO Durand, B Bujoli, Z Gan… - Magnetic resonance …, 2002 - Wiley Online Library
… of the angle setting and higher order or crossterms which affect the non-symmetric satellite
transitions (Z. Gan, Rocky Mountain Conference, August 2001, Denver, CO, USA; L. Frydman, …

An empirical study of training end-to-end vision-and-language transformers

ZY Dou, Y Xu, Z Gan, J Wang, S Wang… - Proceedings of the …, 2022 - openaccess.thecvf.com
Vision-and-language (VL) pre-training has proven to be highly effective on various VL
downstream tasks. While recent work has shown that fully transformer-based VL models can be …

Git: A generative image-to-text transformer for vision and language

J Wang, Z Yang, X Hu, L Li, K Lin, Z Gan, Z Liu… - arXiv preprint arXiv …, 2022 - arxiv.org
In this paper, we design and train a Generative Image-to-text Transformer, GIT, to unify
vision-language tasks such as image/video captioning and question answering. While generative …

Attngan: Fine-grained text to image generation with attentional generative adversarial networks

…, P Zhang, Q Huang, H Zhang, Z Gan… - Proceedings of the …, 2018 - openaccess.thecvf.com
… Here, z is a noise vector usually sampled from a standard normal distribution. e is a
global sentence vector, and e is the matrix of word vectors. Fca represents the Conditioning …

Uniter: Universal image-text representation learning

…, L Li, L Yu, A El Kholy, F Ahmed, Z Gan… - European conference on …, 2020 - Springer
Joint image-text embedding is the bedrock for most Vision-and-Language (V+L) tasks,
where multimodality inputs are simultaneously processed for joint visual and textual …

Generalized decoding for pixel, image, and language

X Zou, ZY Dou, J Yang, Z Gan, L Li… - Proceedings of the …, 2023 - openaccess.thecvf.com
… Given an input image I ∈ RH×W ×3, we first use an image encoder EncI to extract
features Z. Afterwards, we use the text encoder EncT to encode a textual query T into Qt = ⟨q1 …