User profiles for Han Cai

Han Cai

Massachusetts Institute of Technology
Verified email at mit.edu
Cited by 7527

Path-level network transformation for efficient architecture search

H Cai, J Yang, W Zhang, S Han… - … conference on machine …, 2018 - proceedings.mlr.press
We introduce a new function-preserving transformation for efficient neural architecture
search. This network transformation allows reusing previously trained networks and existing …

Efficient architecture search by network transformation

H Cai, T Chen, W Zhang, Y Yu, J Wang - Proceedings of the AAAI …, 2018 - ojs.aaai.org
Techniques for automatically designing deep neural network architectures such as reinforcement
learning based approaches have recently shown promising results. However, their …

Tinytl: Reduce memory, not parameters for efficient on-device learning

H Cai, C Gan, L Zhu, S Han - Advances in Neural …, 2020 - proceedings.neurips.cc
Efficient on-device learning requires a small memory footprint at training time to fit the tight
memory constraint. Existing work solves this problem by reducing the number of trainable …

Apq: Joint search for network architecture, pruning and quantization policy

…, J Lin, Z Liu, H Wang, Y Lin, S Han - Proceedings of the …, 2020 - openaccess.thecvf.com
We present APQ, a novel design methodology for efficient deep learning deployment. Unlike
previous methods that separately optimize the neural network architecture, pruning policy, …

Proxylessnas: Direct neural architecture search on target task and hardware

H Cai, L Zhu, S Han - arXiv preprint arXiv:1812.00332, 2018 - arxiv.org
Neural architecture search (NAS) has a great impact by automatically designing effective
neural network architectures. However, the prohibitive computational demand of conventional …

Once-for-all: Train one network and specialize it for efficient deployment

H Cai, C Gan, T Wang, Z Zhang, S Han - arXiv preprint arXiv:1908.09791, 2019 - arxiv.org
We address the challenging problem of efficient inference across many devices and resource
constraints, especially on edge devices. Conventional approaches either manually design …

Product-based neural networks for user response prediction

Y Qu, H Cai, K Ren, W Zhang, Y Yu… - 2016 IEEE 16th …, 2016 - ieeexplore.ieee.org
Predicting user responses, such as clicks and conversions, is of great importance and has
found its usage inmany Web applications including recommender systems, websearch and …

Hat: Hardware-aware transformers for efficient natural language processing

…, Z Wu, Z Liu, H Cai, L Zhu, C Gan, S Han - arXiv preprint arXiv …, 2020 - arxiv.org
Transformers are ubiquitous in Natural Language Processing (NLP) tasks, but they are
difficult to be deployed on hardware due to the intensive computation. To enable low-latency …

Long text generation via adversarial training with leaked information

J Guo, S Lu, H Cai, W Zhang, Y Yu… - Proceedings of the AAAI …, 2018 - ojs.aaai.org
Automatically generating coherent and semantically meaningful text has many applications
in machine translation, dialogue systems, image captioning, etc. Recently, by combining with …

Lite pose: Efficient architecture design for 2d human pose estimation

…, M Li, H Cai, WM Chen, S Han - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
Pose estimation plays a critical role in human-centered vision applications. However, it is
difficult to deploy state-of-the-art HRNet-based pose estimation models on resource-…