Going deeper with embedded FPGA platform for convolutional neural network

…, K Guo, B Li, E Zhou, J Yu, T Tang, N Xu… - Proceedings of the …, 2016 - dl.acm.org
In recent years, convolutional neural network (CNN) based methods have achieved great
success in a large number of applications and have been among the most powerful and widely …

FP-DNN: An automated framework for mapping deep neural networks onto FPGAs with RTL-HLS hybrid templates

Y Guan, H Liang, N Xu, W Wang, S Shi… - 2017 IEEE 25th …, 2017 - ieeexplore.ieee.org
DNNs (Deep Neural Networks) have demonstrated great success in numerous applications
such as image classification, speech recognition, video analysis, etc. However, DNNs are …

FPMR: MapReduce framework on FPGA

Y Shan, B Wang, J Yan, Y Wang, N Xu… - Proceedings of the 18th …, 2010 - dl.acm.org
Machine learning and data mining are gaining increasing attentions of the computing society.
FPGA provides a highly parallel, low power, and flexible hardware platform for this domain…

Clicknp: Highly flexible and high performance network processing with reconfigurable hardware

B Li, K Tan, L Luo, Y Peng, R Luo, N Xu… - Proceedings of the …, 2016 - dl.acm.org
Highly flexible software network functions (NFs) are crucial components to enable multi-tenancy
in the clouds. However, software packet processing on a commodity server has limited …

The FATP1–DGAT2 complex facilitates lipid droplet expansion at the ER–lipid droplet interface

N Xu, SO Zhang, RA Cole, SA McKinney, F Guo… - Journal of Cell …, 2012 - rupress.org
At the subcellular level, fat storage is confined to the evolutionarily conserved compartments
termed lipid droplets (LDs), which are closely associated with the endoplasmic reticulum (…

Parallel inference for latent dirichlet allocation on graphics processing units

F Yan, N Xu, Y Qi - Advances in neural information …, 2009 - proceedings.neurips.cc
The recent emergence of Graphics Processing Units (GPUs) as general-purpose parallel
computing devices provides us with new opportunities to develop scalable learning methods …

Real-time high-quality stereo vision system in FPGA

W Wang, J Yan, N Xu, Y Wang… - IEEE Transactions on …, 2015 - ieeexplore.ieee.org
Stereo vision is a well-known technique for acquiring depth information. In this paper, we
propose a real-time high-quality stereo vision system in field-programmable gate array (FPGA). …

ForeGraph: Exploring large-scale graph processing on multi-FPGA architecture

G Dai, T Huang, Y Chi, N Xu, Y Wang… - Proceedings of the 2017 …, 2017 - dl.acm.org
The performance of large-scale graph processing suffers from challenges including poor
locality, lack of scalability, random access pattern, and heavy data conflicts. Some …

Genetic and dietary regulation of lipid droplet expansion in Caenorhabditis elegans

SO Zhang, AC Box, N Xu, J Le Men… - Proceedings of the …, 2010 - National Acad Sciences
Dietary fat accumulates in lipid droplets or endolysosomal compartments that undergo selective
expansion under normal or pathophysiological conditions. We find that genetic defects …

A low-latency FPGA implementation for real-time object detection

…, L Cheng, C Li, Y Li, G He, N Xu… - … symposium on circuits …, 2021 - ieeexplore.ieee.org
The advancement of object detection algorithms makes them widely used in autonomous
systems. However, due to high computational complexity of Convolutional Neural Networks(…