Hao Su

Ph.D Candidate

Geometric Computing Lab and Artificial Intelligence Lab
Dept. of Computer Science
Stanford University, USA

Office: S297 James H. Clark Center
Address: 318 Campus Dr, Stanford, CA 94305
Email: haosu@cs.stanford.edu

bio / google scholar / publication

News

Research Statement

My research interests are computer vision, computer graphics, and machine learning. I particularly focus on joint analysis of 2D images and 3D shapes, that enables information to flow between 2D to 3D domains bidirectionally. Potential applications include 3D reconstruction, robots, virtual/augmented reality, etc.

To lay a solid foundation for data-driven approaches, I contributed in building large-scale knowledge-base of 2D images (ImageNet) and 3D shapes (ShapeNet) as student lead, where rich semantical, visual, geometrical, and physical properties are stored.

Towards the goal, I also believe that it is essential to bridge two fields --- computer vision and computer graphics, where the former focuses on visual data analysis and the latter on geometrical and physical data analysis and synthesis. I develop tools to link 2D and 3D data based upon geometry processing and machine learning techniques, especially deep learning.

Research Overview

Computer Vision and Computer Graphics
  • Joint Analysis of 2D Images and 3D Shapes
  • Crowd-sourcing for Large-scale Dataset Construction
  • Scene Understanding
Statistics and Optimization
  • Large-scale Optimization
  • Large-scale Graph Analysis
  • Multivariate Density Estimation

Publications

Computer Vision and Computer Graphics

A Point Set Generation Network for 3D Object Reconstruction from a Single Image
Hao Su*, Haoqiang Fan*, Leonidas Guibas
arxiv
Build a generative neural network to directly output a set of unordered points. As applications, it can be used for single-image based 3D reconstruction and shape completion.
PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation
Hao Su*, Charles Qi*, Kaichun Mo, Leonidas Guibas
arxiv
Build a neural network to directly consume an unordered point cloud as input, without converting to other 3D representations such as voxel grids first. Rich theoretical and empirical analyses are provided.
Learning Shape Abstractions by Assembling Volumetric Primitives
Shubham Tulsiani, Hao Su, Leonidas Guibas, Alexei A. Efros, Jitendra Malik
arxiv
Learn to abstract polygonal meshes by a flexible number of simple primitives such as cuboids. The abstraction is category consistent.
SyncSpecCNN: Synchronized Spectral CNN for 3D Shape Segmentation
Li Yi, Hao Su, Xingwen Guo, Leonidas Guibas
arxiv
A neural network for graphs with non-isometric embeddings in a metric space. Spectral analysis, especially spectral domain synchronization, is conducted for efficient kernel weight sharing. Exemplified by part segmentation.
Volumetric and Multi-View CNNs for Object Classification on 3D Data
Hao Su*, Charles Qi*, Matthias Niessner, Angela Dai, Mengyuan Yan, Leonidas Guibas
CVPR 2016 (spotlight oral)
Novel architectures for volumetric and multi-view data classification, achieved state-of-the-art performance.
Multilinear Hyperplane Hashing
Xianglong Liu, Xinjie Fan, Cheng Deng, Hao Su, Dacheng Tao
CVPR 2016
Efficient approximate point-to-plane search.
Synthesizing Training Images for Boosting Human 3D Pose Estimation
Wenzheng Chen, Huan Wang, Yangyan Li, Hao Su, Zhenhua Wang, Chenghe Tu, Dani Lischinski, Daniel Cohen-Or, Baoquan Chen
3DV 2016 (oral)
Extend RenderForCNN (my ICCV'15 paper) for 3D human pose estimation focusing on domain adaptation and texture transfer.
ObjectNet3D: A Large Scale Database for 3D Object Recognition
Yu Xiang, Wonhui Kim, Wei Chen, Jingwei Ji, Christopher Choy, Hao Su, Roozbeh Mottaghi, Leonidas Guibas, Silvio Savarese
ECCV 2016 (spotlight oral)
A large-scale image-shape database by linking ImageNet and ShapeNet at instance level.
FPNN: Field Probing Neural Networks for 3D Data
Yangyan Li, Soeren Pirk, Hao Su, Charles R. Qi, Leonidas J. Guibas
NIPS 2016
Very efficient 3D deep learning for volumetric data processing, by learning to sense a subset of the space.
3D Attention-Driven Depth Acquisition for Object Identification
Kai Xu, Yifei Shi, Lintao Zheng, Junyu Zhang, Min Liu, Hui Huang, Hao Su, Daniel Cohen-Or, Baoquan Chen
Transactions on Graphics (SIGGRAPH ASIA 2016)
Teach robots to identify objects with few scannings and movements by a 3D attention model and reinforcement learning.
Unsupervised Texture Transfer from Images to Model Collections
Tuanfeng Y. Wang, Hao Su, Qixing Huang, Jingwei Huang, Leonidas J. Guibas, Niloy J. Mitra
Transactions on Graphics (SIGGRAPH ASIA 2016)
Transfer textures from product images to 3D shapes. The increased texture variation in ShapeNet is validated to be effective for RenderForCNN (my ICCV'15 paper).
A Scalable Active Framework for Region Annotation in 3D Shape Collections
Li Yi, Vladimir G. Kim, Duygu Ceylan, I-Chao Shen, Mengyuan Yan, Hao Su, Cewu Lu, Qixing Huang, Alla Sheffer, Leonidas Guibas
Transactions on Graphics (SIGGRAPH ASIA 2016)
Annotate the parts for ShapeNet by crowd-sourcing and label propagation with high efficiency and accuracy.
SHREC’16 Track: Large-Scale 3D Shape Retrieval from ShapeNet Core55
M. Savva, F. Yu, Hao Su, M. Aono, B. Chen, D. Cohen-Or, W. Deng, H. Su, S. Bai, X. Bai, N. Fish, J. Han, E. Kalogerakis, E. G. Learned-Miller, Y. Li, M. Liao, S. Maji, A. Tatsuma, Y. Wang, N. Zhang, Z. Zhou
EuroGraphics SHREC2016 Workshop Report
Technical report for SHREC'16, the most renowned challenge for 3D shape retrieval.
ShapeNet: An Information-Rich 3D Model Repository
Angel X. Chang, Thomas Funkhouser, Leonidas Guibas, Pat Hanrahan, Qixing Huang, Zimo Li, Silvio Savarese, Manolis Savva*, Shuran Song, Hao Su*, Jianxiong Xiao, Li Yi, Fisher Yu
Corresponding author, student lead
The official report of ShapeNet, an object-centric database of semantics, geometry and physics.
3D-Assisted Image Feature Synthesis for Novel Views of an Object
Hao Su*, Fan Wang*, Li Yi, Leonidas Guibas
ICCV 2015 (oral, acceptance rate: 2%)
Synthesize features at novel views of a 3D object from the observe viewpoint, leveraging on the geometric priors from ShapeNet.
Render for CNN: Viewpoint Estimation in Images Using CNNs Trained with Rendered 3D Model Views
Hao Su*, Charles Qi*, Yangyan Li, Leonidas Guibas
ICCV 2015 (oral, acceptance rate: 2%)
Show that large-scale synthetic data rendered from virtual world may greatly benefit deep learning to work in real world.
Joint Embeddings of Shapes and Images via CNN Image Purification
Hao Su*, Yangyan Li*, Charles Qi, Noa Fish, Daniel Cohen-Or, Leonidas Guibas
Transactions on Graphics (SIGGRAPH Asia 2015)
Learn to map 3D shapes and 2D images to a joint canonical space.
ImageNet Large Scale Visual Recognition Challenge
Olga Russakovsky*, Jia Deng*, Hao Su, Jonathan Krause, Sanjeev Satheesh, Sean Ma, Zhiheng Huang, Andrej Karpathy, Aditya Khosla, Michael Bernstein, Alexander C. Berg, Li Fei-Fei
IJCV 2015
The technical report for ImageNet Challenge.
Estimating Image Depth using Shape Collections
Hao Su, Qixing Huang, Niloy Mitra, Yangyan Li, Leonidas Guibas
Transactions on Graphics (SIGGRAPH 2014)
Learn to estimate the depth from a single input image assisted by geometric priors from a 3D shape collection (later merged to ShapeNet).
Fine-Grained Semi-Supervised Labeling of Large Shape Collections
Qixing Huang, Hao Su, Leonidas Guibas
Transactions on Graphics (SIGGRAPH Asia 2013)
Fine-grained 3D shape classification.
Multi-level structured image coding on high-dimensional image representation
Li-Jia Li*, Jun Zhu*, Hao Su, Eric. P. Xing, Li Fei-Fei
ACCV 2013
Multi-layer sparse coding for compressing ObjectBank representation.
Crowd-sourcing Annotations for Visual Object Detection
Hao Su, Jia Deng, Li Fei-Fei
AAAI 2012 Human Computation Workshop
A system to annotate object bounding boxes for ImageNet by crowd-sourcing.
Object Bank: A High-Level Image Representation for Scene Classification & Semantic Feature Sparsification
Hao Su*, Li-Jia Li*, Eric.P. Xing, Li Fei-Fei
NIPS 2010 (top 10 most cited paper in NIPS since 2010)
Learn to describe scenes by objects. Can be viewed as a layer-wise trained CNN (Gradient-HoG-Part-Object-Scene hierarchy).
Learning a dense multi-view representation for detection, viewpoint classification and synthesis of object categories
Hao Su*, Min Sun*, Li Fei-Fei, Silvio Savarese
ICCV 2009 (oral, acceptance rate: 4%)
Continuous viewpoint estimation by a graphical model.
A Multi-View Probabilistic Model for 3D Object Classes
Hao Su*, Min Sun*, Li Fei-Fei, Silvio Savarese
CVPR 2009
Discrete viewpoint estimation by a graphical model.
Construction and Analysis of a Large Scale Image Ontology
Jia Deng, Hao Su, Minh Do, Kai Li, Li Fei-Fei
VSS 2009
ImageNet analysis paper.

Statistics and Optimization

Density Estimation via Discrepancy
Kun Yang, Hao Su, Wing Wong
arXiv:1509.06831, 2015
co-BPM: a Bayesian Model for Estimating Divergence and Distance of Distributions
Kun Yang, Hao Su, Wing Wong
arXiv:1410.0726, 2014
Reverse Top-k Search using Random Walk with Restart
Adams Wei Yu, Nikos Mamoulis, Hao Su
VLDB 2014
Efficient Euclidean Projections onto the Intersection of Norm Balls
Hao Su*, Adams W. Yu*, Li Fei-Fei
ICML 2012

Misc

Pathlet Learning for Compressing and Planning Trajectories
Chen Chen, Hao Su, Qixing Huang, Lin Zhang, Leonidas Guibas
SIGSPATIAL 2013

Academic calendar

Click here