Publications

Google Scholar

Journal/Book Chapter

Semantic Highlight Retrieval and Term Prediction
TIP, 2017
Min Sun, Kuo-Hao Zeng, Yen-Chen Lin, and Ali Farhadi [pdf]
Ranking Highlights in Personal Videos by Analyzing Edited Videos
TIP, 2016
M. Sun, A. Farhadi, Tseng-Hung Chen, and S. Seitz [pdf] [tech]
Salient Montages from Unconstrained Videos
TPAMI, 2016
M. Sun, A. Farhadi, B. Taskar, and S. Seitz [pdf]
Relating Things and Stuff via Object Property Interactions
TPAMI, 2013
M. Sun, B. Kim, and S. Savarese [pdf] [project] [bibtex]
Object detection, shape recovery, and 3D modelling by depth-encoded Hough voting
CVIU, 2013
M. Sun, S. S. Kumar, G. Bradski, and S. Savarese [pdf]
Model-based object recognition
Book Chapter, Encyclopedia of Computer Vision, Springer 2012
M. Sun and S. Savarese [link]
Object Detection using Geometrical Context Feedback
IJCV, 2012
M. Sun, S. Ying-Ze Bao, and S. Savarese [pdf]
Toward coherent object detection and scene layout understanding (editor's choice)
IVC, 2012
S. Yingze Bao, M. Sun, S. Savarese [pdf] [bibtex]
Make3D: Learning 3-D Scene Structure from a Single Still Image
TPAMI, 2008
Ashutosh Saxena, Min Sun, Andrew Y. Ng [pdf]

Conference/Workshop

BiFuse: Monocular 360 Depth Estimation via Bi-Projection Fusion
CVPR, 2020
Fu-En Wang*, Yu-Hsuan Yeh*, Min Sun, Wei-Chen Chiu, Yi-Hsuan Tsai (*indicates equal contribution) [pdf] [website] [code]
360SD-Net: 360° Stereo Depth Estimation with Learnable Cost Volume
ICRA, 2020
Ning-Hsu Wang, Bolivar E. Solarte, Wei-Chen Chiu, Yi-Hsuan Tsai, Min Sun [pdf] [website] [code]
InstaNAS: Instance-aware Neural Architecture Search
AAAI, 2020
An-Chieh Cheng*, Chieh Hubert Lin*, Da-Cheng Juan, Wei Wei, Min Sun (*indicates equal contribution) [pdf]
360-Indoor: Towards Learning Real-World Objects in 360° Indoor Equirectangular Images
WACV, 2020
Shih-Han Chou, Cheng Sun, Wen-Yen Chang, Wan-Ting Hsu, Min Sun, Jianlong Fu. [pdf] [website]
QoS-aware Neural Architecture Search
NeurIPS workshop, 2019
An-Chieh Cheng, Chieh Hubert Lin, Da-Cheng Juan, Wei Wei, Min Sun [pdf]
360SD-Net: 360° Stereo Depth Estimation with Learnable Cost Volume
ICCV workshop, 2019
Ning-Hsu Wang, Bolivar E. Solarte, Wei-Chen Chiu, Yi-Hsuan Tsai, Min Sun
Joint Monocular 3D Detection and Tracking
ICCV, 2019
Hou-Ning Hu, Qi-Zhi Cai, Dequan Wang, Ji Lin, Min Sun, Philipp Krähenbühl, Trevor Darrell, Fisher Yu. [code]
Point-to-Point Video Generation
ICCV, 2019
Tsun-Hsuan Wang*, Yen-Chi Cheng*, Chieh Hubert Lin, Hwann-Tzong Chen, Min Sun (*indicates equal contribution) [pdf] [website] [code]
3D LiDAR and Stereo Fusion using Stereo Matching Network with Conditional Cost Volume Normalization
IROS, 2019
Tsun-Hsuan Wang, Hou-Ning Hu, Chieh Hubert Lin, Yi-Hsuan Tsai, Wei-Chen Chiu, Min Sun [pdf] [website] [code]
DuLa-Net: A Dual-Projection Network for Estimating Room Layouts from a Single RGB Panorama
CVPR, 2019
Shang-Ta Yang, Fu-En Wang, Chi-Han Peng, Peter Wonka, Min Sun, Hung-Kuo Chu [pdf] [website]
HorizonNet: Learning Room Layout with 1D Representation and Pano Stretch Data Augmentation
CVPR, 2019
Cheng Sun, Chi-Wei Hsiao, Min Sun, Hwann-Tzong Chen [pdf] [website] [code]
Plug-and-Play: Improve Depth Estimation via Sparse Data Propagation
ICRA, 2019
Tsun-Hsuan Wang, Fu-En Wang, Juan-Ting Lin, Yi-Hsuan Tsai, Wei-Chen Chiu, Min Sun [pdf] [website]
Unsupervised Stylish Image Description Generation via Domain Layer Norm
AAAI, 2019
Cheng Kuan Chen*, Zhu Feng Pan*, Min Sun, Ming-Yu Liu (*indicates equal contribution) [pdf]
Leveraging Sequence Embedding and Convolutional Neural Network for Protein Function Prediction
NeurIPS workshop, 2018
Wei-Cheng Tseng, Po-Han Chi, Jia-Hua Wu, Min Sun
Learning a Multi-Modal Policy via Imitating Demonstrations with Mixed Behaviors
NeurIPS workshop, 2018
Fang-I Hsiao, Jui-Hsuan Kuo, Min Sun [pdf]
Radiotherapy Target Contouring with Convolutional Gated Graph Neural Network
NeurIPS ML4H Workshop Spotlight, 2018
Chun-Hung Chao, Yen-Chi Cheng, Hsien-Tzu Cheng, Chi-Wen Huang, Tsung-Ying Ho, Chen-Kan Tseng, Le Lu, Min Sun
Self-Supervised Learning of Depth and Camera Motion from 360° Videos
ACCV, 2018
Fu-En Wang*, Hou-Ning Hu*, Hsien-Tzu Cheng*, Juan-Ting Lin, Shang-Ta Yang, Meng-Li Shih, Hung-Kuo Chu, Min Sun (*indicates equal contribution) [pdf] [website]
Searching Toward Pareto-optimal Device-aware Neural Architectures
ICCAD, 2018
An-Chieh Cheng, Jin-Dong Dong, Chi-Hung Hsu, Shu-Huan Chang, Min Sun, Shih-Chieh Chang, Jia-Yu Pan, Yu-Ting Chen, Wei Wei, Da-Cheng Juan [pdf]
Liquid Pouring Monitoring via Rich Sensory Inputs
ECCV, 2018
Tz-Ying Wu*, Juan-Ting Lin*, Tsun-Hsuan Wang, Chan-Wei Hu, Juan Carlos Niebles, Min Sun (*indicates equal contribution) [pdf] [website]
Leveraging Motion Priors in Videos for Improving Human Segmentation
ECCV, 2018
Yu-Ting Chen, Wen-Yen Chang, Hai-Lun Lu, Tingfan Wu, Min Sun [pdf] [website]
Efficient Uncertainty Estimation for Semantic Segmentation in Videos
ECCV, 2018
Po-Yu Huang, Wan-Ting Hsu, Chun-Yueh Chiu, Ting-Fan Wu, Min Sun [pdf] [website]
DPP-Net: Device-aware Progressive Search for Pareto-optimal Neural Architectures
ECCV, 2018
Jin-Dong Dong, An-Chieh Cheng, Da-Cheng Juan, Wei Wei, Min Sun [pdf] [website]
A Unified Model for Extractive and Abstractive Summarization using Inconsistency Loss
ACL Oral, 2018
Wan-Ting Hsu, Chieh-Kai Lin, Ming-Ying Lee, Kerui Min, Jing Tang, Min Sun [pdf] [website] [code]
PPP-Net: Platform-aware Progressive Search for Pareto-optimal Neural Architectures
ICLR workshop, 2018
Jin-Dong Dong, An-Chieh Cheng, Da-Cheng Juan, Wei Wei, Min Sun [pdf] [openreview]
Omnidirectional CNN for Visual Place Recognition and Navigation
ICRA, 2018
Tsun-Hsuan Wang*, Hung-Jui Huang*, Juan-Ting Lin, Chan-Wei Hu, Kuo-Hao Zeng, Min Sun (*indicates equal contribution) [pdf] [website]
Cube Padding for Weakly-Supervised Saliency Prediction in 360° Videos
CVPR, 2018
Hsien-Tzu Cheng, Chun-Hung Chao, Jin-Dong Dong, Hao-Kai Wen, Tyng-Luh Liu, Min Sun [pdf] [website]
Self-view Grounding Given a Narrated 360° Video.
AAAI, 2018
Shih-Han Chou, Yi-Chun Chen, Kuo-Hao Zeng, Hou-Ning Hu, Jianlong Fu, Min Sun [pdf] [website]
Visual Forecasting by Imitating Dynamics in Natural Sequences.
ICCV Spotlight, 2017
Kuo-Hao Zeng, William B. Shen, De-An Huang, Min Sun, Juan Carlos Niebles [pdf]
Anticipating Daily Intention using On-Wrist Motion Triggered Sensing.
ICCV Spotlight, 2017
Tz-Ying Wu*, Ting-An Chien*, Cheng-Sheng Chan, Chan-Wei Hu, Min Sun (*indicates equal contribution) [pdf] [website]
No More Discrimination: Cross City Adaptation of Road Scene Segmenters
ICCV, 2017
Yi-Hsin Chen, Wei-Yu Chen, Yu-Ting Chen, Bo-Cheng Tsai, Yu-Chiang Frank Wang, Min Sun [pdf] [website] [dataset]
Show, Adapt and Tell: Adversarial Training of Cross-domain Image Captioner.
ICCV, 2017
Tseng-Hung Chen, Yuan-Hong Liao, Ching-Yao Chuang, Wan-Ting Hsu, Jianlong Fu and Min Sun [pdf] [website] [code]
Tactics of Adversarial Attack on Deep Reinforcement Learning Agent.
IJCAI, 2017
Yen-Chen Lin, Zhang-Wei Hong, Yuan-Hong Liao, Meng-Li Shih, Ming-Yu Liu and Min Sun [pdf] [website]
Tactics of Adversarial Attack on Deep Reinforcement Learning Agent.
ICLR Workshop, 2017
Yen-Chen Lin, Zhang-Wei Hong, Yuan-Hong Liao, Meng-Li Shih, Ming-Yu Liu and Min Sun [pdf] [website]
Agent-centric Risk Assessment: Accident Anticipation and Risky Region Localization.
CVPR Spotlight, 2017
Kuo-Hao Zeng, Shih-Han Chou, Fu-Hsiang Chan, Juan Carlos Niebles and Min Sun [pdf] [website]
Deep 360 Pilot: Learning a Deep Agent for Piloting through 360° Sports Videos.
CVPR Oral, 2017
Hou-Ning Hu*, Yen-Chen Lin*, Ming-Yu Liu, Hsien-Tzu Cheng, Yung-Ju Chang and Min Sun (*indicates equal contribution) [pdf] [website]
Tell Me Where to Look: Investigating Ways for Assisting Focus in 360° Video.
CHI, 2017 [acceptance rate: ---/2424=~25%]
Yen-Chen Lin, Yung-Ju Chang, Hou-Ning Hu, Hsien-Tzu Cheng, Chi-Wen Huang, and Min Sun [pdf] [website]
Leveraging Video Descriptions to Learn Video Question Answering.
AAAI, 2017 [acceptance rate: 638/2590=24.6%]
Kuo-Hao Zeng, Tseng-Hung Chen, Ching-Yao Chuang, Yuan-Hong Liao, Juan Carlos Niebles and Min Sun [pdf] [website]
Anticipating Accidents in Dashcam Videos.
ACCV, 2016 (Oral) [oral acceptance rate: 33/590=5.6%]
Fu-Hsiang Chan, Yu-Ting Chen, Yu Xiang, Min Sun [pdf] [website] [code]
Title Generation for User Generated Videos.
ECCV, 2016 [acceptance rate: 415/1561=26.6%]
Kuo-Hao Zeng, Tseng-Hung Chen, Juan Carlos Niebles, Min Sun [pdf] [video] [website]
Semantic Highlight Retrieval.
ICIP, 2016
Kuo-Hao Zeng, Yen-Chen Lin, Ali Farhadi, Min Sun [pdf]
Proactive sensing for improving hand pose estimation.
CHI, 2016 [acceptance rate: 565/2435=23.2%]
Dun-Yu Hsiao, M. Sun, C. Ballweber, S. Cooper, and Z. Popović [pdf] [YouTube]
Recognition from Hand Cameras: A Revisit with Deep Learning.
ECCV, 2016 [acceptance rate: 415/1561=26.6%]
Cheng-Sheng Chan, Shou-Zhong Chen, Pei-Xuan Xie, Chiung-Chih Chang, Min Sun [pdf] [website]
Ranking Domain-specific Highlights by Analyzing Edited Videos
ECCV, 2014
M. Sun, A. Farhadi, and S. Seitz [pdf] [tech] [project]
Salient Montages from Unconstrained Videos
ECCV, 2014
M. Sun, A. Farhadi, B. Taskar, and S. Seitz [pdf] [tech] [project]
Find the Best Path: an Efficient and Accurate Classifier for Image Hierarchies
ICCV, 2013
M. Sun, W. Huang, and S. Savarese [pdf]
Learning Hierarchical Linguistic Descriptions of Visual Datasets
NAACL-HLT Workshop on Vision and Language, 2013
R. Mittelman, M. Sun, B. Kuipers, and S. Savarese [pdf] [bibtex]
Relating Things and Stuff by High-Order Potential Modeling
ECCV 2012 Workshop on Higher-Order Models and Global Constraints in Computer Vision (HiPot), 2012
M. Sun, B. Kim, P. Kohli, and S. Savarese [pdf] [project] [bibtex]
An Efficient Branch-and-Bound Algorithm for Optimal Human Pose Estimation
CVPR, 2012
M. Sun, M. Telaprolu, H. Lee, and S. Savarese [pdf] [project] [bibtex]
Conditional regression forests for human pose estimation
CVPR, 2012
M. Sun, P. Kohli, and J. Shotton [pdf]
Mobile Object Detection through Client-Server based Vote Transfer
CVPR, 2012
S. Kumar, M. Sun, and S. Savarese [pdf] [bibtex]
Efficient and Exact MAP Inference using Branch and Bound
AISTATS, 2012
M. Sun, M. Telaprolu, H. Lee, and S. Savarese [pdf] [project] [bibtex]
Articulated Part-based Model for Joint Object Detection and Pose Estimation
ICCV, 2011
M. Sun and S. Savarese [pdf] [supplementary material] [bibtex]
Toward Automatic 3D Generic Object Modeling from One Single Image
3DIM-PVT, 2011
M. Sun, S. Kumar, G. Bradski, and S. Savarese [pdf]
Object Detection with Geometrical Context Feedback Loop (oral)
BMVC, 2010
M. Sun, S. Ying-Ze Bao, and S. Savarese [pdf] [bibtex]
Depth-Encoded Hough Voting for Joint Object Detection and Shape Recovery
ECCV, 2010
M. Sun, G. Bradski, B. Xu, and S. Savarese [pdf] [bibtex]
Toward Coherent Object Detection And Scene Layout Understanding
CVPR, 2010
S. Yingze Bao, M. Sun, and S. Savarese [pdf] [bibtex]
Learning a dense multi-view representation for detection, viewpoint classification and synthesis of object categories (oral)
ICCV, 2009
M. Sun, H. Su, Silvio Savarese, L. Fei-Fei [pdf] [bibtex]
A Multi-View Probabilistic Model for 3D Object Classes
CVPR, 2009
M. Sun, H. Su, Silvio Savarese, L. Fei-Fei [pdf] [bibtex]
Unsupervised Object Pose Classification from Short Video Sequences
BMVC, 2009
L. Mei, M. Sun, K.M. Carter, A.O. Hero III, S. Savarese [pdf] [bibtex]