2020.12.21 Vision papers

12-15-2020	Object-based attention for spatio-temporal reasoning: Outperforming neuro-symbolic models with flexible distributed architectures by David Ding et al
12-17-2020	Taming Transformers for High-Resolution Image Synthesis by Patrick Esser et al
12-16-2020	Learning Continuous Image Representation with Local Implicit Image Function by Yinbo Chen et al
12-16-2020	Point Transformer by Hengshuang Zhao et al
12-17-2020	SceneFormer: Indoor Scene Generation with Transformers by Xinpeng Wang et al
12-17-2020	Worldsheet: Wrapping the World in a 3D Sheet for View Synthesis from a Single Image by Ronghang Hu et al
12-17-2020	Content Masked Loss: Human-Like Brush Stroke Planning in a Reinforcement Learning Painting Agent by Peter Schaldenbrand et al
12-16-2020	Sketch Generation with Drawing Process Guided by Vector Flow and Grayscale by Zhengyan Tong et al
12-16-2020	Sparse Signal Models for Data Augmentation in Deep Learning ATR by Tushar Agarwal et al
12-16-2020	Unsupervised Learning of Local Discriminative Representation for Medical Images by Huai Chen et al
12-17-2020	Infinite Nature: Perpetual View Generation of Natural Scenes from a Single Image by Andrew Liu et al
12-17-2020	Toward Transformer-Based Object Detection by Josh Beal et al
12-16-2020	Exploring Data-Efficient 3D Scene Understanding with Contrastive Scene Contexts by Ji Hou et al
12-16-2020	Projected Distribution Loss for Image Enhancement by Mauricio Delbracio et al
12-15-2020	FoggySight: A Scheme for Facial Lookup Privacy by Ivan Evtimov et al
12-17-2020	Transformer Interpretability Beyond Attention Visualization by Hila Chefer et al
12-16-2020	Polyblur: Removing mild blur by polynomial reblurring by Mauricio Delbracio et al
12-15-2020	FLAVR: Flow-Agnostic Video Representations for Fast Frame Interpolation by Tarun Kalluri et al
12-16-2020	Learning to Recover 3D Scene Shape from a Single Image by Wei Yin et al
12-16-2020	Self-Supervised Sketch-to-Image Synthesis by Bingchen Liu et al
12-17-2020	Human Mesh Recovery from Multiple Shots by Georgios Pavlakos et al
12-16-2020	Distilling Optimal Neural Networks: Rapid Search in Diverse Spaces by Bert Moons et al
12-16-2020	StarcNet: Machine Learning for Star Cluster Identification by Gustavo Perez et al
12-16-2020	C2F-FWN: Coarse-to-Fine Flow Warping Network for Spatial-Temporal Consistent Motion Transfer by Dongxu Wei et al
12-17-2020	Image-Based Jet Analysis by Michael Kagan
12-16-2020	Unlabeled Data Guided Semi-supervised Histopathology Image Segmentation by Hongxiao Wang et al
12-16-2020	Shape My Face: Registering 3D Face Scans by Surface-to-Surface Translation by Mehdi Bahri et al
12-17-2020	Computation-Efficient Knowledge Distillation via Uncertainty-Aware Mixup by Guodong Xu et al
12-17-2020	Neural Radiance Flow for 4D View Synthesis and Video Processing by Yilun Du et al
12-17-2020	PCT: Point Cloud Transformer by Meng-Hao Guo et al
12-16-2020	MELINDA: A Multimodal Dataset for Biomedical Experiment Method Classification by Te-Lin Wu et al
12-17-2020	End-to-End Human Pose and Mesh Reconstruction with Transformers by Kevin Lin et al
12-15-2020	Detecting Invisible People by Tarasha Khurana et al
12-16-2020	DECOR-GAN: 3D Shape Detailization by Conditional Refinement by Zhiqin Chen et al
12-16-2020	Deep Reinforcement Learning of Graph Matching by Chang Liu et al
12-17-2020	Objectron: A Large Scale Dataset of Object-Centric Videos in the Wild with Pose Annotations by Adel Ahmadyan et al
12-17-2020	Detection and Prediction of Nutrient Deficiency Stress using Longitudinal Aerial Imagery by Saba Dadsetan et al
12-16-2020	uBAM: Unsupervised Behavior Analysis and Magnification using Deep Learning by Biagio Brattoli et al
12-17-2020	On Episodes, Prototypical Networks, and Few-shot Learning by Steinar Laenen et al
12-17-2020	Learning Cross-Domain Correspondence for Control with Dynamics Cycle-Consistency by Qiang Zhang et al
12-15-2020	Exploiting Sample Uncertainty for Domain Adaptive Person Re-Identification by Kecheng Zheng et al
12-17-2020	Relightable 3D Head Portraits from a Smartphone Video by Artem Sevastopolsky et al
12-17-2020	Deep Learning Techniques for Super-Resolution in Video Games by Alexander Watson
12-16-2020	Roof-GAN: Learning to Generate Roof Geometry and Relations for Residential Houses by Yiming Qian et al
12-16-2020	Spatial Context-Aware Self-Attention Model For Multi-Organ Segmentation by Hao Tang et al
12-15-2020	Object-Centric Neural Scene Rendering by Michelle Guo et al
12-17-2020	Combating Mode Collapse in GAN training: An Empirical Analysis using Hessian Eigenvalues by Ricard Durall et al
12-15-2020	StrokeGAN: Reducing Mode Collapse in Chinese Font Generation via Stroke Encoding by Jinshan Zeng et al
12-17-2020	Describing the Structural Phenotype of the Glaucomatous Optic Nerve Head Using Artificial Intelligence by Satish K. Panda et al
12-17-2020	Trajectory saliency detection using consistency-oriented latent codes from a recurrent auto-encoder by L. Maczyta et al
12-17-2020	End-to-end Deep Object Tracking with Circular Loss Function for Rotated Bounding Box by Vladislav Belyaev et al
12-17-2020	Zoom-to-Inpaint: Image Inpainting with High Frequency Details by Soo Ye Kim et al
12-17-2020	Temporal LiDAR Frame Prediction for Autonomous Driving by David Deng et al
12-16-2020	S3CNet: A Sparse Semantic Scene Completion Network for LiDAR Point Clouds by Ran Cheng et al
12-15-2020	Responsible Disclosure of Generative Models Using Scalable Fingerprinting by Ning Yu et al
12-15-2020	Cross-Domain Grouping and Alignment for Domain Adaptive Semantic Segmentation by Minsu Kim et al
12-15-2020	A Closer Look at the Robustness of Vision-and-Language Pre-trained Models by Linjie Li et al
12-17-2020	Multi-Modal Depth Estimation Using Convolutional Neural Networks by Sadique Adnan Siddiqui et al
12-17-2020	A Hierarchical Feature Constraint to Camouflage Medical Adversarial Attacks by Qingsong Yao et al
12-16-2020	Transfer Learning Through Weighted Loss Function and Group Normalization for Vessel Segmentation from Retinal Images by Abdullah Sarhan et al
12-16-2020	Efficient Golf Ball Detection and Tracking Based on Convolutional Neural Networks and Kalman Filter by Tianxiao Zhang et al
12-16-2020	Neural Pruning via Growing Regularization by Huan Wang et al
12-17-2020	Efficient CNN-LSTM based Image Captioning using Neural Network Compression by Harshit Rampal et al
12-17-2020	RainNet: A Large-Scale Dataset for Spatial Precipitation Downscaling by Xuanhong Chen et al
12-16-2020	On the Limitations of Denoising Strategies as Adversarial Defenses by Zhonghan Niu et al
12-15-2020	Seeing Behind Objects for 3D Multi-Object Tracking in RGB-D Sequences by Norman Müller et al
12-17-2020	Learning Compositional Radiance Fields of Dynamic Human Heads by Ziyan Wang et al
12-17-2020	Weakly-Supervised Action Localization and Action Recognition using Global-Local Attention of 3D CNN by Novanto Yudistira et al
12-18-2020	Frequency Consistent Adaptation for Real World Super Resolution by Xiaozhong Ji et al
12-16-2020	Reduction in the complexity of 1D 1H-NMR spectra by the use of Frequency to Information Transformation by Homayoun Valafar et al
12-18-2020	On Modality Bias in the TVQA Dataset by Thomas Winterbottom et al
12-16-2020	Event Camera Calibration of Per-pixel Biased Contrast Threshold by Ziwei Wang et al
12-16-2020	Simultaneous View and Feature Selection for Collaborative Multi-Robot Recognition by Brian Reily et al
12-16-2020	ISD: Self-Supervised Learning by Iterative Similarity Distillation by Ajinkya Tejankar et al
12-17-2020	Joint Search of Data Augmentation Policies and Network Architectures by Taiga Kashima et al
12-17-2020	Invariant Teacher and Equivariant Student for Unsupervised 3D Human Pose Estimation by Chenxin Xu et al
12-17-2020	Incremental Learning from Low-labelled Stream Data in Open-Set Video Face Recognition by Eric Lopez-Lopez et al
12-15-2020	Masksembles for Uncertainty Estimation by Nikita Durasov et al
12-17-2020	Exploiting Learnable Joint Groups for Hand Pose Estimation by Moran Li et al
12-17-2020	Embodied Visual Active Learning for Semantic Segmentation by David Nilsson et al
12-17-2020	LIGHTEN: Learning Interactions with Graph and Hierarchical TEmporal Networks for HOI in videos by Sai Praneeth Reddy Sunkesula et al
12-15-2020	Representing Ambiguity in Registration Problems with Conditional Invertible Neural Networks by Darya Trofimova et al
12-16-2020	Clique: Spatiotemporal Object Re-identification at the City Scale by Tiantu Xu et al
12-16-2020	Learning to Recognize Patch-Wise Consistency for Deepfake Detection by Tianchen Zhao et al
12-15-2020	Canny-VO: Visual Odometry with RGB-D Cameras based on Geometric 3D-2D Edge Alignment by Yi Zhou et al
12-15-2020	KOALAnet: Blind Super-Resolution using Kernel-Oriented Adaptive Local Adjustment by Soo Ye Kim et al
12-17-2020	Multi-shot Temporal Event Localization: a Benchmark by Xiaolong Liu et al
12-17-2020	PanoNet3D: Combining Semantic and Geometric Understanding for LiDARPoint Cloud Detection by Xia Chen et al
12-16-2020	A Contrast Synthesized Thalamic Nuclei Segmentation Scheme using Convolutional Neural Networks by Lavanya Umapathy et al
12-15-2020	FAWA: Fast Adversarial Watermark Attack on Optical Character Recognition (OCR) Systems by Lu Chen et al
12-17-2020	A new semi-supervised self-training method for lung cancer prediction by Kelvin Shak et al
12-17-2020	Learning to Share: A Multitasking Genetic Programming Approach to Image Feature Learning by Ying Bi et al
12-15-2020	Improved Image Matting via Real-time User Clicks and Uncertainty Estimation by Tianyi Wei et al
12-16-2020	CompositeTasking: Understanding Images by Spatial Composition of Tasks by Nikola Popovic et al
12-17-2020	A fully pipelined FPGA accelerator for scale invariant feature transform keypoint descriptor matching, by Luka Daoud et al
12-17-2020	Learned Block-based Hybrid Image Compression by Yaojun Wu et al
12-16-2020	Semi-Global Shape-aware Network by Pengju Zhang et al
12-18-2020	Trying Bilinear Pooling in Video-QA by Thomas Winterbottom et al
12-16-2020	Temporal Graph Modeling for Skeleton-based Action Recognition by Jianan Li et al
12-16-2020	Latent Space Conditioning on Generative Adversarial Networks by Ricard Durall et al
12-15-2020	Enhance Multimodal Transformer With External Label And In-Domain Pretrain: Hateful Meme Challenge Winning Solution by Ron Zhu
12-15-2020	Post-Hurricane Damage Assessment Using Satellite Imagery and Geolocation Features by Quoc Dung Cao et al
12-18-2020	A Surrogate Lagrangian Relaxation-based Model Compression for Deep Neural Networks by Deniz Gurevin et al
12-15-2020	Exploring Vicinal Risk Minimization for Lightweight Out-of-Distribution Detection by Deepak Ravikumar et al
12-17-2020	XXResolution Correspondence Networks by Georgi Tinchev et al
12-16-2020	Evaluation of deep learning-based myocardial infarction quantification using Segment CMR software by Olivier Rukundo
12-17-2020	Reconstructing Hand-Object Interactions in the Wild by Zhe Cao et al
12-15-2020	HeadGAN: Video-and-Audio-Driven Talking Head Synthesis by Michail Christos Doukas et al
12-16-2020	Interpretable Image Clustering via Diffeomorphism-Aware K-Means by Romain Cosentino et al
12-16-2020	AutoCaption: Image Captioning with Neural Architecture Search by Xinxin Zhu et al
12-16-2020	Learning to Run with Potential-Based Reward Shaping and Demonstrations from Video Data by Aleksandra Malysheva et al

12-18-2020	STNet: Scale Tree Network with Multi-level Auxiliator for Crowd Counting by Mingjie Wang et al
12-15-2020	FMODetect: Robust Detection and Trajectory Estimation of Fast Moving Objects by Denys Rozumnyi et al
12-16-2020	Unsupervised Image Segmentation using Mutual Mean-Teaching by Zhichao Wu et al
12-16-2020	Secret Key Agreement with Physical Unclonable Functions: An Optimality Summary by Onur Günlü et al
12-15-2020	SID-NISM: A Self-supervised Low-light Image Enhancement Framework by Lijun Zhang et al
12-16-2020	Self-Supervised Person Detection in 2D Range Data using a Calibrated Camera by Dan Jia et al
12-17-2020	Information-Preserving Contrastive Learning for Self-Supervised Representations by Tianhong Li et al
12-15-2020	Mitigating bias in calibration error estimation by Rebecca Roelofs et al
12-18-2020	AU-Guided Unsupervised Domain Adaptive Facial Expression Recognition by Kai Wang et al
12-18-2020	SegGroup: Seg-Level Supervision for 3D Instance and Semantic Segmentation by An Tao et al
12-15-2020	NeuralQAAD: An Efficient Differentiable Framework for High Resolution Point Cloud Compression by Nicolas Wagner et al
12-18-2020	Temporal Bilinear Encoding Network of Audio-Visual Features at Low Sampling Rates by Feiyan Hu et al
12-17-2020	Treadmill Assisted Gait Spoofing (TAGS): An Emerging Threat to wearable Sensor-based Gait Authentication by Rajesh Kumar et al
12-16-2020	I3DOL: Incremental 3D Object Learning without Catastrophic Forgetting by Jiahua Dong et al
12-15-2020	FINED: Fast Inference Network for Edge Detection by Jan Kristanto Wibisono et al
12-15-2020	Research on All-content Text Recognition Method for Financial Ticket Image by Fukang Tian et al
12-16-2020	Cross-Cohort Generalizability of Deep and Conventional Machine Learning for MRI-based Diagnosis and Prediction of Alzheimers Disease by Esther E. Bron et al
12-15-2020	Geometric Surface Image Prediction for Image Recognition Enhancement by Tanasai Sucontphunt
12-15-2020	Class-incremental Learning with Rectified Feature-Graph Preservation by Cheng-Hsun Lei et al
12-16-2020	Difficulty in estimating visual information from randomly sampled images by Masaki Kitayama et al
12-15-2020	Deep Layout of Custom-size Furniture through Multiple-domain Learning by Xinhan Di et al
12-17-2020	Firearm Detection via Convolutional Neural Networks: Comparing a Semantic Segmentation Model Against End-to-End Solutions by Alexander Egiazarov et al
12-18-2020	SCNet: Training Inference Sample Consistency for Instance Segmentation by Thang Vu et al
12-15-2020	Deep Learning to Segment Pelvic Bones: Large-scale CT Datasets and Baseline Models by Pengbo Liu et al
12-18-2020	CodeVIO: Visual-Inertial Odometry with Learned Optimizable Dense Depth by Xingxing Zuo et al
12-17-2020	Attention-based Image Upsampling by Souvik Kundu et al
12-15-2020	Learning-Based Quality Assessment for Image Super-Resolution by Tiesong Zhao et al
12-18-2020	Separation and Concentration in Deep Networks by John Zarka et al
12-16-2020	Joint Generative and Contrastive Learning for Unsupervised Person Re-identification by Hao Chen et al
12-15-2020	Wasserstein Contrastive Representation Distillation by Liqun Chen et al
12-17-2020	Object Detection based on OcSaFPN in Aerial Images with Noise by Chengyuan Li et al
12-16-2020	Learning-Based Algorithms for Vessel Tracking: A Review by Dengqiang Jia et al
12-15-2020	Robust Factorization Methods Using a Gaussian/Uniform Mixture Model by Andrei Zaharescu et al
12-15-2020	SPOC learners final grade prediction based on a novel sampling batch normalization embedded neural network method by Zhuonan Liang et al
12-15-2020	NAPA: Neural Art Human Pose Amplifier by Qingfu Wan et al
12-16-2020	Towards Recognizing New Semantic Concepts in New Visual Domains by Massimiliano Mancini
12-15-2020	Two-Stage Copy-Move Forgery Detection with Self Deep Matching and Proposal SuperGlue by Yaqi Liu et al
12-15-2020	CosSGD: Nonlinear Quantization for Communication-efficient Federated Learning by Yang He et al
12-18-2020	TDN: Temporal Difference Networks for Efficient Action Recognition by Limin Wang et al
12-15-2020	docExtractor: An off-the-shelf historical document element extraction by Tom Monnier et al
12-15-2020	Jet tagging in the Lund plane with graph networks by Frédéric A. Dreyer et al
12-18-2020	Spectral Reflectance Estimation Using Projector with Unknown Spectral Power Distribution by Hironori Hidaka et al
12-18-2020	Hyperspectral Image Semantic Segmentation in Cityscapes by Yuxing Huang et al
12-16-2020	AdjointBackMap: Reconstructing Effective Decision Hypersurfaces from CNN Layers Using Adjoint Operators by Qing Wan et al
12-17-2020	Self-supervised Learning with Fully Convolutional Networks by Zhengeng Yang et al
12-17-2020	Flow-based Generative Models for Learning Manifold to Manifold Mappings by Xingjian Zhen et al
12-15-2020	Unsupervised Domain Adaptation from Synthetic to Real Images for Anchorless Object Detection by Tobias Scheck et al
12-15-2020	Point-Level Temporal Action Localization: Bridging Fully-supervised Proposals to Weakly-supervised Losses by Chen Ju et al
12-16-2020	SimuGAN: Unsupervised forward modeling and optimal design of a LIDAR Camera by Nir Diamant et al
12-16-2020	Analysing the Direction of Emotional Influence in Nonverbal Dyadic Communication: A Facial-Expression Study by Maha Shadaydeh et al
12-16-2020	Revisiting 3D Context Modeling with Supervised Pre-training for Universal Lesion Detection in CT Slices by Shu Zhang et al
12-15-2020	Practical Auto-Calibration for Spatial Scene-Understanding from Crowdsourced Dashcamera Videos by Hemang Chawla et al
12-15-2020	Attentional Local Contrast Networks for Infrared Small Target Detection by Yimian Dai et al
12-18-2020	LGENet: Local and Global Encoder Network for Semantic Segmentation of Airborne Laser Scanning Point Clouds by Yaping Lin et al
12-18-2020	Multimodal Transfer Learning-based Approaches for Retinal Vascular Segmentation by José Morano et al
12-15-2020	FCFR-Net: Feature Fusion based Coarse-to-Fine Residual Learning for Monocular Depth Completion by Lina Liu et al
12-15-2020	mDALU: Multi-Source Domain Adaptation and Label Unification with Partial Datasets by Rui Gong et al
12-17-2020	Exploring Motion Boundaries in an End-to-End Network for Vision-based Parkinsons Severity Assessment by Amirhossein Dadashzadeh et al
12-15-2020	Event-based Motion Segmentation with Spatio-Temporal Graph Cuts by Yi Zhou et al
12-15-2020	Towards Improving Spatiotemporal Action Recognition in Videos by Shentong Mo et al
12-17-2020	3D Object Classification on Partial Point Clouds: A Practical Perspective by Zelin Xu et al
12-16-2020	PGMAN: An Unsupervised Generative Multi-adversarial Network for Pan-sharpening by Huanyu Zhou et al
12-15-2020	Geometry Enhancements from Visual Content: Going Beyond Ground Truth by Liran Azaria et al
12-15-2020	Domain Adaptive Object Detection via Feature Separation and Alignment by Chengyang Liang et al
12-15-2020	Automated system to measure Tandem Gait to assess executive functions in children by Mohammad Zaki Zadeh et al
12-18-2020	PointINet: Point Cloud Frame Interpolation Network by Fan Lu et al
12-15-2020	Artificial Dummies for Urban Dataset Augmentation by Antonín Vobecký et al
12-17-2020	Fast 3-dimensional estimation of the Foveal Avascular Zone from OCTA by Giovanni Ometto et al
12-15-2020	GTA: Global Temporal Attention for Video Action Understanding by Bo He et al
12-15-2020	End-to-end Generative Floor-plan and Layout with Attributes and Relation Graph by Xinhan Di et al
12-15-2020	Training an Emotion Detection Classifier using Frames from a Mobile Therapeutic Game for Children with Developmental Disorders by Peter Washington et al
12-15-2020	Dilated-Scale-Aware Attention ConvNet For Multi-Class Object Counting by Wei Xu et al
12-15-2020	Frozen-to-Paraffin: Categorization of Histological Frozen Sections by the Aid of Paraffin Sections and Generative Adversarial Networks by Michael Gadermayr et al
12-18-2020	Boosting Monocular Depth Estimation with Lightweight 3D Point Fusion by Lam Huynh et al
12-15-2020	Pose Error Reduction for Focus Enhancement in Thermal Synthetic Aperture Visualization by Indrajit Kurmi et al
12-18-2020	Assessing Pattern Recognition Performance of Neuronal Cultures through Accurate Simulation by Gabriele Lagani et al
12-17-2020	FG-Net: Fast Large-Scale LiDAR Point CloudsUnderstanding Network Leveraging CorrelatedFeature Mining and Geometric-Aware Modelling by Kangcheng Liu et al
12-18-2020	A Holistically-Guided Decoder for Deep Representation Learning with Applications to Semantic Segmentation and Object Detection by Jianbo Liu et al
12-15-2020	CUDA-Optimized real-time rendering of a Foveated Visual System by Elian Malkin et al
12-16-2020	TEMImageNet and AtomSegNet Deep Learning Training Library and Models for High-Precision Atom Segmentation, Localization, Denoising, and Super-resolution Processing of Atom-Resolution Scanning TEM Images by Ruoqian Lin et al
12-18-2020	PC-RGNN: Point Cloud Completion and Graph Neural Network for 3D Object Detection by Yanan Zhang et al
12-18-2020	Learning Complex 3D Human Self-Contact by Mihai Fieraru et al
12-15-2020	Fast 3D Image Moments by William Diggin et al
12-15-2020	Cluster, Split, Fuse, and Update: Meta-Learning for Open Compound Domain Adaptive Semantic Segmentation by Rui Gong et al
12-15-2020	Equalization Loss v2: A New Gradient Balance Approach for Long-tailed Object Detection by Jingru Tan et al
12-15-2020	Robots Understanding Contextual Information in Human-Centered Environments using Weakly Supervised Mask Data Distillation by Daniel Dworakowski et al
12-18-2020	Improving 3D convolutional neural network comprehensibility via interactive visualization of relevance maps: Evaluation in Alzheimers disease by Martin Dyrba et al
12-17-2020	CT Film Recovery via Disentangling Geometric Deformation and Illumination Variation: Simulated Datasets and Deep Models by Quan Quan et al
12-15-2020	Personal Mental Health Navigator: Harnessing the Power of Data, Personal Models, and Health Cybernetics to Promote Psychological Well-being by Amir M. Rahmani et al

Craig SmithDecember 21, 2020