2020.12.21 Vision papers

 

12-15-2020

Object-based attention for spatio-temporal reasoning: Outperforming neuro-symbolic models with flexible distributed architectures
by David Ding et al

12-17-2020

Taming Transformers for High-Resolution Image Synthesis
by Patrick Esser et al

12-16-2020

Learning Continuous Image Representation with Local Implicit Image Function
by Yinbo Chen et al

12-16-2020

Point Transformer
by Hengshuang Zhao et al

12-17-2020

SceneFormer: Indoor Scene Generation with Transformers
by Xinpeng Wang et al

12-17-2020

Worldsheet: Wrapping the World in a 3D Sheet for View Synthesis from a Single Image
by Ronghang Hu et al

12-17-2020

Content Masked Loss: Human-Like Brush Stroke Planning in a Reinforcement Learning Painting Agent
by Peter Schaldenbrand et al

12-16-2020

Sketch Generation with Drawing Process Guided by Vector Flow and Grayscale
by Zhengyan Tong et al

12-16-2020

Sparse Signal Models for Data Augmentation in Deep Learning ATR
by Tushar Agarwal et al

12-16-2020

Unsupervised Learning of Local Discriminative Representation for Medical Images
by Huai Chen et al

12-17-2020

Infinite Nature: Perpetual View Generation of Natural Scenes from a Single Image
by Andrew Liu et al

12-17-2020

Toward Transformer-Based Object Detection
by Josh Beal et al

12-16-2020

Exploring Data-Efficient 3D Scene Understanding with Contrastive Scene Contexts
by Ji Hou et al

12-16-2020

Projected Distribution Loss for Image Enhancement
by Mauricio Delbracio et al

12-15-2020

FoggySight: A Scheme for Facial Lookup Privacy
by Ivan Evtimov et al

12-17-2020

Transformer Interpretability Beyond Attention Visualization
by Hila Chefer et al

12-16-2020

Polyblur: Removing mild blur by polynomial reblurring
by Mauricio Delbracio et al

12-15-2020

FLAVR: Flow-Agnostic Video Representations for Fast Frame Interpolation
by Tarun Kalluri et al

12-16-2020

Learning to Recover 3D Scene Shape from a Single Image
by Wei Yin et al

12-16-2020

Self-Supervised Sketch-to-Image Synthesis
by Bingchen Liu et al

12-17-2020

Human Mesh Recovery from Multiple Shots
by Georgios Pavlakos et al

12-16-2020

Distilling Optimal Neural Networks: Rapid Search in Diverse Spaces
by Bert Moons et al

12-16-2020

StarcNet: Machine Learning for Star Cluster Identification
by Gustavo Perez et al

12-16-2020

C2F-FWN: Coarse-to-Fine Flow Warping Network for Spatial-Temporal Consistent Motion Transfer
by Dongxu Wei et al

12-17-2020

Image-Based Jet Analysis
by Michael Kagan

12-16-2020

Unlabeled Data Guided Semi-supervised Histopathology Image Segmentation
by Hongxiao Wang et al

12-16-2020

Shape My Face: Registering 3D Face Scans by Surface-to-Surface Translation
by Mehdi Bahri et al

12-17-2020

Computation-Efficient Knowledge Distillation via Uncertainty-Aware Mixup
by Guodong Xu et al

12-17-2020

Neural Radiance Flow for 4D View Synthesis and Video Processing
by Yilun Du et al

12-17-2020

PCT: Point Cloud Transformer
by Meng-Hao Guo et al

12-16-2020

MELINDA: A Multimodal Dataset for Biomedical Experiment Method Classification
by Te-Lin Wu et al

12-17-2020

End-to-End Human Pose and Mesh Reconstruction with Transformers
by Kevin Lin et al

12-15-2020

Detecting Invisible People
by Tarasha Khurana et al

12-16-2020

DECOR-GAN: 3D Shape Detailization by Conditional Refinement
by Zhiqin Chen et al

12-16-2020

Deep Reinforcement Learning of Graph Matching
by Chang Liu et al

12-17-2020

Objectron: A Large Scale Dataset of Object-Centric Videos in the Wild with Pose Annotations
by Adel Ahmadyan et al

12-17-2020

Detection and Prediction of Nutrient Deficiency Stress using Longitudinal Aerial Imagery
by Saba Dadsetan et al

12-16-2020

uBAM: Unsupervised Behavior Analysis and Magnification using Deep Learning
by Biagio Brattoli et al

12-17-2020

On Episodes, Prototypical Networks, and Few-shot Learning
by Steinar Laenen et al

12-17-2020

Learning Cross-Domain Correspondence for Control with Dynamics Cycle-Consistency
by Qiang Zhang et al

12-15-2020

Exploiting Sample Uncertainty for Domain Adaptive Person Re-Identification
by Kecheng Zheng et al

12-17-2020

Relightable 3D Head Portraits from a Smartphone Video
by Artem Sevastopolsky et al

12-17-2020

Deep Learning Techniques for Super-Resolution in Video Games
by Alexander Watson

12-16-2020

Roof-GAN: Learning to Generate Roof Geometry and Relations for Residential Houses
by Yiming Qian et al

12-16-2020

Spatial Context-Aware Self-Attention Model For Multi-Organ Segmentation
by Hao Tang et al

12-15-2020

Object-Centric Neural Scene Rendering
by Michelle Guo et al

12-17-2020

Combating Mode Collapse in GAN training: An Empirical Analysis using Hessian Eigenvalues
by Ricard Durall et al

12-15-2020

StrokeGAN: Reducing Mode Collapse in Chinese Font Generation via Stroke Encoding
by Jinshan Zeng et al

12-17-2020

Describing the Structural Phenotype of the Glaucomatous Optic Nerve Head Using Artificial Intelligence
by Satish K. Panda et al

12-17-2020

Trajectory saliency detection using consistency-oriented latent codes from a recurrent auto-encoder
by L. Maczyta et al

12-17-2020

End-to-end Deep Object Tracking with Circular Loss Function for Rotated Bounding Box
by Vladislav Belyaev et al

12-17-2020

Zoom-to-Inpaint: Image Inpainting with High Frequency Details
by Soo Ye Kim et al

12-17-2020

Temporal LiDAR Frame Prediction for Autonomous Driving
by David Deng et al

12-16-2020

S3CNet: A Sparse Semantic Scene Completion Network for LiDAR Point Clouds
by Ran Cheng et al

12-15-2020

Responsible Disclosure of Generative Models Using Scalable Fingerprinting
by Ning Yu et al

12-15-2020

Cross-Domain Grouping and Alignment for Domain Adaptive Semantic Segmentation
by Minsu Kim et al

12-15-2020

A Closer Look at the Robustness of Vision-and-Language Pre-trained Models
by Linjie Li et al

12-17-2020

Multi-Modal Depth Estimation Using Convolutional Neural Networks
by Sadique Adnan Siddiqui et al

12-17-2020

A Hierarchical Feature Constraint to Camouflage Medical Adversarial Attacks
by Qingsong Yao et al

12-16-2020

Transfer Learning Through Weighted Loss Function and Group Normalization for Vessel Segmentation from Retinal Images
by Abdullah Sarhan et al

12-16-2020

Efficient Golf Ball Detection and Tracking Based on Convolutional Neural Networks and Kalman Filter
by Tianxiao Zhang et al

12-16-2020

Neural Pruning via Growing Regularization
by Huan Wang et al

12-17-2020

Efficient CNN-LSTM based Image Captioning using Neural Network Compression
by Harshit Rampal et al

12-17-2020

RainNet: A Large-Scale Dataset for Spatial Precipitation Downscaling
by Xuanhong Chen et al

12-16-2020

On the Limitations of Denoising Strategies as Adversarial Defenses
by Zhonghan Niu et al

12-15-2020

Seeing Behind Objects for 3D Multi-Object Tracking in RGB-D Sequences
by Norman Müller et al

12-17-2020

Learning Compositional Radiance Fields of Dynamic Human Heads
by Ziyan Wang et al

12-17-2020

Weakly-Supervised Action Localization and Action Recognition using Global-Local Attention of 3D CNN
by Novanto Yudistira et al

12-18-2020

Frequency Consistent Adaptation for Real World Super Resolution
by Xiaozhong Ji et al

12-16-2020

Reduction in the complexity of 1D 1H-NMR spectra by the use of Frequency to Information Transformation
by Homayoun Valafar et al

12-18-2020

On Modality Bias in the TVQA Dataset
by Thomas Winterbottom et al

12-16-2020

Event Camera Calibration of Per-pixel Biased Contrast Threshold
by Ziwei Wang et al

12-16-2020

Simultaneous View and Feature Selection for Collaborative Multi-Robot Recognition
by Brian Reily et al

12-16-2020

ISD: Self-Supervised Learning by Iterative Similarity Distillation
by Ajinkya Tejankar et al

12-17-2020

Joint Search of Data Augmentation Policies and Network Architectures
by Taiga Kashima et al

12-17-2020

Invariant Teacher and Equivariant Student for Unsupervised 3D Human Pose Estimation
by Chenxin Xu et al

12-17-2020

Incremental Learning from Low-labelled Stream Data in Open-Set Video Face Recognition
by Eric Lopez-Lopez et al

12-15-2020

Masksembles for Uncertainty Estimation
by Nikita Durasov et al

12-17-2020

Exploiting Learnable Joint Groups for Hand Pose Estimation
by Moran Li et al

12-17-2020

Embodied Visual Active Learning for Semantic Segmentation
by David Nilsson et al

12-17-2020

LIGHTEN: Learning Interactions with Graph and Hierarchical TEmporal Networks for HOI in videos
by Sai Praneeth Reddy Sunkesula et al

12-15-2020

Representing Ambiguity in Registration Problems with Conditional Invertible Neural Networks
by Darya Trofimova et al

12-16-2020

Clique: Spatiotemporal Object Re-identification at the City Scale
by Tiantu Xu et al

12-16-2020

Learning to Recognize Patch-Wise Consistency for Deepfake Detection
by Tianchen Zhao et al

12-15-2020

Canny-VO: Visual Odometry with RGB-D Cameras based on Geometric 3D-2D Edge Alignment
by Yi Zhou et al

12-15-2020

KOALAnet: Blind Super-Resolution using Kernel-Oriented Adaptive Local Adjustment
by Soo Ye Kim et al

12-17-2020

Multi-shot Temporal Event Localization: a Benchmark
by Xiaolong Liu et al

12-17-2020

PanoNet3D: Combining Semantic and Geometric Understanding for LiDARPoint Cloud Detection
by Xia Chen et al

12-16-2020

A Contrast Synthesized Thalamic Nuclei Segmentation Scheme using Convolutional Neural Networks
by Lavanya Umapathy et al

12-15-2020

FAWA: Fast Adversarial Watermark Attack on Optical Character Recognition (OCR) Systems
by Lu Chen et al

12-17-2020

A new semi-supervised self-training method for lung cancer prediction
by Kelvin Shak et al

12-17-2020

Learning to Share: A Multitasking Genetic Programming Approach to Image Feature Learning
by Ying Bi et al

12-15-2020

Improved Image Matting via Real-time User Clicks and Uncertainty Estimation
by Tianyi Wei et al

12-16-2020

CompositeTasking: Understanding Images by Spatial Composition of Tasks
by Nikola Popovic et al

12-17-2020

A fully pipelined FPGA accelerator for scale invariant feature transform keypoint descriptor matching,
by Luka Daoud et al

12-17-2020

Learned Block-based Hybrid Image Compression
by Yaojun Wu et al

12-16-2020

Semi-Global Shape-aware Network
by Pengju Zhang et al

12-18-2020

Trying Bilinear Pooling in Video-QA
by Thomas Winterbottom et al

12-16-2020

Temporal Graph Modeling for Skeleton-based Action Recognition
by Jianan Li et al

12-16-2020

Latent Space Conditioning on Generative Adversarial Networks
by Ricard Durall et al

12-15-2020

Enhance Multimodal Transformer With External Label And In-Domain Pretrain: Hateful Meme Challenge Winning Solution
by Ron Zhu

12-15-2020

Post-Hurricane Damage Assessment Using Satellite Imagery and Geolocation Features
by Quoc Dung Cao et al

12-18-2020

A Surrogate Lagrangian Relaxation-based Model Compression for Deep Neural Networks
by Deniz Gurevin et al

12-15-2020

Exploring Vicinal Risk Minimization for Lightweight Out-of-Distribution Detection
by Deepak Ravikumar et al

12-17-2020

XXResolution Correspondence Networks
by Georgi Tinchev et al

12-16-2020

Evaluation of deep learning-based myocardial infarction quantification using Segment CMR software
by Olivier Rukundo

12-17-2020

Reconstructing Hand-Object Interactions in the Wild
by Zhe Cao et al

12-15-2020

HeadGAN: Video-and-Audio-Driven Talking Head Synthesis
by Michail Christos Doukas et al

12-16-2020

Interpretable Image Clustering via Diffeomorphism-Aware K-Means
by Romain Cosentino et al

12-16-2020

AutoCaption: Image Captioning with Neural Architecture Search
by Xinxin Zhu et al

12-16-2020

Learning to Run with Potential-Based Reward Shaping and Demonstrations from Video Data
by Aleksandra Malysheva et al

12-18-2020

STNet: Scale Tree Network with Multi-level Auxiliator for Crowd Counting
by Mingjie Wang et al

12-15-2020

FMODetect: Robust Detection and Trajectory Estimation of Fast Moving Objects
by Denys Rozumnyi et al

12-16-2020

Unsupervised Image Segmentation using Mutual Mean-Teaching
by Zhichao Wu et al

12-16-2020

Secret Key Agreement with Physical Unclonable Functions: An Optimality Summary
by Onur Günlü et al

12-15-2020

SID-NISM: A Self-supervised Low-light Image Enhancement Framework
by Lijun Zhang et al

12-16-2020

Self-Supervised Person Detection in 2D Range Data using a Calibrated Camera
by Dan Jia et al

12-17-2020

Information-Preserving Contrastive Learning for Self-Supervised Representations
by Tianhong Li et al

12-15-2020

Mitigating bias in calibration error estimation
by Rebecca Roelofs et al

12-18-2020

AU-Guided Unsupervised Domain Adaptive Facial Expression Recognition
by Kai Wang et al

12-18-2020

SegGroup: Seg-Level Supervision for 3D Instance and Semantic Segmentation
by An Tao et al

12-15-2020

NeuralQAAD: An Efficient Differentiable Framework for High Resolution Point Cloud Compression
by Nicolas Wagner et al

12-18-2020

Temporal Bilinear Encoding Network of Audio-Visual Features at Low Sampling Rates
by Feiyan Hu et al

12-17-2020

Treadmill Assisted Gait Spoofing (TAGS): An Emerging Threat to wearable Sensor-based Gait Authentication
by Rajesh Kumar et al

12-16-2020

I3DOL: Incremental 3D Object Learning without Catastrophic Forgetting
by Jiahua Dong et al

12-15-2020

FINED: Fast Inference Network for Edge Detection
by Jan Kristanto Wibisono et al

12-15-2020

Research on All-content Text Recognition Method for Financial Ticket Image
by Fukang Tian et al

12-16-2020

Cross-Cohort Generalizability of Deep and Conventional Machine Learning for MRI-based Diagnosis and Prediction of Alzheimers Disease
by Esther E. Bron et al

12-15-2020

Geometric Surface Image Prediction for Image Recognition Enhancement
by Tanasai Sucontphunt

12-15-2020

Class-incremental Learning with Rectified Feature-Graph Preservation
by Cheng-Hsun Lei et al

12-16-2020

Difficulty in estimating visual information from randomly sampled images
by Masaki Kitayama et al

12-15-2020

Deep Layout of Custom-size Furniture through Multiple-domain Learning
by Xinhan Di et al

12-17-2020

Firearm Detection via Convolutional Neural Networks: Comparing a Semantic Segmentation Model Against End-to-End Solutions
by Alexander Egiazarov et al

12-18-2020

SCNet: Training Inference Sample Consistency for Instance Segmentation
by Thang Vu et al

12-15-2020

Deep Learning to Segment Pelvic Bones: Large-scale CT Datasets and Baseline Models
by Pengbo Liu et al

12-18-2020

CodeVIO: Visual-Inertial Odometry with Learned Optimizable Dense Depth
by Xingxing Zuo et al

12-17-2020

Attention-based Image Upsampling
by Souvik Kundu et al

12-15-2020

Learning-Based Quality Assessment for Image Super-Resolution
by Tiesong Zhao et al

12-18-2020

Separation and Concentration in Deep Networks
by John Zarka et al

12-16-2020

Joint Generative and Contrastive Learning for Unsupervised Person Re-identification
by Hao Chen et al

12-15-2020

Wasserstein Contrastive Representation Distillation
by Liqun Chen et al

12-17-2020

Object Detection based on OcSaFPN in Aerial Images with Noise
by Chengyuan Li et al

12-16-2020

Learning-Based Algorithms for Vessel Tracking: A Review
by Dengqiang Jia et al

12-15-2020

Robust Factorization Methods Using a Gaussian/Uniform Mixture Model
by Andrei Zaharescu et al

12-15-2020

SPOC learners final grade prediction based on a novel sampling batch normalization embedded neural network method
by Zhuonan Liang et al

12-15-2020

NAPA: Neural Art Human Pose Amplifier
by Qingfu Wan et al

12-16-2020

Towards Recognizing New Semantic Concepts in New Visual Domains
by Massimiliano Mancini

12-15-2020

Two-Stage Copy-Move Forgery Detection with Self Deep Matching and Proposal SuperGlue
by Yaqi Liu et al

12-15-2020

CosSGD: Nonlinear Quantization for Communication-efficient Federated Learning
by Yang He et al

12-18-2020

TDN: Temporal Difference Networks for Efficient Action Recognition
by Limin Wang et al

12-15-2020

docExtractor: An off-the-shelf historical document element extraction
by Tom Monnier et al

12-15-2020

Jet tagging in the Lund plane with graph networks
by Frédéric A. Dreyer et al

12-18-2020

Spectral Reflectance Estimation Using Projector with Unknown Spectral Power Distribution
by Hironori Hidaka et al

12-18-2020

Hyperspectral Image Semantic Segmentation in Cityscapes
by Yuxing Huang et al

12-16-2020

AdjointBackMap: Reconstructing Effective Decision Hypersurfaces from CNN Layers Using Adjoint Operators
by Qing Wan et al

12-17-2020

Self-supervised Learning with Fully Convolutional Networks
by Zhengeng Yang et al

12-17-2020

Flow-based Generative Models for Learning Manifold to Manifold Mappings
by Xingjian Zhen et al

12-15-2020

Unsupervised Domain Adaptation from Synthetic to Real Images for Anchorless Object Detection
by Tobias Scheck et al

12-15-2020

Point-Level Temporal Action Localization: Bridging Fully-supervised Proposals to Weakly-supervised Losses
by Chen Ju et al

12-16-2020

SimuGAN: Unsupervised forward modeling and optimal design of a LIDAR Camera
by Nir Diamant et al

12-16-2020

Analysing the Direction of Emotional Influence in Nonverbal Dyadic Communication: A Facial-Expression Study
by Maha Shadaydeh et al

12-16-2020

Revisiting 3D Context Modeling with Supervised Pre-training for Universal Lesion Detection in CT Slices
by Shu Zhang et al

12-15-2020

Practical Auto-Calibration for Spatial Scene-Understanding from Crowdsourced Dashcamera Videos
by Hemang Chawla et al

12-15-2020

Attentional Local Contrast Networks for Infrared Small Target Detection
by Yimian Dai et al

12-18-2020

LGENet: Local and Global Encoder Network for Semantic Segmentation of Airborne Laser Scanning Point Clouds
by Yaping Lin et al

12-18-2020

Multimodal Transfer Learning-based Approaches for Retinal Vascular Segmentation
by José Morano et al

12-15-2020

FCFR-Net: Feature Fusion based Coarse-to-Fine Residual Learning for Monocular Depth Completion
by Lina Liu et al

12-15-2020

mDALU: Multi-Source Domain Adaptation and Label Unification with Partial Datasets
by Rui Gong et al

12-17-2020

Exploring Motion Boundaries in an End-to-End Network for Vision-based Parkinsons Severity Assessment
by Amirhossein Dadashzadeh et al

12-15-2020

Event-based Motion Segmentation with Spatio-Temporal Graph Cuts
by Yi Zhou et al

12-15-2020

Towards Improving Spatiotemporal Action Recognition in Videos
by Shentong Mo et al

12-17-2020

3D Object Classification on Partial Point Clouds: A Practical Perspective
by Zelin Xu et al

12-16-2020

PGMAN: An Unsupervised Generative Multi-adversarial Network for Pan-sharpening
by Huanyu Zhou et al

12-15-2020

Geometry Enhancements from Visual Content: Going Beyond Ground Truth
by Liran Azaria et al

12-15-2020

Domain Adaptive Object Detection via Feature Separation and Alignment
by Chengyang Liang et al

12-15-2020

Automated system to measure Tandem Gait to assess executive functions in children
by Mohammad Zaki Zadeh et al

12-18-2020

PointINet: Point Cloud Frame Interpolation Network
by Fan Lu et al

12-15-2020

Artificial Dummies for Urban Dataset Augmentation
by Antonín Vobecký et al

12-17-2020

Fast 3-dimensional estimation of the Foveal Avascular Zone from OCTA
by Giovanni Ometto et al

12-15-2020

GTA: Global Temporal Attention for Video Action Understanding
by Bo He et al

12-15-2020

End-to-end Generative Floor-plan and Layout with Attributes and Relation Graph
by Xinhan Di et al

12-15-2020

Training an Emotion Detection Classifier using Frames from a Mobile Therapeutic Game for Children with Developmental Disorders
by Peter Washington et al

12-15-2020

Dilated-Scale-Aware Attention ConvNet For Multi-Class Object Counting
by Wei Xu et al

12-15-2020

Frozen-to-Paraffin: Categorization of Histological Frozen Sections by the Aid of Paraffin Sections and Generative Adversarial Networks
by Michael Gadermayr et al

12-18-2020

Boosting Monocular Depth Estimation with Lightweight 3D Point Fusion
by Lam Huynh et al

12-15-2020

Pose Error Reduction for Focus Enhancement in Thermal Synthetic Aperture Visualization
by Indrajit Kurmi et al

12-18-2020

Assessing Pattern Recognition Performance of Neuronal Cultures through Accurate Simulation
by Gabriele Lagani et al

12-17-2020

FG-Net: Fast Large-Scale LiDAR Point CloudsUnderstanding Network Leveraging CorrelatedFeature Mining and Geometric-Aware Modelling
by Kangcheng Liu et al

12-18-2020

A Holistically-Guided Decoder for Deep Representation Learning with Applications to Semantic Segmentation and Object Detection
by Jianbo Liu et al

12-15-2020

CUDA-Optimized real-time rendering of a Foveated Visual System
by Elian Malkin et al

12-16-2020

TEMImageNet and AtomSegNet Deep Learning Training Library and Models for High-Precision Atom Segmentation, Localization, Denoising, and Super-resolution Processing of Atom-Resolution Scanning TEM Images
by Ruoqian Lin et al

12-18-2020

PC-RGNN: Point Cloud Completion and Graph Neural Network for 3D Object Detection
by Yanan Zhang et al

12-18-2020

Learning Complex 3D Human Self-Contact
by Mihai Fieraru et al

12-15-2020

Fast 3D Image Moments
by William Diggin et al

12-15-2020

Cluster, Split, Fuse, and Update: Meta-Learning for Open Compound Domain Adaptive Semantic Segmentation
by Rui Gong et al

12-15-2020

Equalization Loss v2: A New Gradient Balance Approach for Long-tailed Object Detection
by Jingru Tan et al

12-15-2020

Robots Understanding Contextual Information in Human-Centered Environments using Weakly Supervised Mask Data Distillation
by Daniel Dworakowski et al

12-18-2020

Improving 3D convolutional neural network comprehensibility via interactive visualization of relevance maps: Evaluation in Alzheimers disease
by Martin Dyrba et al

12-17-2020

CT Film Recovery via Disentangling Geometric Deformation and Illumination Variation: Simulated Datasets and Deep Models
by Quan Quan et al

12-15-2020

Personal Mental Health Navigator: Harnessing the Power of Data, Personal Models, and Health Cybernetics to Promote Psychological Well-being
by Amir M. Rahmani et al

 
Craig Smith