2022.5.2 Vision papers

 

04-26-2022

PolyLoss: A Polynomial Expansion Perspective of Classification Loss Functions
by Zhaoqi Leng et al

04-28-2022

CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers
by Ming Ding et al

04-28-2022

NeurMiPs: Neural Mixture of Planar Experts for View Synthesis
by Zhi-Hao Lin et al

04-26-2022

ClothFormer: Taming Video Virtual Try-on in All Module
by Jianbin Jiang et al

04-26-2022

Understanding The Robustness in Vision Transformers
by Daquan Zhou et al

04-27-2022

Use All The Labels: A Hierarchical Multi-Label Contrastive Learning Framework
by Shu Zhang et al

04-28-2022

HuMMan: Multi-Modal 4D Human Dataset for Versatile Sensing and Modeling
by Zhongang Cai et al

04-28-2022

Unlocking High-Accuracy Differentially Private Image Classification through Scale
by Soham De et al

04-26-2022

From One Hand to Multiple Hands: Imitation Learning for Dexterous Manipulation from Single-Camera Teleoperation
by Yuzhe Qin et al

04-27-2022

Dataset for Robust and Accurate Leading Vehicle Velocity Recognition
by Genya Ogawa et al

04-27-2022

Few-Shot Head Swapping in the Wild
by Changyong Shu et al

04-27-2022

Grasping the Arrow of Time from the Singularity: Decoding Micromotion in Low-dimensional Latent Spaces from StyleGAN
by Qiucheng Wu et al

04-29-2022

Flamingo: a Visual Language Model for Few-Shot Learning
by Jean-Baptiste Alayrac et al

04-26-2022

Density-preserving Deep Point Cloud Compression
by Yun He et al

04-29-2022

PyramidCLIP: Hierarchical Feature Alignment for Vision-language Model Pretraining
by Yuting Gao et al

04-26-2022

Expanding the Latent Space of StyleGAN for Real Face Editing
by Yin Yu et al

04-28-2022

List-Mode PET Image Reconstruction Using Deep Image Prior
by Kibo Ote et al

04-28-2022

Keep the Caption Information: Preventing Shortcut Learning in Contrastive Image-Caption Retrieval
by Maurits Bleeker et al

04-28-2022

Articulated Objects in Free-form Hand Interaction
by Zicong Fan et al

04-28-2022

Two Decades of Colorization and Decolorization for Images and Videos
by Shiguang Liu

04-27-2022

The MeVer DeepFake Detection Service: Lessons Learnt from Developing and Deploying in the Wild
by Spyridon Baxevanakis et al

04-28-2022

An Overview of Color Transfer and Style Transfer for Images and Videos
by Shiguang Liu

04-27-2022

Adversarial Fine-tune with Dynamically Regulated Adversary
by Pengyue Hou et al

04-29-2022

Fix the Noise: Disentangling Source Feature for Transfer Learning of StyleGAN
by Dongyeun Lee et al

04-27-2022

Offline Visual Representation Learning for Embodied Navigation
by Karmesh Yadav et al

04-29-2022

Leaner and Faster: Two-Stage Model Compression for Lightweight Text-Image Retrieval
by Siyu Ren et al

04-26-2022

RadioPathomics: Multimodal Learning in Non-Small Cell Lung Cancer for Adaptive Radiotherapy
by Matteo Tortora et al

04-26-2022

On Fragile Features and Batch Normalization in Adversarial Training
by Nils Philipp Walter et al

04-28-2022

Reliable Visual Question Answering: Abstain Rather Than Answer Incorrectly
by Spencer Whitehead et al

04-28-2022

Continual Learning with Bayesian Model based on a Fixed Pre-trained Feature Extractor
by Yang Yang et al

04-26-2022

Deeper Insights into ViTs Robustness towards Common Corruptions
by Rui Tian et al

04-28-2022

Vision-Language Pre-Training for Boosting Scene Text Detectors
by Sibo Song et al

04-28-2022

Mixup-based Deep Metric Learning Approaches for Incomplete Supervision
by Luiz H. Buris et al

04-27-2022

An Iterative Labeling Method for Annotating Fisheries Imagery
by Zhiyong Zhang et al

04-26-2022

MM-TTA: Multi-Modal Test-Time Adaptation for 3D Semantic Segmentation
by Inkyu Shin et al

04-26-2022

Stochastic Coherence Over Attention Trajectory For Continuous Learning In Video Streams
by Matteo Tiezzi et al

04-26-2022

Where and What: Driver Attention-based Object Detection
by Yao Rong et al

04-26-2022

A survey on attention mechanisms for medical applications: are we moving towards better algorithms?
by Tiago Gonçalves et al

04-28-2022

Rotationally Equivariant 3D Object Detection
by Hong-Xing Yu et al

04-28-2022

Poly-CAM: High resolution class activation map for convolutional neural networks
by Alexandre Englebert et al

04-29-2022

A Challenging Benchmark of Anime Style Recognition
by Haotang Li et al

04-26-2022

An Algorithm for the Labeling and Interactive Visualization of the Cerebrovascular System of Ischemic Strokes
by Florian Thamm et al

04-28-2022

Unsupervised Spatial-spectral Hyperspectral Image Reconstruction and Clustering with Diffusion Geometry
by Kangning Cui et al

04-28-2022

Oracle Guided Image Synthesis with Relative Queries
by Alec Helbling et al

04-26-2022

AAU-net: An Adaptive Attention U-net for Breast Lesions Segmentation in Ultrasound Images
by Gongping Chen et al

04-26-2022

Unsupervised Segmentation of Hyperspectral Remote Sensing Images with Superpixels
by Mirko Paolo Barbato et al

04-27-2022

Interpretable Graph Convolutional Network of Multi-Modality Brain Imaging for Alzheimer's Disease Diagnosis
by Houliang Zhou et al

04-26-2022

An Overview of Recent Work in Media Forensics: Methods and Threats
by Kratika Bhagtani et al

04-28-2022

BAGNet: Bidirectional Aware Guidance Network for Malignant Breast lesions Segmentation
by Gongping Chen et al

04-28-2022

Computer Vision for Road Imaging and Pothole Detection: A State-of-the-Art Review of Systems and Algorithms
by Nachuan Ma et al

04-26-2022

Understanding the Impact of Edge Cases from Occluded Pedestrians for ML Systems
by Jens Henriksson et al

04-28-2022

Learning to Split for Automatic Bias Detection
by Yujia Bao et al

04-26-2022

A Comparative Study on Approaches to Acoustic Scene Classification using CNNs
by Ishrat Jahan Ananya et al

04-27-2022

Minimizing Client Drift in Federated Learning via Adaptive Bias Estimation
by Farshid Varno et al

04-28-2022

TJ4DRadSet: A 4D Radar Dataset for Autonomous Driving
by Lianqing Zheng et al

04-28-2022

MMRotate: A Rotated Object Detection Benchmark using Pytorch
by Yue Zhou et al

04-27-2022

A Scalable Combinatorial Solver for Elastic Geometrically Consistent 3D Shape Matching
by Paul Roetzer et al

04-27-2022

Self-Driving Car Steering Angle Prediction: Let Transformer Be a Car Again
by Chingis Oinar et al

04-28-2022

Deep Orientation-Aware Functional Maps: Tackling Symmetry Issues in Shape Matching
by Nicolas Donati et al

04-27-2022

Epicardial Adipose Tissue Segmentation from CT Images with A Semi-3D Neural Network
by Marin Benčević et al

04-26-2022

SCGC: Self-Supervised Contrastive Graph Clustering
by Gayan K. Kulatilleke et al

04-28-2022

Goldilocks-curriculum Domain Randomization and Fractal Perlin Noise with Application to Sim2Real Pneumonia Lesion Detection
by Takahiro Suzuki et al

04-28-2022

COVID-Net US-X: Enhanced Deep Neural Network for Detection of COVID-19 Patient Cases from Convex Ultrasound Imaging Through Extended Linear-Convex Ultrasound Augmentation Learning
by E. Zhixuan Zeng et al

04-26-2022

Learning Dual-Pixel Alignment for Defocus Deblurring
by Yu Li et al

04-27-2022

Human-Centered Prior-Guided and Task-Dependent Multi-Task Representation Learning for Action Recognition Pre-Training
by Guanhong Wang et al

04-28-2022

On the Role of Field of View for Occlusion Removal with Airborne Optical Sectioning
by Francis Seits et al

04-26-2022

Neural Maximum A Posteriori Estimation on Unpaired Data for Motion Deblurring
by Youjian Zhang et al

04-28-2022

Tragedy Plus Time: Capturing Unintended Human Activities from Weakly-labeled Videos
by Arnav Chakravarthy et al

04-28-2022

Audio-Visual Contrastive Learning for Self-supervised Action Recognition
by Haoyuan Lan et al

04-28-2022

Deep Generalized Unfolding Networks for Image Restoration
by Chong Mou et al

04-26-2022

Coarse-to-fine Q-attention with Tree Expansion
by Stephen James et al

04-28-2022

Temporal Progressive Attention for Early Action Prediction
by Alexandros Stergiou et al

04-27-2022

A Multi-Head Convolutional Neural Network With Multi-path Attention improves Image Denoising
by Jiahong Zhang et al

04-28-2022

Morphing Attack Potential
by Matteo Ferrara et al

04-28-2022

Lightweight Bimodal Network for Single-Image Super-Resolution via Symmetric CNN and Recursive Transformer
by Guangwei Gao et al

04-28-2022

Resource-efficient domain adaptive pre-training for medical images
by Yasar Mehmood et al

04-28-2022

Generative Adversarial Networks for Image Super-Resolution: A Survey
by Chunwei Tian et al

04-28-2022

SemAttNet: Towards Attention-based Semantic Aware Guided Depth Completion
by Danish Nazir et al

04-26-2022

Restricted Black-box Adversarial Attack Against DeepFake Face Swapping
by Junhao Dong et al

04-28-2022

Depth Estimation with Simplified Transformer
by John Yang et al

04-26-2022

Robust Face Anti-Spoofing with Dual Probabilistic Modeling
by Yuanhan Zhang et al

04-26-2022

Sound Localization by Self-Supervised Time Delay Estimation
by Ziyang Chen et al

04-28-2022

Discriminative-Region Attention and Orthogonal-View Generation Model for Vehicle Re-Identification
by Huadong Li et al

04-27-2022

Mapping suburban bicycle lanes using street scene images and deep learning
by Tyler Saxton

04-28-2022

Unified Simulation, Perception, and Generation of Human Behavior
by Ye Yuan

04-28-2022

Inverse-Designed Meta-Optics with Spectral-Spatial Engineered Response to Mimic Color Perception
by Chris Munley et al

04-28-2022

A Closer Look at Branch Classifiers of Multi-exit Architectures
by Shaohui Lin et al

04-28-2022

Semi-MoreGAN: A New Semi-supervised Generative Adversarial Network for Mixture of Rain Removal
by Yiyang Shen et al

04-26-2022

Evaluating the Quality of a Synthesized Motion with the Fréchet Motion Distance
by Antoine Maiorca et al

04-26-2022

Focal Sparse Convolutional Networks for 3D Object Detection
by Yukang Chen et al

04-26-2022

Meta-free representation learning for few-shot learning via stochastic weight averaging
by Kuilin Chen et al

04-26-2022

Optimized latent-code selection for explainable conditional text-to-image GANs
by Zhenxing Zhang et al

04-29-2022

Multiple Degradation and Reconstruction Network for Single Image Denoising via Knowledge Distillation
by Juncheng Li et al

04-26-2022

Contrastive Language-Action Pre-training for Temporal Localization
by Mengmeng Xu et al

04-26-2022

Instance-Specific Feature Propagation for Referring Segmentation
by Chang Liu et al

04-28-2022

Equine radiograph classification using deep convolutional neural networks
by Raniere Gaia Costa da Silva et al

04-27-2022

BBBD: Bounding Box Based Detector for Occlusion Detection and Order Recovery
by Kaziwa Saleh et al

04-27-2022

Collaborative Learning for Hand and Object Reconstruction with Attention-guided Graph Convolution
by Tze Ho Elden Tse et al

04-27-2022

PRE-NAS: Predictor-assisted Evolutionary Neural Architecture Search
by Yameng Peng et al

04-27-2022

Towards assessing agricultural land suitability with causal machine learning
by Georgios Giannarakis et al

04-26-2022

U-Net with ResNet Backbone for Garment Landmarking Purpose
by Khay Boon Hong

04-29-2022

Struct-MDC: Mesh-Refined Unsupervised Depth Completion Leveraging Structural Regularities from Visual SLAM
by Jinwoo Jeon et al

04-28-2022

Noise-reducing attention cross fusion learning transformer for histological image classification of osteosarcoma
by Liangrui Pan et al

04-26-2022

RAPQ: Rescuing Accuracy for Power-of-Two Low-bit Post-training Quantization
by Hongyi Yao et al

04-28-2022

Symmetric Transformer-based Network for Unsupervised Image Registration
by Mingrui Ma et al

04-26-2022

Differentiable Zooming for Multiple Instance Learning on Whole-Slide Images
by Kevin Thandiackal et al

04-26-2022

MILES: Visual BERT Pre-training with Injected Language Semantics for Video-text Retrieval
by Yuying Ge et al

04-27-2022

Power Bundle Adjustment for Large-Scale 3D Reconstruction
by Simon Weber et al

04-28-2022

Learning to Extract Building Footprints from Off-Nadir Aerial Images
by Jinwang Wang et al

04-26-2022

ROMA: Cross-Domain Region Similarity Matching for Unpaired Nighttime Infrared to Daytime Visible Video Translation
by Zhenjie Yu et al

04-28-2022

Learning cosmology and clustering with cosmic graphs
by Pablo Villanueva-Domingo et al

04-26-2022

TranSiam: Fusing Multimodal Visual Features Using Transformer for Medical Image Segmentation
by Xuejian Li et al

04-27-2022

DearKD: Data-Efficient Early Knowledge Distillation for Vision Transformers
by Xianing Chen et al

04-27-2022

Defending Against Person Hiding Adversarial Patch Attack with a Universal White Frame
by Youngjoon Yu et al

04-28-2022

Region-level Contrastive and Consistency Learning for Semi-Supervised Semantic Segmentation
by Jianrong Zhang et al

04-28-2022

Unsupervised Multi-Modal Medical Image Registration via Discriminator-Free Image-to-Image Translation
by Zekang Chen et al

04-28-2022

GRIT: General Robust Image Task Benchmark
by Tanmay Gupta et al

04-27-2022

Talking Head Generation Driven by Speech-Related Facial Action Units and Audio- Based on Multimodal Representation Fusion
by Sen Chen et al

04-26-2022

ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation
by Yufei Xu et al

04-26-2022

Acquiring a Dynamic Light Field through a Single-Shot Coded Image
by Ryoya Mizuno et al

04-28-2022

Controllable Image Captioning
by Luka Maxwell

04-28-2022

Streaming Multiscale Deep Equilibrium Models
by Can Ufuk Ertenli et al

04-27-2022

Conformer and Blind Noisy Students for Improved Image Quality Assessment
by Marcos V. Conde et al

04-27-2022

CATrans: Context and Affinity Transformer for Few-Shot Segmentation
by Shan Zhang et al

04-26-2022

Causal Transportability for Visual Recognition
by Chengzhi Mao et al

04-28-2022

KING: Generating Safety-Critical Driving Scenarios for Robust Imitation via Kinematics Gradients
by Niklas Hanselmann et al

04-29-2022

Using 3D Shadows to Detect Object Hiding Attacks on Autonomous Vehicle Perception
by Zhongyuan Hau et al

04-27-2022

Forecasting Urban Development from Satellite Images
by Nando Metzger

04-28-2022

Learning from Pixel-Level Noisy Label: A New Perspective for Light Field Saliency Detection
by Mingtao Feng et al

04-26-2022

Unified GCNs: Towards Connecting GCNs with CNNs
by Ziyan Zhang et al

04-28-2022

AE-NeRF: Auto-Encoding Neural Radiance Fields for 3D-Aware Object Manipulation
by Mira Kim et al

04-28-2022

Hybrid Relation Guided Set Matching for Few-shot Action Recognition
by Xiang Wang et al

04-27-2022

Dropout Inference with Non-Uniform Weight Scaling
by Zhaoyuan Yang et al

04-26-2022

Attentive Fine-Grained Structured Sparsity for Image Restoration
by Junghun Oh et al

04-28-2022

GenDR: A Generalized Differentiable Renderer
by Felix Petersen et al

04-26-2022

Context-Aware Sequence Alignment using 4D Skeletal Augmentation
by Taein Kwon et al

04-29-2022

AdaInt: Learning Adaptive Intervals for 3D Lookup Tables on Real-time Image Enhancement
by Canqian Yang et al

04-29-2022

Learning Adaptive Warping for Real-World Rolling Shutter Correction
by Mingdeng Cao et al

04-27-2022

Self-Supervised Text Erasing with Controllable Image Synthesis
by Gangwei Jiang et al

04-26-2022

Intercategorical Label Interpolation for Emotional Face Generation with Conditional Generative Adversarial Networks
by Silvan Mertes et al

04-27-2022

Person Re-Identification
by Mustafa Ebrahim Chasmai et al

04-27-2022

SSR-GNNs: Stroke-based Sketch Representation with Graph Neural Networks
by Sheng Cheng et al

04-29-2022

Privacy-Preserving Model Upgrades with Bidirectional Compatible Training in Image Retrieval
by Shupeng Su et al

04-26-2022

Multi stain graph fusion for multimodal integration in pathology
by Chaitanya Dwivedi et al

04-27-2022

Attention Consistency on Visual Corruptions for Single-Source Domain Generalization
by Ilke Cugu et al

04-27-2022

3D Magic Mirror: Clothing Reconstruction from a Single Image via a Causal Perspective
by Zhedong Zheng et al

04-27-2022

An Improved Nearest Neighbour Classifier
by Eric Setterqvist et al

04-29-2022

A Deep Learning based No-reference Quality Assessment Model for UGC Videos
by Wei Sun et al

04-27-2022

MAPLE-Edge: A Runtime Latency Predictor for Edge Devices
by Saeejith Nair et al

04-26-2022

Adaptive Split-Fusion Transformer
by Zixuan Su et al

04-27-2022

Ollivier-Ricci Curvature For Head Pose Estimation From a Single Image
by Lucia Cascone et al

04-27-2022

Relevance-based Margin for Contrastively-trained Video Retrieval Models
by Alex Falcon et al

04-27-2022

Gleo-Det: Deep Convolution Feature-Guided Detector with Local Entropy Optimization for Salient Points
by Chao Li et al

04-29-2022

Deep Geometry Post-Processing for Decompressed Point Clouds
by Xiaoqing Fan et al

04-29-2022

Preoperative brain tumor imaging: models and software for segmentation and standardized reporting
by D. Bouget et al

04-29-2022

SCS-Co: Self-Consistent Style Contrastive Learning for Image Harmonization
by Yucheng Hang et al

04-27-2022

Self-Supervised Learning of Object Parts for Semantic Segmentation
by Adrian Ziegler et al

04-26-2022

Urban Change Detection Using a Dual-Task Siamese Network and Semi-Supervised Learning
by Sebastian Hafner et al

04-26-2022

Boosting Adversarial Transferability of MLP-Mixer
by Haoran Lyu et al

04-27-2022

HRDA: Context-Aware High-Resolution Domain-Adaptive Semantic Segmentation
by Lukas Hoyer et al

04-28-2022

Where in the World is this Image? Transformer-based Geo-localization in the Wild
by Shraman Pramanick et al

04-26-2022

Generating Topological Structure of Floorplans from Room Attributes
by Yin Yu et al

04-28-2022

Automatic Detection and Classification of Symbols in Engineering Drawings
by Sourish Sarkar et al

04-29-2022

Segmentation of kidney stones in endoscopic video feeds
by Zachary A Stoebner et al

04-27-2022

Global Trajectory Helps Person Retrieval in a Camera Network
by Xin Zhang et al

04-27-2022

CapOnImage: Context-driven Dense-Captioning on Image
by Yiqi Gao et al

04-26-2022

Improving the Transferability of Adversarial Examples with Restructure Embedded Patches
by Huipeng Zhou et al

04-29-2022

Hardware Trojan Detection Using Unsupervised Deep Learning on Quantum Diamond Microscope Magnetic Field Images
by Maitreyi Ashok et al

04-26-2022

Coupled Iterative Refinement for 6D Multi-Object Pose Estimation
by Lahav Lipson et al

04-29-2022

Improving Transferability for Domain Adaptive Detection Transformers
by Kaixiong Gong et al

04-26-2022

Building Change Detection using Multi-Temporal Airborne LiDAR Data
by Ritu Yadav et al

04-29-2022

SideRT: A Real-time Pure Transformer Architecture for Single Image Depth Estimation
by Chang Shu et al

04-29-2022

C3-STISR: Scene Text Image Super-resolution with Triple Clues
by Minyi Zhao et al

04-27-2022

Low-rank Meets Sparseness: An Integrated Spatial-Spectral Total Variation Approach to Hyperspectral Denoising
by Haijin Zeng et al

04-29-2022

CLIP-Art: Contrastive Pre-training for Fine-Grained Art Classification
by Marcos V. Conde et al

04-29-2022

OSSGAN: Open-Set Semi-Supervised Image Generation
by Kai Katsumata et al

04-26-2022

Evaluation of Self-taught Learning-based Representations for Facial Emotion Recognition
by Bruna Delazeri et al

04-29-2022

Towards Automatic Parsing of Structured Visual Content through the Use of Synthetic Data
by Lukas Scholch et al

04-29-2022

Adversarial Distortion Learning for Medical Image Denoising
by Morteza Ghahremani et al

04-26-2022

The Influence of the Other-Race Effect on Susceptibility to Face Morphing Attacks
by Snipta Mallick et al

04-29-2022

Neural Implicit Representations for Physical Parameter Inference from a Single Video
by Florian Hofherr et al

04-29-2022

Seeing without Looking: Analysis Pipeline for Child Sexual Abuse Datasets
by Camila Laranjeira et al

04-29-2022

Learning Localization-aware Target Confidence for Siamese Visual Tracking
by Jiahao Nie et al

04-28-2022

Understanding the impact of image and input resolution on deep digital pathology patch classifiers
by Eu Wern Teh et al

04-29-2022

EndoMapper dataset of complete calibrated endoscopy procedures
by Pablo Azagra et al

04-26-2022

A Close Look into Human Activity Recognition Models using Deep Learning
by Wei Zhong Tee et al

04-26-2022

Leveraging Unlabeled Data for Sketch-based Understanding
by Javier Morales et al

04-26-2022

AccMPEG: Optimizing Video Encoding for Video Analytics
by Kuntai Du et al

04-28-2022

Unsupervised Voice-Face Representation Learning by Cross-Modal Prototype Contrast
by Boqing Zhu et al

04-28-2022

One Model to Synthesize Them All: Multi-contrast Multi-scale Transformer for Missing Data Imputation
by Jiang Liu et al

04-26-2022

Unsupervised Learning of Unbiased Visual Representations
by Carlo Alberto Barbano et al

04-28-2022

Coupling Deep Imputation with Multitask Learning for Downstream Tasks on Genomics Data
by Sophie Peacock et al

04-27-2022

Channel Pruned YOLOv5-based Deep Learning Approach for Rapid and Accurate Outdoor Obstacles Detection
by Zeqian Li et al

 
Craig Smith