2022.2.28 Vision papers

 

02-24-2022

Self-Distilled StyleGAN: Towards Generation from Internet Photos
by Ron Mokady et al

02-23-2022

Self-Supervised Transformers for Unsupervised Object Discovery using Normalized Cut
by Yangtao Wang et al

02-24-2022

FreeSOLO: Learning to Segment Objects without Annotations
by Xinlong Wang et al

02-23-2022

Near Perfect GAN Inversion
by Qianli Feng et al

02-23-2022

Diffractive optical system design by cascaded propagation
by Boris Ferdman et al

02-23-2022

CAISE: Conversational Agent for Image Search and Editing
by Hyounghun Kim et al

02-24-2022

Auto-scaling Vision Transformers without Training
by Wuyang Chen et al

02-23-2022

Paying U-Attention to Textures: Multi-Stage Hourglass Vision Transformer for Universal Texture Synthesis
by Shouchang Guo et al

02-24-2022

Learning to Merge Tokens in Vision Transformers
by Cedric Renggli et al

02-24-2022

Phrase-Based Affordance Detection via Cyclic Bilateral Interaction
by Liangsheng Lu et al

02-22-2022

Retrieval Augmented Classification for Long-Tail Visual Recognition
by Alexander Long et al

02-23-2022

Commonsense Reasoning for Identifying and Understanding the Implicit Need of Help and Synthesizing Assistive Actions
by Maëlic Neau et al

02-24-2022

Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph
by Dacheng Yin et al

02-23-2022

Explanatory Paradigms in Neural Networks
by Ghassan AlRegib et al

02-23-2022

Are We Ready for Robust and Resilient SLAM? A Framework For Quantitative Characterization of SLAM Datasets
by Islam Ali et al

02-23-2022

A spectral-spatial fusion anomaly detection method for hyperspectral imagery
by Zengfu Hou et al

02-23-2022

When do GANs replicate? On the choice of dataset size
by Qianli Feng et al

02-23-2022

Thermal hand image segmentation for biometric recognition
by Xavier Font-Aragones et al

02-23-2022

Augmentation based unsupervised domain adaptation
by Mauricio Orbes-Arteaga et al

02-23-2022

Improving Robustness of Convolutional Neural Networks Using Element-Wise Activation Scaling
by Zhi-Yuan Zhang et al

02-23-2022

Weakly-supervised learning for image-based classification of primary melanomas into genomic immune subgroups
by Lucy Godson et al

02-24-2022

When Transformer Meets Robotic Grasping: Exploits Context for Efficient Grasp Detection
by Shaochen Wang et al

02-24-2022

Assessing generalisability of deep learning-based polyp detection and segmentation methods through a computer vision challenge
by Sharib Ali et al

02-23-2022

Think Global, Act Local: Dual-scale Graph Transformer for Vision-and-Language Navigation
by Shizhe Chen et al

02-23-2022

A Note on Machine Learning Approach for Computational Imaging
by Bin Dong

02-23-2022

M2I: From Factored Marginal Trajectory Prediction to Interactive Prediction
by Qiao Sun et al

02-23-2022

Controlling Memorability of Face Images
by Mohammad Younesi et al

02-23-2022

New Benchmark for Household Garbage Image Recognition
by Zhize Wu et al

02-24-2022

Rare Gems: Finding Lottery Tickets at Initialization
by Kartik Sreenivasan et al

02-23-2022

Discovering Multiple and Diverse Directions for Cognitive Image Properties
by Umut Kocasari et al

02-23-2022

Learning Multi-Object Dynamics with Compositional Neural Radiance Fields
by Danny Driess et al

02-23-2022

MITI: SLAM Benchmark for Laparoscopic Surgery
by Regine Hartwig et al

02-23-2022

ISDA: Position-Aware Instance Segmentation with Deformable Attention
by Kaining Ying et al

02-22-2022

Learning from the Pros: Extracting Professional Goalkeeper Technique from Broadcast Footage
by Matthew Wear et al

02-23-2022

EcoFusion: Energy-Aware Adaptive Sensor Fusion for Efficient Autonomous Vehicle Perception
by Arnav Vaibhav Malawade et al

02-23-2022

Reconstruction Task Finds Universal Winning Tickets
by Ruichen Li et al

02-24-2022

Towards Effective and Robust Neural Trojan Defenses via Input Filtering
by Kien Do et al

02-23-2022

Art Creation with Multi-Conditional StyleGANs
by Konstantin Dobler et al

02-22-2022

The Winning Solution to the iFLYTEK Challenge 2021 Cultivated Land Extraction from High-Resolution Remote Sensing Image
by Zhen Zhao et al

02-23-2022

Visual-tactile sensing for Real-time liquid Volume Estimation in Grasping
by Fan Zhu et al

02-23-2022

CG-SSD: Corner Guided Single Stage 3D Object Detection from LiDAR Point Cloud
by Ruiqi Ma et al

02-22-2022

Roto-Translation Equivariant Super-Resolution of Two-Dimensional Flows Using Convolutional Neural Networks
by Yuki Yasuda

02-23-2022

HMD-EgoPose: Head-Mounted Display-Based Egocentric Marker-Less Tool and Hand Pose Estimation for Augmented Surgical Guidance
by Mitchell Doughty et al

02-23-2022

Multi-Teacher Knowledge Distillation for Incremental Implicitly-Refined Classification
by Longhui Yu et al

02-25-2022

Local Intensity Order Transformation for Robust Curvilinear Object Segmentation
by Tianyi Shi et al

02-25-2022

ARIA: Adversarially Robust Image Attribution for Content Provenance
by Maksym Andriushchenko et al

02-24-2022

A Transformer-based Network for Deformable Medical Image Registration
by Yibo Wang et al

02-23-2022

A modification of the conjugate direction method for motion estimation
by Marcos Faundez-Zanuy et al

02-23-2022

On PAC-Bayesian reconstruction guarantees for VAEs
by Badr-Eddine Chérief-Abdellatif et al

02-24-2022

Provable Stochastic Optimization for Global Contrastive Learning: Small Batch Does Not Harm Performance
by Zhuoning Yuan et al

02-23-2022

On-line signature verification system with failure to enroll managing
by Joan Fabregas et al

02-24-2022

Data variation-aware medical image segmentation
by Arkadiy Dushatskiy et al

02-23-2022

RadioTransformer: A Cascaded Global-Focal Transformer for Visual Attention-guided Disease Classification
by Moinak Bhattacharya et al

02-25-2022

Learning to Identify Perceptual Bugs in 3D Video Games
by Benedict Wilkins et al

02-24-2022

Factorizer: A Scalable Interpretable Approach to Context Modeling for Medical Image Segmentation
by Pooya Ashtari et al

02-23-2022

Deep Metric Learning-Based Semi-Supervised Regression With Alternate Learning
by Adina Zell et al

02-24-2022

Structure-aware Unsupervised Tagged-to-Cine MRI Synthesis with Self Disentanglement
by Xiaofeng Liu et al

02-24-2022

Monogenic Wavelet Scattering Network for Texture Image Classification
by Wai Ho Chak et al

02-24-2022

A novel unsupervised covid lung lesion segmentation based on the lung tissue identification
by Faeze Gholamian Khah et al

02-24-2022

Slow-Fast Visual Tempo Learning for Video-based Action Recognition
by Yuanzhong Liu et al

02-22-2022

LPF-Defense: 3D Adversarial Defense based on Frequency Analysis
by Hanieh Naderi et al

02-24-2022

Learn From the Past: Experience Ensemble Knowledge Distillation
by Chaofei Wang et al

02-23-2022

Deep Bayesian ICP Covariance Estimation
by Andrea De Maio et al

02-23-2022

A Method for Waste Segregation using Convolutional Neural Networks
by Jash Shah et al

02-24-2022

Transformers in Medical Image Analysis: A Review
by Kelei He et al

02-23-2022

EMOTHAW: A novel database for emotional state recognition from handwriting
by Laurence Likforman-Sulem et al

02-24-2022

Interpolation-based Contrastive Learning for Few-Label Semi-Supervised Learning
by Xihong Yang et al

02-25-2022

Data refinement for fully unsupervised visual inspection using pre-trained networks
by Antoine Cordier et al

02-23-2022

SLRNet: Semi-Supervised Semantic Segmentation Via Label Reuse for Human Decomposition Images
by Sara Mousavi et al

02-24-2022

Uncertainty-driven Planner for Exploration and Navigation
by Georgios Georgakis et al

02-25-2022

Active Learning for Point Cloud Semantic Segmentation via Spatial-Structural Diversity Reasoning
by Feifei Shao et al

02-25-2022

An exploration of the performances achievable by combining unsupervised background subtraction algorithms
by Sébastien Piérard et al

02-24-2022

Measuring CLEVRness: Blackbox testing of Visual Reasoning Models
by Spyridon Mouselinos et al

02-24-2022

AFFDEX 2.0: A Real-Time Facial Expression Analysis Toolkit
by Mina Bishay et al

02-23-2022

Nuclei panoptic segmentation and composition regression with multi-task deep neural networks
by Satoshi Kondo et al

02-25-2022

Confidence Calibration for Object Detection and Segmentation
by Fabian Küppers et al

02-23-2022

Image Classification on Small Datasets via Masked Feature Mixing
by Christoph Reinders et al

02-23-2022

Absolute Zero-Shot Learning
by Rui Gao et al

02-25-2022

6D Rotation Representation For Unconstrained Head Pose Estimation
by Thorsten Hempel et al

02-24-2022

SMILE: Sequence-to-Sequence Domain Adaption with Minimizing Latent Entropy for Text Image Recognition
by Yen-Cheng Chang et al

02-24-2022

Domain Disentangled Generative Adversarial Network for Zero-Shot Sketch-Based 3D Shape Retrieval
by Rui Xu et al

02-23-2022

Amodal Panoptic Segmentation
by Rohit Mohan et al

02-23-2022

Multi-scale Sparse Representation-Based Shadow Inpainting for Retinal OCT Images
by Yaoqi Tang et al

02-23-2022

Localizing Small Apples in Complex Apple Orchard Environments
by Christian Wilms et al

02-25-2022

Predicting 4D Liver MRI for MR-guided Interventions
by Gino Gulamhussene et al

02-25-2022

An Ensemble Approach for Patient Prognosis of Head and Neck Tumor Using Multimodal Data
by Numan Saeed et al

02-24-2022

Fully Self-Supervised Learning for Semantic Segmentation
by Yuan Wang et al

02-24-2022

N-QGN: Navigation Map from a Monocular Camera using Quadtree Generating Networks
by Daniel Braun et al

02-25-2022

LF-VIO: A Visual-Inertial-Odometry Framework for Large Field-of-View Cameras with Negative Plane
by Ze Wang et al

02-25-2022

RRL:Regional Rotation Layer in Convolutional Neural Networks
by Zongbo Hao et al

02-24-2022

Computer Aided Diagnosis and Out-of-Distribution Detection in Glaucoma Screening Using Color Fundus Photography
by Satoshi Kondo et al

02-25-2022

A Novel Hand Gesture Detection and Recognition system based on ensemble-based Convolutional Neural Network
by Abir Sen et al

02-25-2022

TeachAugment: Data Augmentation Optimization Using Teacher Knowledge
by Teppei Suzuki

02-23-2022

ProFormer: Learning Data-efficient Representations of Body Movement with Prototype-based Feature Augmentation and Visual Transformers
by Kunyu Peng et al

02-23-2022

Mixed-Block Neural Architecture Search for Medical Image Segmentation
by Martijn M. A. Bosma et al

02-23-2022

Human Motion Detection Using Sharpened Dimensionality Reduction and Clustering
by Jeewon Heo et al

02-23-2022

Anomaly Detection in 3D Point Clouds using Deep Geometric Descriptors
by Paul Bergmann et al

02-25-2022

Implicit Optimizer for Diffeomorphic Image Registration
by Kun Han et al

02-25-2022

Sensing accident-prone features in urban scenes for proactive driving and accident prevention
by Sumit Mishra et al

02-22-2022

An End-to-End Cascaded Image Deraining and Object Detection Neural Network
by Kaige Wang et al

02-24-2022

DeepFusionMOT: A 3D Multi-Object Tracking Framework Based on Camera-LiDAR Fusion with Deep Association
by Xiyang Wang et al

02-25-2022

On Modality Bias Recognition and Reduction
by Yangyang Guo et al

02-25-2022

Faithful learning with sure data for lung nodule diagnosis
by Hanxiao Zhang et al

02-22-2022

Reliable Inlier Evaluation for Unsupervised Point Cloud Registration
by Yaqi Shen et al

02-23-2022

A comparative study of in-air trajectories at short and long distances in online handwriting
by Carlos Alonso-Martinez et al

02-25-2022

Reconstruction of Perceived Images from fMRI Patterns and Semantic Brain Exploration using Instance-Conditioned GANs
by Furkan Ozcelik et al

02-23-2022

SLOGAN: Handwriting Style Synthesis for Arbitrary-Length and Out-of-Vocabulary Text
by Canjie Luo et al

02-25-2022

RELMOBNET: A Robust Two-Stage End-To-End Training Approach For MOBILENETV3 Based Relative Camera Pose Estimation
by Praveen Kumar Rajendran et al

02-24-2022

Light Robust Monocular Depth Estimation For Outdoor Environment Via Monochrome And Color Camera Fusion
by Hyeonsoo Jang et al

02-24-2022

TwistSLAM: Constrained SLAM in Dynamic Environment
by Mathieu Gonzalez et al

02-25-2022

Improving Amharic Handwritten Word Recognition Using Auxiliary Task
by Mesay Samuel Gondere et al

02-25-2022

NeuralFusion: Neural Volumetric Rendering under Human-object Interactions
by Yuheng Jiang et al

02-25-2022

Improving generalization with synthetic training data for deep learning based quality inspection
by Antoine Cordier et al

02-23-2022

Deepfake Detection for Facial Images with Facemasks
by Donggeun Ko et al

02-25-2022

Deep Dirichlet uncertainty for unsupervised out-of-distribution detection of eye fundus photographs in glaucoma screening
by Teresa Araújo et al

02-23-2022

Skeleton Sequence and RGB Frame Based Multi-Modality Feature Fusion Network for Action Recognition
by Xiaoguang Zhu et al

02-22-2022

Indiscriminate Poisoning Attacks on Unsupervised Contrastive Learning
by Hao He et al

02-24-2022

Effective Actor-centric Human-object Interaction Detection
by Kunlun Xu et al

02-25-2022

Towards Safe, Real-Time Systems: Stereo vs Images and LiDAR for 3D Object Detection
by Matthew Levine

02-25-2022

Improved Dual Correlation Reduction Network
by Yue Liu et al

02-25-2022

Joint Answering and Explanation for Visual Commonsense Reasoning
by Zhenyang Li et al

02-24-2022

Understanding Adversarial Robustness from Feature Maps of Convolutional Layers
by Cong Xu et al

02-24-2022

GIAOTracker: A comprehensive framework for MCMOT with global information and optimizing strategies in VisDrone 2021
by Yunhao Du et al

02-23-2022

A Novel Self-Supervised Cross-Modal Image Retrieval Method In Remote Sensing
by Gencer Sumbul et al

02-22-2022

FUNQUE: Fusion of Unified Quality Evaluators
by Abhinau K. Venkataramanan et al

02-22-2022

Evaluating Feature Attribution Methods in the Image Domain
by Arne Gevaert et al

02-23-2022

Synthesizing Photorealistic Images with Deep Generative Learning
by Chuanxia Zheng

02-24-2022

The effect of fatigue on the performance of online writer recognition
by Enric Sesa-Nogueras et al

02-24-2022

Online handwriting, signature and touch dynamics: tasks and potential applications in the field of security and health
by Marcos Faundez-Zanuy et al

02-24-2022

Fourier-Based Augmentations for Improved Robustness and Uncertainty Calibration
by Ryan Soklaski et al

02-24-2022

Learning Transferable Reward for Query Object Localization with Policy Adaptation
by Tingfeng Li et al

02-24-2022

Efficient Video Segmentation Models with Per-frame Inference
by Yifan Liu et al

02-24-2022

Analyzing Human Observer Ability in Morphing Attack Detection -- Where Do We Stand?
by Sankini Rancha Godage et al

02-22-2022

Enabling Efficient Deep Convolutional Neural Network-based Sensor Fusion for Autonomous Driving
by Xiaoming Zeng et al

02-24-2022

Optimal channel selection with discrete QCQP
by Yeonwoo Jeong et al

02-22-2022

Learning with Free Object Segments for Long-Tailed Instance Segmentation
by Cheng Zhang et al

02-24-2022

Instantaneous Physiological Estimation using Video Transformers
by Ambareesh Revanur et al

02-24-2022

On Monocular Depth Estimation and Uncertainty Quantification using Classification Approaches for Regression
by Xuanlong Yu et al

02-24-2022

Highly-Efficient Binary Neural Networks for Visual Place Recognition
by Bruno Ferrarini et al

02-22-2022

Arbitrary Shape Text Detection using Transformers
by Zobeir Raisi et al

02-24-2022

Time Efficient Training of Progressive Generative Adversarial Network using Depthwise Separable Convolution and Super Resolution Generative Adversarial Network
by Atharva Karwande et al

02-24-2022

RescueNet: A High Resolution UAV Semantic Segmentation Benchmark Dataset for Natural Disaster Damage Assessment
by Tashnim Chowdhury et al

02-24-2022

StyleCLIPDraw: Coupling Content and Style in Text-to-Drawing Translation
by Peter Schaldenbrand et al

 
Craig Smith