2022.3.7 Vision papers

 

03-01-2022

Generative Adversarial Networks
by Gilad Cohen et al

03-03-2022

Understanding Failure Modes of Self-Supervised Learning
by Neha Mukund Kalibhat et al

03-03-2022

Efficient Video Instance Segmentation via Tracklet Query and Proposal
by Jialian Wu et al

03-03-2022

BatchFormer: Learning to Explore Sample Relationships for Robust Representation Learning
by Zhi Hou et al

03-03-2022

NeRF-Supervision: Learning Dense Object Descriptors from Neural Radiance Fields
by Lin Yen-Chen et al

03-02-2022

TableFormer: Table Structure Understanding with Transformers
by Ahmed Nassar et al

03-03-2022

Recovering 3D Human Mesh from Monocular Images: A Survey
by Yating Tian et al

03-01-2022

Variational Autoencoders Without the Variation
by Gregory A. Daly et al

03-01-2022

CLIP-GEN: Language-Free Training of a Text-to-Image Generator with CLIP
by Zihao Wang et al

03-03-2022

Playable Environments: Video Manipulation in Space and Time
by Willi Menapace et al

03-04-2022

Freeform Body Motion Generation from Speech
by Jing Xu et al

03-01-2022

D^2ETR: Decoder-Only DETR with Computationally Efficient Cross-Scale Attention
by Junyu Lin et al

03-03-2022

Vision-Language Intelligence: Tasks, Representation Learning, and Large Models
by Feng Li et al

03-04-2022

DiT: Self-supervised Pre-training for Document Image Transformer
by Junlong Li et al

03-02-2022

HighMMT: Towards Modality and Task Generalization for High-Modality Representation Learning
by Paul Pu Liang et al

03-03-2022

Mind the Gap: Understanding the Modality Gap in Multi-modal Contrastive Representation Learning
by Weixin Liang et al

03-01-2022

Self-Supervised Vision Transformers Learn Visual Concepts in Histopathology
by Richard J. Chen et al

03-01-2022

Affordance Learning from Play for Sample-Efficient Policy Learning
by Jessica Borja-Diaz et al

03-01-2022

Recent, rapid advancement in visual question answering architecture
by Venkat Kodali et al

03-03-2022

PINA: Learning a Personalized Implicit Neural Avatar from a Single RGB-D Video Sequence
by Zijian Dong et al

03-01-2022

Unsupervised Vision-and-Language Pre-training via Retrieval-based Multi-Granular Alignment
by Mingyang Zhou et al

03-03-2022

A Deep Neural Framework for Image Caption Generation Using GRU-Based Attention Mechanism
by Rashid Khan et al

03-01-2022

InCloud: Incremental Learning for Point Cloud Place Recognition
by Joshua Knights et al

03-01-2022

Styleverse: Towards Identity Stylization across Heterogeneous Domains
by Jia Li et al

03-01-2022

Benchmarking Robustness of Deep Learning Classifiers Using Two-Factor Perturbation
by Wei Dai et al

03-03-2022

Autoregressive Image Generation using Residual Quantization
by Doyup Lee et al

03-01-2022

Towards Creativity Characterization of Generative Models via Group-based Subset Scanning
by Celia Cintas et al

03-01-2022

CrossPoint: Self-Supervised Cross-Modal Contrastive Learning for 3D Point Cloud Understanding
by Mohamed Afham et al

03-02-2022

Hyperspectral Pixel Unmixing with Latent Dirichlet Variational Autoencoder
by Kiran Mantripragada et al

03-03-2022

Random Quantum Neural Networks (RQNN) for Noisy Image Recognition
by Debanjan Konar et al

03-03-2022

ROCT-Net: A new ensemble deep convolutional model with improved spatial resolution learning for detecting common diseases from retinal OCT images
by Mohammad Rahimzadeh et al

03-02-2022

DisARM: Displacement Aware Relation Module for 3D Detection
by Yao Duan et al

03-03-2022

Investigating the limited performance of a deep-learning-based SPECT denoising approach: An observer-study-based characterization
by Zitong Yu et al

03-03-2022

Interactive Image Synthesis with Panoptic Layout Generation
by Bo Wang et al

03-02-2022

MetaDT: Meta Decision Tree for Interpretable Few-Shot Learning
by Baoquan Zhang et al

03-02-2022

PetsGAN: Rethinking Priors for Single Image Generation
by Zicheng Zhang et al

03-01-2022

Multi-Task Multi-Scale Learning For Outcome Prediction in 3D PET Images
by Amine Amyar et al

03-03-2022

Capturing Shape Information with Multi-Scale Topological Loss Terms for 3D Reconstruction
by Dominik J. E. Waibel et al

03-03-2022

Selective Residual M-Net for Real Image Denoising
by Chi-Mao Fan et al

03-01-2022

How certain are your uncertainties?
by Luke Whitbread et al

03-03-2022

NeuroFluid: Fluid Dynamics Grounding with Particle-Driven Neural Radiance Fields
by Shanyan Guan et al

03-01-2022

Colon Nuclei Instance Segmentation using a Probabilistic Two-Stage Detector
by Pedro Costa et al

03-01-2022

Compliance Challenges in Forensic Image Analysis Under the Artificial Intelligence Act
by Benedikt Lorch et al

03-02-2022

Differentiable IFS Fractals
by Cory Braker Scott

03-02-2022

Enhancing Adversarial Robustness for Deep Metric Learning
by Mo Zhou et al

03-03-2022

Recent Advances in Vision Transformer: A Survey and Outlook of Recent Work
by Khawar Islam

03-03-2022

Detecting High-Quality GAN-Generated Face Images using Neural Networks
by Ehsan Nowroozi et al

03-03-2022

DIME: Fine-grained Interpretations of Multimodal Models via Disentangled Local Explanations
by Yiwei Lyu et al

03-01-2022

Towards IID representation learning and its application on biomedical data
by Jiqing Wu et al

03-03-2022

Label-Only Model Inversion Attacks via Boundary Repulsion
by Mostafa Kahla et al

03-03-2022

On Learning Contrastive Representations for Learning with Noisy Labels
by Li Yi et al

03-02-2022

Protecting Celebrities with Identity Consistency Transformer
by Xiaoyi Dong et al

03-01-2022

Generalizable Person Re-Identification via Self-Supervised Batch Norm Test-Time Adaption
by Ke Han et al

03-03-2022

Ensembles of Vision Transformers as a New Paradigm for Automated Classification in Ecology
by S. Kyathanahally et al

03-03-2022

TCTrack: Temporal Contexts for Aerial Tracking
by Ziang Cao et al

03-01-2022

Can No-reference features help in Full-reference image quality estimation?
by Saikat Dutta et al

03-01-2022

Separable-HoverNet and Instance-YOLO for Colon Nuclei Identification and Counting
by Chunhui Lin et al

03-03-2022

Instance Segmentation for Autonomous Log Grasping in Forestry Operations
by Jean-Michel Fortin et al

03-03-2022

Cross-Modality Earth Movers Distance for Visible Thermal Person Re-Identification
by Yongguo Ling et al

03-03-2022

DenseUNets with feedback non-local attention for the segmentation of specular microscopy images of the corneal endothelium with Fuchs dystrophy
by Juan P. Vigueras-Guillén et al

03-03-2022

Rethinking the role of normalization and residual blocks for spiking neural networks
by Shin-ichi Ikegawa et al

03-03-2022

Self-supervised Transparent Liquid Segmentation for Robotic Pouring
by Gautham Narayan Narasimhan et al

03-01-2022

A unified 3D framework for Organs at Risk Localization and Segmentation for Radiation Therapy Planning
by Fernando Navarro et al

03-02-2022

ADVISE: ADaptive Feature Relevance and VISual Explanations for Convolutional Neural Networks
by Mohammad Mahdi Dehshibi et al

03-03-2022

LatentFormer: Multi-Agent Transformer-Based Interaction Modeling and Trajectory Prediction
by Elmira Amirloo et al

03-02-2022

VAE-iForest: Auto-encoding Reconstruction and Isolation-based Anomalies Detecting Fallen Objects on Road Surface
by Takato Yasuno et al

03-03-2022

LGT-Net: Indoor Panoramic Room Layout Estimation with Geometry-Aware Transformer Network
by Zhigang Jiang et al

03-03-2022

Adaptive Path Planning for UAVs for Multi-Resolution Semantic Segmentation
by Felix Stache et al

03-03-2022

Modality-Adaptive Mixup and Invariant Decomposition for RGB-Infrared Person Re-Identification
by Zhipeng Huang et al

03-01-2022

SwitchHit: A Probabilistic, Complementarity-Based Switching System for Improved Visual Place Recognition in Changing Environments
by Maria Waheed et al

03-03-2022

Adaptive Local-Global Relational Network for Facial Action Units Recognition and Facial Paralysis Estimation
by Xuri Ge et al

03-01-2022

Towards deep learning-powered IVF: A large public benchmark for morphokinetic parameter prediction
by Tristan Gomez et al

03-02-2022

H4D: Human 4D Modeling by Learning Neural Compositional Representation
by Boyan Jiang et al

03-02-2022

LILE: Look In-Depth before Looking Elsewhere -- A Dual Attention Network using Transformers for Cross-Modal Information Retrieval in Histopathology Archives
by Danial Maleki et al

03-03-2022

Translational Lung Imaging Analysis Through Disentangled Representations
by Pedro M. Gordaliza et al

03-01-2022

Image analysis for automatic measurement of crustose lichens
by Pedro Guedes et al

03-03-2022

NUQ: A Noise Metric for Diffusion MRI via Uncertainty Discrepancy Quantification
by Shreyas Fadnavis et al

03-03-2022

CenterSnap: Single-Shot Multi-Object 3D Shape Reconstruction and Categorical 6D Pose and Size Estimation
by Muhammad Zubair Irshad et al

03-01-2022

When A Conventional Filter Meets Deep Learning: Basis Composition Learning on Image Filters
by Fu Lee Wang et al

03-01-2022

Omni-frequency Channel-selection Representations for Unsupervised Anomaly Detection
by Yufei Liang et al

03-03-2022

Revisiting Click-based Interactive Video Object Segmentation
by Stephane Vujasinovic et al

03-02-2022

OVE6D: Object Viewpoint Encoding for Depth-based 6D Object Pose Estimation
by Dingding Cai et al

03-01-2022

Stable, accurate and efficient deep neural networks for inverse problems with analysis-sparse models
by Maksym Neyra-Nesterenko et al

03-03-2022

An Efficient Subpopulation-based Membership Inference Attack
by Shahbaz Rezaei et al

03-04-2022

Carbon Footprint of Selecting and Training Deep Learning Models for Medical Image Analysis
by Raghavendra Selvan et al

03-03-2022

Ad2Attack: Adaptive Adversarial Attack on Real-Time UAV Tracking
by Changhong Fu et al

03-04-2022

Uncertainty Estimation for Heatmap-based Landmark Localization
by Lawrence Schobs et al

03-03-2022

A study on the distribution of social biases in self-supervised learning visual models
by Kirill Sirotkin et al

03-01-2022

Towards a unified view of unsupervised non-local methods for image denoising: the NL-Ridge approach
by Sébastien Herbreteau et al

03-02-2022

A Simple and Universal Rotation Equivariant Point-cloud Network
by Ben Finkelshtein et al

03-01-2022

MixSTE: Seq2seq Mixed Spatio-Temporal Encoder for 3D Human Pose Estimation in Video
by Jinlu Zhang et al

03-01-2022

Bridge the Gap between Supervised and Unsupervised Learning for Fine-Grained Classification
by Jiabao Wang et al

03-02-2022

Video Question Answering: Datasets, Algorithms and Challenges
by Yaoyao Zhong et al

03-03-2022

Why adversarial training can hurt robust accuracy
by Jacob Clarysse et al

03-02-2022

Shape constrained CNN for segmentation guided prediction of myocardial shape and pose parameters in cardiac MRI
by Sofie Tilborghs et al

03-03-2022

CAFE: Learning to Condense Dataset by Aligning Features
by Kai Wang et al

03-03-2022

Exploring Patch-wise Semantic Relation for Contrastive Learning in Image-to-Image Translation Tasks
by Chanyong Jung et al

03-01-2022

X-Trans2Cap: Cross-Modal Knowledge Transfer using Transformer for 3D Dense Captioning
by Zhihao Yuan et al

03-03-2022

Curriculum-style Local-to-global Adaptation for Cross-domain Remote Sensing Image Segmentation
by Bo Zhang et al

03-02-2022

Improving Lidar-Based Semantic Segmentation of Top-View Grid Maps by Learning Features in Complementary Representations
by Frank Bieder et al

03-02-2022

Hybrid Tracker with Pixel and Instance for Video Panoptic Segmentation
by Weicai Ye et al

03-03-2022

Weakly Supervised Object Localization as Domain Adaption
by Lei Zhu et al

03-03-2022

Region-of-Interest Based Neural Video Compression
by Yura Perugachi-Diaz et al

03-02-2022

Asynchronous Optimisation for Event-based Visual Odometry
by Daqi Liu et al

03-01-2022

Boundary Corrected Multi-scale Fusion Network for Real-time Semantic Segmentation
by Tianjiao Jiang et al

03-02-2022

A Generalized Approach for Cancellable Template and Its Realization for Minutia Cylinder-Code
by Xingbo Dong et al

03-01-2022

Robust Seatbelt Detection and Usage Recognition for Driver Monitoring Systems
by Feng Hu

03-02-2022

Detecting Adversarial Perturbations in Multi-Task Perception
by Marvin Klingner et al

03-02-2022

GRASP EARTH: Intuitive Software for Discovering Changes on the Planet
by Waku Hatakeyama et al

03-02-2022

Learning Moving-Object Tracking with FMCW LiDAR
by Yi Gu et al

03-01-2022

ProgressLabeller: Visual Data Stream Annotation for Training Object-Centric 3D Perception
by Xiaotong Chen et al

03-03-2022

SegTAD: Precise Temporal Action Detection via Semantic Segmentation
by Chen Zhao et al

03-01-2022

Full RGB Just Noticeable Difference (JND) Modelling
by Jian Jin et al

03-02-2022

Conditional Reconstruction for Open-set Semantic Segmentation
by Ian Nunes et al

03-01-2022

Exploring Wilderness Using Explainable Machine Learning in Satellite Imagery
by Timo T. Stomberg et al

03-01-2022

Descriptellation: Deep Learned Constellation Descriptors for SLAM
by Chunwei Xing et al

03-02-2022

Self-Supervised Scene Flow Estimation with 4D Automotive Radar
by Fangqiang Ding et al

03-04-2022

Safety-aware metrics for object detectors in autonomous driving
by Andrea Ceccarelli et al

03-01-2022

Beam-Shape Effects and Noise Removal from THz Time-Domain Images in Reflection Geometry in the 0.25-6 THz Range
by Marina Ljubenovic et al

03-03-2022

Beyond 3D Siamese Tracking: A Motion-Centric Paradigm for 3D Single Object Tracking in Point Clouds
by Chaoda Zheng et al

03-03-2022

Computer Vision Aided Blockage Prediction in Real-World Millimeter Wave Deployments
by Gouranga Charan et al

03-01-2022

OmniFusion: 360 Monocular Depth Estimation via Geometry-Aware Fusion
by Yuyan Li et al

03-01-2022

GSC Loss: A Gaussian Score Calibrating Loss for Deep Learning
by Qingsong Zhao et al

03-01-2022

Instance-aware multi-object self-supervision for monocular depth prediction
by Houssem eddine Boulahbal et al

03-02-2022

Exploring Hierarchical Graph Representation for Large-Scale Zero-Shot Image Classification
by Kai Yi et al

03-01-2022

Tempera: Spatial Transformer Feature Pyramid Network for Cardiac MRI Segmentation
by Christoforos Galazis et al

03-01-2022

3DCTN: 3D Convolution-Transformer Network for Point Cloud Classification
by Dening Lu et al

03-04-2022

Do Explanations Explain? Model Knows Best
by Ashkan Khakzar et al

03-02-2022

Self-Supervised Learning for Real-World Super-Resolution from Dual Zoomed Observations
by Zhilu Zhang et al

03-01-2022

Clean-Annotation Backdoor Attack against Lane Detection Systems in the Wild
by Xingshuo Han et al

03-03-2022

Intensity Image-based LiDAR Fiducial Marker System
by Yibo Liu et al

03-02-2022

A Unified Query-based Paradigm for Point Cloud Understanding
by Zetong Yang et al

03-02-2022

Continual BatchNorm Adaptation (CBNA) for Semantic Segmentation
by Marvin Klingner et al

03-03-2022

Constrained unsupervised anomaly segmentation
by Julio Silva-Rodríguez et al

03-02-2022

CycleMix: A Holistic Strategy for Medical Image Segmentation from Scribble Supervision
by Ke Zhang et al

03-03-2022

Robustness and Adaptation to Hidden Factors of Variation
by William Paul et al

03-02-2022

Self-supervised Transformer for Deepfake Detection
by Hanqing Zhao et al

03-02-2022

Image-based material analysis of ancient historical documents
by Thomas Reynolds et al

03-03-2022

Bridging the Source-to-target Gap for Cross-domain Person Re-Identification with Intermediate Domains
by Yongxing Dai et al

03-03-2022

Relative distance matters for one-shot landmark detection
by Qingsong Yao et al

03-02-2022

3D Common Corruptions and Data Augmentation
by Oğuzhan Fatih Kar et al

03-03-2022

Correlation-Aware Deep Tracking
by Fei Xie et al

03-03-2022

WPNAS: Neural Architecture Search by jointly using Weight Sharing and Predictor
by Ke Lin et al

03-01-2022

Adversarial samples for deep monocular 6D object pose estimation
by Jinlai Zhang et al

03-01-2022

Comprehensive Analysis of the Object Detection Pipeline on UAVs
by Leon Amadeus Varga et al

03-02-2022

Unsupervised Anomaly Detection from Time-of-Flight Depth Images
by Pascal Schneider et al

03-01-2022

Dense Voxel Fusion for 3D Object Detection
by Anas Mahmoud et al

03-02-2022

A Split Semantic Detection Algorithm for Psychological Sandplay Image
by Xiaokun Feng et al

03-02-2022

Visual Feature Encoding for GNNs on Road Networks
by Oliver Stromann et al

03-02-2022

Fast and Robust Ground Surface Estimation from LIDAR Measurements using Uniform B-Splines
by Sascha Wirges et al

03-01-2022

Temporal Perceiver: A General Architecture for Arbitrary Boundary Detection
by Jing Tan et al

03-01-2022

JOINED : Prior Guided Multi-task Learning for Joint Optic Disc/Cup Segmentation and Fovea Detection
by Huaqing He et al

03-02-2022

Container Localisation and Mass Estimation with an RGB-D Camera
by Tommaso Apicella et al

03-01-2022

Incremental Transformer Structure Enhanced Image Inpainting with Masking Positional Encoding
by Qiaole Dong et al

03-01-2022

SEA: Bridging the Gap Between One- and Two-stage Detector Distillation via SEmantic-aware Alignment
by Yixin Chen et al

03-01-2022

Hybrid Optimized Deep Convolution Neural Network based Learning Model for Object Detection
by Venkata Beri

03-02-2022

Improving Generalization of Deep Networks for Estimating Physical Properties of Containers and Fillings
by Hengyi Wang et al

03-02-2022

Towards Bidirectional Arbitrary Image Rescaling: Joint Optimization and Cycle Idempotence
by Zhihong Pan et al

03-02-2022

A Principled Design of Image Representation: Towards Forensic Tasks
by Shuren Qi et al

03-02-2022

PUFA-GAN: A Frequency-Aware Generative Adversarial Network for 3D Point Cloud Upsampling
by Hao Liu et al

03-03-2022

FairPrune: Achieving Fairness Through Pruning for Dermatological Disease Diagnosis
by Yawen Wu et al

03-03-2022

STUN: Self-Teaching Uncertainty Estimation for Place Recognition
by Kaiwen Cai et al

03-01-2022

FP-Loc: Lightweight and Drift-free Floor Plan-assisted LiDAR Localization
by Ling Gao et al

03-01-2022

Efficient Globally-Optimal Correspondence-Less Visual Odometry for Planar Ground Vehicles
by Ling Gao et al

03-02-2022

NeW CRFs: Neural Window Fully-connected CRFs for Monocular Depth Estimation
by Weihao Yuan et al

03-02-2022

SoftGroup for 3D Instance Segmentation on Point Clouds
by Thang Vu et al

03-02-2022

Translation Invariant Global Estimation of Heading Angle Using Sinogram of LiDAR Point Cloud
by Xiaqing Ding et al

03-04-2022

MF-Hovernet: An Extension of Hovernet for Colon Nuclei Identification and Counting (CoNiC) Challenge
by Vi Thi-Tuong Vo et al

03-03-2022

Debiased Batch Normalization via Gaussian Process for Generalizable Person Re-Identification
by Jiawei Liu et al

03-04-2022

Rethinking Efficient Lane Detection via Curve Modeling
by Zhengyang Feng et al

03-02-2022

Vision-based Large-scale 3D Semantic Mapping for Autonomous Driving Applications
by Qing Cheng et al

03-04-2022

Rethinking Reconstruction Autoencoder-Based Out-of-Distribution Detection
by Yibo Zhou

03-03-2022

Addressing the Shape-Radiance Ambiguity in View-Dependent Radiance Fields
by Sverker Rasmuson et al

03-01-2022

Unified Physical Threat Monitoring System Aided by Virtual Building Simulation
by Zenjie Li et al

03-02-2022

CD-GAN: a robust fusion-based generative adversarial network for unsupervised change detection between heterogeneous images
by Jin-Ju Wang et al

03-04-2022

HDNet: High-resolution Dual-domain Learning for Spectral Compressive Imaging
by Xiaowan Hu et al

03-01-2022

Runtime Detection of Executional Errors in Robot-Assisted Surgery
by Zongyu Li et al

03-03-2022

Self-Supervised Ego-Motion Estimation Based on Multi-Layer Fusion of RGB and Inferred Depth
by Zijie Jiang et al

03-02-2022

iMVS: Improving MVS Networks by Learning Depth Discontinuities
by Nail Ibrahimli et al

03-02-2022

Sketched RT3D: How to reconstruct billions of photons per second
by Julián Tachella et al

03-04-2022

Mobile authentication of copy detection patterns
by Olga Taran et al

03-03-2022

Syntax-Aware Network for Handwritten Mathematical Expression Recognition
by Ye Yuan et al

03-01-2022

Motion-aware Dynamic Graph Neural Network for Video Compressive Sensing
by Ruiying Lu et al

03-04-2022

Transformations in Learned Image Compression from a Communication Perspective
by Youneng Bao et al

03-02-2022

Aggregated Pyramid Vision Transformer: Split-transform-merge Strategy for Image Recognition without Convolutions
by Rui-Yang Ju et al

03-02-2022

Class Re-Activation Maps for Weakly-Supervised Semantic Segmentation
by Zhaozheng Chen et al

03-04-2022

Contextformer: A Transformer with Spatio-Channel Attention for Context Modeling in Learned Image Compression
by A. Burakhan Koyuncu et al

03-03-2022

Audio-Visual Object Classification for Human-Robot Collaboration
by A. Xompero et al

03-03-2022

Occlusion-Aware Cost Constructor for Light Field Depth Estimation
by Yingqian Wang et al

03-02-2022

Improving Point Cloud Based Place Recognition with Ranking-based Loss and Large Batch Training
by Jacek Komorowski

03-03-2022

3D Human Motion Prediction: A Survey
by Kedi Lyu et al

03-04-2022

Pedestrian Stop and Go Forecasting with Hybrid Feature Fusion
by Dongxu Guo et al

03-04-2022

The Familiarity Hypothesis: Explaining the Behavior of Deep Open Set Methods
by Thomas G. Dietterich et al

03-03-2022

ViTransPAD: Video Transformer using convolution and self-attention for Face Presentation Attack Detection
by Zuheng Ming et al

03-03-2022

3D endoscopic depth estimation using 3D surface-aware constraints
by Shang Zhao et al

03-01-2022

Tricks and Plugins to GBM on Images and Sequences
by Biyi Fang et al

03-02-2022

Half Wavelet Attention on M-Net+ for Low-Light Image Enhancement
by Chi-Mao Fan et al

03-03-2022

Towards Universal Backward-Compatible Representation Learning
by Binjie Zhang et al

03-03-2022

Multi-Tailed Vision Transformer for Efficient Inference
by Yunke Wang et al

03-03-2022

Learning Incrementally to Segment Multiple Organs in a CT Image
by Pengbo Liu et al

03-04-2022

Behavioural Curves Analysis Using Near-Infrared-Iris Image Sequences
by L. Causa et al

03-01-2022

Low-Cost On-device Partial Domain Adaptation (LoCO-PDA): Enabling efficient CNN retraining on edge devices
by Aditya Rajagopal et al

03-03-2022

Learning Category-Level Generalizable Object Manipulation Policy via Generative Adversarial Self-Imitation Learning from Demonstrations
by Hao Shen et al

03-04-2022

Mixed Reality Depth Contour Occlusion Using Binocular Similarity Matching and Three-dimensional Contour Optimisation
by Naye Ji et al

03-04-2022

Quantum Levenberg--Marquardt Algorithm for optimization in Bundle Adjustment
by Luca Bernecker et al

03-02-2022

Spatial-Temporal Gating-Adjacency GCN for Human Motion Prediction
by Chongyang Zhong et al

03-02-2022

ParaPose: Parameter and Domain Randomization Optimization for Pose Estimation using Synthetic Data
by Frederik Hagelskjaer et al

03-04-2022

Differentiable Control Barrier Functions for Vision-based End-to-End Autonomous Driving
by Wei Xiao et al

03-04-2022

Detecting GAN-generated Images by Orthogonal Training of Multiple CNNs
by Sara Mandelli et al

03-02-2022

DN-DETR: Accelerate DETR Training by Introducing Query DeNoising
by Feng Li et al

03-04-2022

AutoMO-Mixer: An automated multi-objective Mixer model for balanced, safe and robust prediction in medicine
by Xi Chen et al

03-02-2022

3D object reconstruction and 6D-pose estimation from 2D shape for robotic grasping of objects
by Marcell Wolnitza et al

03-03-2022

Polarity Sampling: Quality and Diversity Control of Pre-Trained Generative Networks via Singular Values
by Ahmed Imtiaz Humayun et al

03-03-2022

Counting Molecules: Python based scheme for automated enumeration and categorization of molecules in scanning tunneling microscopy images
by Jack Hellerstedt et al

03-02-2022

Exploring Smoothness and Class-Separation for Semi-supervised Medical Image Segmentation
by Yicheng Wu et al

03-03-2022

HOI4D: A 4D Egocentric Dataset for Category-Level Human-Object Interaction
by Yunze Liu et al

03-02-2022

Colar: Effective and Efficient Online Action Detection by Consulting Exemplars
by Le Yang et al

03-02-2022

MUAD: Multiple Uncertainties for Autonomous Driving benchmark for multiple uncertainty types and tasks
by Gianni Franchi et al

03-02-2022

Parameterized Image Quality Score Distribution Prediction
by Yixuan Gao et al

03-02-2022

TransDARC: Transformer-based Driver Activity Recognition with Latent Space Feature Calibration
by Kunyu Peng et al

03-04-2022

ACVNet: Attention Concatenation Volume for Accurate and Efficient Stereo Matching
by Gangwei Xu et al

03-01-2022

Knock, knock. Whos there? -- Identifying football player jersey numbers with synthetic data
by Divya Bhargavi et al

03-01-2022

3D Skeleton-based Human Motion Prediction with Manifold-Aware GAN
by Baptiste Chopin et al

03-02-2022

Bending Reality: Distortion-aware Transformers for Adapting to Panoramic Semantic Segmentation
by Jiaming Zhang et al

03-04-2022

Voice-Face Homogeneity Tells Deepfake
by Harry Cheng et al

03-04-2022

Time-to-Label: Temporal Consistency for Self-Supervised Monocular 3D Object Detection
by Issa Mouawad et al

03-04-2022

Semi-parametric Makeup Transfer via Semantic-aware Correspondence
by Mingrui Zhu et al

03-04-2022

Semantic-Aware Representation Blending for Multi-Label Image Recognition with Partial Labels
by Tao Pu et al

03-04-2022

HyperTransformer: A Textural and Spectral Feature Fusion Transformer for Pansharpening
by Wele Gedara Chaminda Bandara et al

03-01-2022

Effect of Timing Error: A Case Study of Navigation Camera
by Sandeep S. Kulkarni et al

03-03-2022

Universal Segmentation of 33 Anatomies
by Pengbo Liu et al

03-04-2022

Simultaneous Alignment and Surface Regression Using Hybrid 2D-3D Networks for 3D Coherent Layer Segmentation of Retina OCT Images
by Hong Liu et al

03-04-2022

DetFlowTrack: 3D Multi-object Tracking based on Simultaneous Optimization of Object Detection and Scene Flow Estimation
by Yueling Shen et al

03-04-2022

PatchMVSNet: Patch-wise Unsupervised Multi-View Stereo for Weakly-Textured Surface Reconstruction
by Haonan Dong et al

03-04-2022

OPAL: Occlusion Pattern Aware Loss for Unsupervised Light Field Disparity Estimation
by Peng Li et al

03-03-2022

Towards Rich, Portable, and Large-Scale Pedestrian Data Collection
by Allan Wang et al

03-04-2022

Didnt see that coming: a survey on non-verbal social human behavior forecasting
by German Barquero et al

03-03-2022

A Versatile Multi-View Framework for LiDAR-based 3D Object Detection with Guidance from Panoptic Segmentation
by Hamidreza Fazlali et al

03-04-2022

Partial Wasserstein Adversarial Network for Non-rigid Point Set Registration
by Zi-Ming Wang et al

03-03-2022

Robust Segmentation of Brain MRI in the Wild with Hierarchical CNNs and no Retraining
by Benjamin Billot et al

03-04-2022

Real-Time Hybrid Mapping of Populated Indoor Scenes using a Low-Cost Monocular UAV
by Stuart Golodetz et al

03-02-2022

What Makes Transfer Learning Work For Medical Images: Feature Reuse & Other Factors
by Christos Matsoukas et al

03-03-2022

FS-COCO: Towards Understanding of Freehand Sketches of Common Objects in Context
by Pinaki Nath Chowdhury et al

03-03-2022

Pseudo-Stereo for Monocular 3D Object Detection in Autonomous Driving
by Yi-Nan Chen et al

03-03-2022

Color Space-based HoVer-Net for Nuclei Instance Segmentation and Classification
by Hussam Azzuni et al

03-04-2022

Computer-Aided Road Inspection: Systems and Algorithms
by Rui Fan et al

03-04-2022

Feature Transformation for Cross-domain Few-shot Remote Sensing Scene Classification
by Qiaoling Chen et al

03-02-2022

Quality or Quantity: Toward a Unified Approach for Multi-organ Segmentation in Body CT
by Fakrul Islam Tushar et al

03-01-2022

There is a Time and Place for Reasoning Beyond the Image
by Xingyu Fu et al

03-03-2022

Semantic-guided Image Virtual Attribute Learning for Noisy Multi-label Chest X-ray Classification
by Yuanhong Chen et al

03-04-2022

Patch Similarity Aware Data-Free Quantization for Vision Transformers
by Zhikai Li et al

03-03-2022

Scribble-Supervised Medical Image Segmentation via Dual-Branch Network and Dynamically Mixed Pseudo Labels Supervision
by Xiangde Luo et al

03-02-2022

Object Pose Estimation using Mid-level Visual Representations
by Negar Nejatishahidin et al

03-04-2022

F2DNet: Fast Focal Detection Network for Pedestrian Detection
by Abdul Hannan Khan et al

03-04-2022

Class-Aware Contrastive Semi-Supervised Learning
by Fan Yang et al

03-04-2022

Characterizing Renal Structures with 3D Block Aggregate Transformers
by Xin Yu et al

03-03-2022

Fast Neural Architecture Search for Lightweight Dense Prediction Networks
by Lam Huynh et al

03-03-2022

Towards Benchmarking and Evaluating Deepfake Detection
by Chenhao Lin et al

03-03-2022

MixCL: Pixel label matters to contrastive learning
by Jun Li et al

03-04-2022

SFPN: Synthetic FPN for Object Detection
by Yu-Ming Zhang et al

03-04-2022

ViT-P: Rethinking Data-efficient Vision Transformers from Locality
by Bin Chen et al

03-04-2022

Convolutional Analysis Operator Learning by End-To-End Training of Iterative Neural Networks
by Andreas Kofler et al

03-03-2022

A Comprehensive Review of Computer Vision in Sports: Open Issues, Future Trends and Research Directions
by Banoth Thulasya Naik et al

03-02-2022

Nuclei segmentation and classification in histopathology images with StarDist for the CoNIC Challenge 2022
by Martin Weigert et al

03-03-2022

A multi-stream convolutional neural network for classification of progressive MCI in Alzheimers disease using structural MRI images
by Mona Ashtari-Majlan et al

03-02-2022

Temporal Context Matters: Enhancing Single Image Prediction with Disease Progression Representations
by Aishik Konwer et al

03-02-2022

Contextual Attention Network: Transformer Meets U-Net
by Azad Reza et al

03-03-2022

Anomaly Detection-Inspired Few-Shot Medical Image Segmentation Through Self-Supervision With Supervoxels
by Stine Hansen et al

03-02-2022

E-CIR: Event-Enhanced Continuous Intensity Recovery
by Chen Song et al

03-03-2022

Sim2Real Instance-Level Style Transfer for 6D Pose Estimation
by Takuya Ikeda et al

03-01-2022

Deep Temporal Interpolation of Radar-based Precipitation
by Michiaki Tatsubori et al

 
Craig Smith