2022.1.10 Vision papers

 

01-07-2022

NeROIC: Neural Rendering of Objects from Online Image Collections
by Zhengfei Kuang et al

01-05-2022

Robust Self-Supervised Audio-Visual Speech Recognition
by Bowen Shi et al

01-07-2022

Generalized Category Discovery
by Sagar Vaze et al

01-04-2022

Sound and Visual Representation Learning with Multiple Pretraining Tasks
by Arun Balajee Vasudevan et al

01-06-2022

A Light in the Dark: Deep Learning Practices for Industrial Computer Vision
by Maximilian Harl et al

01-06-2022

De-rendering 3D Objects in the Wild
by Felix Wimbauer et al

01-05-2022

All You Need In Sign Language Production
by Razieh Rastgoo et al

01-06-2022

Consistent Style Transfer
by Xuan Luo et al

01-06-2022

Self-Training Vision Language BERTs with a Unified Conditional Model
by Xiaofeng Yang et al

01-07-2022

Detecting Twenty-thousand Classes using Image-level Supervision
by Xingyi Zhou et al

01-04-2022

Corrupting Data to Remove Deceptive Perturbation: Using Preprocessing Method to Improve System Robustness
by Hieu Le et al

01-06-2022

Compact Bidirectional Transformer for Image Captioning
by Yuanen Zhou et al

01-05-2022

Incremental Object Grounding Using Scene Graphs
by John Seon Keun Yi et al

01-06-2022

Bio-inspired Min-Nets Improve the Performance and Robustness of Deep Networks
by Philipp Grüning et al

01-06-2022

ASL-Skeleton3D and ASL-Phono: Two Novel Datasets for the American Sign Language
by Cleison Correia de Amorim et al

01-05-2022

DeepMLS: Geometry-Aware Control Point Deformation
by Meitar Shechter et al

01-05-2022

Contrastive Neighborhood Alignment
by Pengkai Zhu et al

01-05-2022

Quantum Capsule Networks
by Zidu Liu et al

01-05-2022

Debiased Learning from Naturally Imbalanced Pseudo-Labels for Zero-Shot and Semi-Supervised Learning
by Xudong Wang et al

01-05-2022

Formal Analysis of Art: Proxy Learning of Visual Concepts from Style Through Language Models
by Diana Kim et al

01-06-2022

Aerial Scene Parsing: From Tile-level Scene Classification to Pixel-wise Semantic Labeling
by Yang Long et al

01-05-2022

Lawin Transformer: Improving Semantic Segmentation Transformer with Multi-Scale Representations via Large Window Attention
by Haotian Yan et al

01-04-2022

Deep Learning-Based Sparse Whole-Slide Image Analysis for the Diagnosis of Gastric Intestinal Metaplasia
by Jon Braatz et al

01-06-2022

Egocentric Deep Multi-Channel Audio-Visual Active Speaker Localization
by Hao Jiang et al

01-05-2022

Tackling the Class Imbalance Problem of Deep Learning Based Head and Neck Organ Segmentation
by Elias Tappeiner et al

01-05-2022

Towards realistic symmetry-based completion of previously unseen point clouds
by Taras Rumezhak et al

01-04-2022

Self-supervised Learning from 100 Million Medical Images
by Florin C. Ghesu et al

01-06-2022

An Abstraction-Refinement Approach to Verifying Convolutional Neural Networks
by Matan Ostrovsky et al

01-06-2022

HyperionSolarNet: Solar Panel Detection from Aerial Images
by Poonam Parhar et al

01-04-2022

Variational Stacked Local Attention Networks for Diverse Video Captioning
by Tonmoay Deb et al

01-04-2022

Weakly-supervised continual learning for class-incremental segmentation
by Gaston Lenczner et al

01-05-2022

Challenges of Artificial Intelligence -- From Machine Learning and Computer Vision to Emotional Intelligence
by Matti Pietikäinen et al

01-05-2022

Exemplar-free Class Incremental Learning via Discriminative and Comparable One-class Classifiers
by Wenju Sun et al

01-07-2022

Video Summarization Based on Video-text Representation
by Li Haopeng et al

01-05-2022

Cross-SRN: Structure-Preserving Super-Resolution Network with Cross Convolution
by Yuqing Liu et al

01-05-2022

Revisiting Deep Subspace Alignment for Unsupervised Domain Adaptation
by Kowshik Thopalli et al

01-07-2022

Bayesian Neural Networks for Reversible Steganography
by Ching-Chun Chang

01-05-2022

POCO: Point Convolution for Surface Reconstruction
by Alexandre Boulch et al

01-07-2022

Negative Evidence Matters in Interpretable Histology Image Classification
by Soufiane Belharbi et al

01-06-2022

Deep Learning Based Classification System For Recognizing Local Spinach
by Mirajul Islam et al

01-06-2022

Multi-Label Classification on Remote-Sensing Images
by Aditya Kumar Singh et al

01-06-2022

Decompose to Adapt: Cross-domain Object Detection via Feature Disentanglement
by Dongnan Liu et al

01-04-2022

The cluster structure function
by Andrew R. Cohen et al

01-04-2022

Aligning Domain-specific Distribution and Classifier for Cross-domain Classification from Multiple Sources
by Yongchun Zhu et al

01-04-2022

Attention Mechanism Meets with Hybrid Dense Network for Hyperspectral Image Classification
by Muhammad Ahmad et al

01-04-2022

Self-Supervised Approach to Addressing Zero-Shot Learning Problem
by Ademola Okerinde et al

01-04-2022

MoCoPnet: Exploring Local Motion and Contrast Priors for Infrared Small Target Super-Resolution
by Xinyi Ying et al

01-06-2022

Repurposing Existing Deep Networks for Caption and Aesthetic-Guided Image Cropping
by Nora Horanyi et al

01-05-2022

Multiple Sclerosis Lesions Segmentation using Attention-Based CNNs in FLAIR Images
by Mehdi SadeghiBakhi et al

01-04-2022

Eye Know You Too: A DenseNet Architecture for End-to-end Biometric Authentication via Eye Movements
by Dillon Lohr et al

01-05-2022

Towards Uniform Point Distribution in Feature-preserving Point Cloud Filtering
by Shuaijun Chen et al

01-06-2022

Enhancing Egocentric 3D Pose Estimation with Third Person Views
by Ameya Dhamanaskar et al

01-06-2022

Multi-Domain Joint Training for Person Re-Identification
by Lu Yang et al

01-07-2022

Sign Language Video Retrieval with Free-Form Textual Queries
by Amanda Duarte et al

01-05-2022

Learning True Rate-Distortion-Optimization for End-To-End Image Compression
by Fabian Brand et al

01-05-2022

Sign Language Recognition System using TensorFlow Object Detection API
by Sharvani Srivastava et al

01-04-2022

Advancing 3D Medical Image Analysis with Variable Dimension Transform based Supervised 3D Pre-training
by Shu Zhang et al

01-05-2022

Lumbar Bone Mineral Density Estimation from Chest X-ray Images: Anatomy-aware Attentive Multi-ROI Modeling
by Fakai Wang et al

01-04-2022

Multi-Representation Adaptation Network for Cross-domain Image Classification
by Yongchun Zhu et al

01-05-2022

The Effect of Model Compression on Fairness in Facial Expression Recognition
by Samuil Stoychev et al

01-05-2022

On the Real-World Adversarial Robustness of Real-Time Semantic Segmentation Models for Autonomous Driving
by Giulio Rossolini et al

01-04-2022

DIAL: Deep Interactive and Active Learning for Semantic Segmentation in Remote Sensing
by Gaston Lenczner et al

01-06-2022

Budget-aware Few-shot Learning via Graph Convolutional Network
by Shipeng Yan et al

01-05-2022

Deep Probabilistic Graph Matching
by He Liu et al

01-05-2022

Rethinking Depth Estimation for Multi-View Stereo: A Unified Representation and Focal Loss
by Rui Peng et al

01-04-2022

Swin UNETR: Swin Transformers for Semantic Segmentation of Brain Tumors in MRI Images
by Ali Hatamizadeh et al

01-04-2022

Towards Understanding and Harnessing the Effect of Image Transformation in Adversarial Detection
by Hui Liu et al

01-04-2022

Identifying the exterior image of buildings on a 3D map and extracting elevation information using deep learning and digital image processing
by Donghwa Shon et al

01-05-2022

Memory-guided Image De-raining Using Time-Lapse Data
by Jaehoon Cho et al

01-04-2022

Problem-dependent attention and effort in neural networks with an application to image resolution
by Chris Rohlfs

01-04-2022

Latent Vector Expansion using Autoencoder for Anomaly Detection
by UJu Gim et al

01-04-2022

Learning to Generate Novel Classes for Deep Metric Learning
by Kyungmoon Lee et al

01-07-2022

A Review of Deep Learning Techniques for Markerless Human Motion on Synthetic Datasets
by Doan Duy Vo et al

01-04-2022

Towards Unsupervised Open World Semantic Segmentation
by Svenja Uhlemeyer et al

01-05-2022

Automated Scoring of Graphical Open-Ended Responses Using Artificial Neural Networks
by Matthias von Davier et al

01-05-2022

Biphasic Face Photo-Sketch Synthesis via Semantic-Driven Generative Adversarial Network with Graph Representation Learning
by Xingqun Qi et al

01-04-2022

Linear Variational State Space Filtering
by Daniel Pfrommer et al

01-06-2022

A three-dimensional dual-domain deep network for high-pitch and sparse helical CT reconstruction
by Wei Wang et al

01-06-2022

An unambiguous cloudiness index for nonwovens
by Michael Godehardt et al

01-05-2022

An Investigation of Benford's Law Divergence and Machine Learning Techniques for Intra-Class Separability of Fingerprint Images
by Aamo Iorliam et al

01-06-2022

TransVPR: Transformer-based place recognition with multi-level attention aggregation
by Ruotong Wang et al

01-06-2022

SASA: Semantics-Augmented Set Abstraction for Point-based 3D Object Detection
by Chen Chen et al

01-05-2022

Culture-to-Culture Image Translation with Generative Adversarial Networks
by Giulia Zaino et al

01-07-2022

Deep Domain Adversarial Adaptation for Photon-efficient Imaging Based on Spatiotemporal Inception Network
by Yiwei Chen et al

01-05-2022

Improving Object Detection, Multi-object Tracking, and Re-Identification for Disaster Response Drones
by Chongkeun Paik et al

01-07-2022

Cross-Modality Deep Feature Learning for Brain Tumor Segmentation
by Dingwen Zhang et al

01-05-2022

Multi-Robot Collaborative Perception with Graph Neural Networks
by Yang Zhou et al

01-04-2022

Detailed Facial Geometry Recovery from Multi-view Images by Learning an Implicit Function
by Yunze Xiao et al

01-05-2022

FAVER: Blind Quality Prediction of Variable Frame Rate Videos
by Qi Zheng et al

01-05-2022

Evaluation of Thermal Imaging on Embedded GPU Platforms for Application in Vehicular Assistance Systems
by Muhammad Ali Farooq et al

01-04-2022

Neural KEM: A Kernel Method with Deep Coefficient Prior for PET Image Reconstruction
by Siqi Li et al

01-05-2022

Robust photon-efficient imaging using a pixel-wise residual shrinkage network
by Gongxin Yao et al

01-04-2022

A Robust Visual Sampling Model Inspired by Receptive Field
by Liwen Hu et al

01-04-2022

What Hinders Perceptual Quality of PSNR-oriented Methods?
by Tianshuo Xu et al

01-05-2022

Learning Audio-Visual Speech Representation by Masked Multimodal Cluster Prediction
by Bowen Shi et al

01-04-2022

3DVSR: 3D EPI Volume-based Approach for Angular and Spatial Light field Image Super-resolution
by Trung-Hieu Tran et al

01-04-2022

A Transformer-Based Siamese Network for Change Detection
by Wele Gedara Chaminda Bandara et al

01-07-2022

Effect of Prior-based Losses on Segmentation Performance: A Benchmark
by Rosana El Jurdi et al

01-04-2022

Fusing Convolutional Neural Network and Geometric Constraint for Image-based Indoor Localization
by Jingwei Song et al

01-04-2022

Learning Quality-aware Representation for Multi-person Pose Regression
by Yabo Xiao et al

01-05-2022

Flow-Guided Sparse Transformer for Video Deblurring
by Jing Lin et al

01-04-2022

Image Processing Methods for Coronal Hole Segmentation, Matching, and Map Classification
by V. Jatla et al

01-04-2022

Synthesizing Tensor Transformations for Visual Self-attention
by Xian Wei et al

01-04-2022

Attention-based Dual Supervised Decoder for RGBD Semantic Segmentation
by Yang Zhang et al

01-06-2022

Balancing Generalization and Specialization in Zero-shot Learning
by Yun Li et al

01-04-2022

Transfer Learning for Retinal Vascular Disease Detection: A Pilot Study with Diabetic Retinopathy and Retinopathy of Prematurity
by Guan Wang et al

01-06-2022

Extending One-Stage Detection with Open-World Proposals
by Sachin Konan et al

01-06-2022

RestoreDet: Degradation Equivariant Representation for Object Detection in Low Resolution Images
by Ziteng Cui et al

01-05-2022

Probing TryOnGAN
by Saurabh Kumar et al

01-05-2022

Surface-Aligned Neural Radiance Fields for Controllable 3D Human Synthesis
by Tianhan Xu et al

01-04-2022

Data Augmentation for Depression Detection Using Skeleton-Based Gait Information
by Jingjing Yang et al

01-06-2022

EM-driven unsupervised learning for efficient motion segmentation
by Etienne Meunier et al

01-06-2022

Realistic Full-Body Anonymization with Surface-Guided GANs
by Håkon Hukkelås et al

01-04-2022

Towards Transferable Unrestricted Adversarial Examples with Minimum Changes
by Fangcheng Liu et al

01-04-2022

DeepFGS: Fine-Grained Scalable Coding for Learned Image Compression
by Yi Ma et al

01-07-2022

Embodied Hands: Modeling and Capturing Hands and Bodies Together
by Javier Romero et al

01-06-2022

A Unified Framework for Attention-Based Few-Shot Object Detection
by Pierre Le Jeune et al

01-05-2022

TableParser: Automatic Table Parsing with Weak Supervision from Spreadsheets
by Susie Xi Rao et al

01-06-2022

Persistent Homology for Breast Tumor Classification using Mammogram Scans
by Aras Asaad et al

01-07-2022

A Novel Incremental Learning Driven Instance Segmentation Framework to Recognize Highly Cluttered Instances of the Contraband Items
by Taimur Hassan et al

01-05-2022

GLAN: A Graph-based Linear Assignment Network
by He Liu et al

01-07-2022

Deep Generative Framework for Interactive 3D Terrain Authoring and Manipulation
by Shanthika Naik et al

01-07-2022

Auto-Weighted Layer Representation Based View Synthesis Distortion Estimation for 3-D Video Coding
by Jian Jin et al

01-07-2022

Motion Prediction via Joint Dependency Modeling in Phase Space
by Pengxiang Su et al

01-04-2022

Short Range Correlation Transformer for Occluded Person Re-Identification
by Yunbin Zhao et al

01-07-2022

Multiresolution Fully Convolutional Networks to detect Clouds and Snow through Optical Satellite Images
by Debvrat Varshney et al

01-07-2022

An Incremental Learning Approach to Automatically Recognize Pulmonary Diseases from the Multi-vendor Chest Radiographs
by Mehreen Sirshar et al

01-04-2022

Online Multi-Object Tracking with Unsupervised Re-Identification Learning and Occlusion Estimation
by Qiankun Liu et al

01-07-2022

Leveraging Scale-Invariance and Uncertainity with Self-Supervised Domain Adaptation for Semantic Segmentation of Foggy Scenes
by Javed Iqbal et al

01-07-2022

Learning Target-aware Representation for Visual Tracking via Informative Interactions
by Mingzhe Guo et al

01-05-2022

Multi-Grid Redundant Bounding Box Annotation for Accurate Object Detection
by Solomon Negussie Tesema et al

01-07-2022

Detecting Human-to-Human-or-Object (H2O) Interactions with DIABOLO
by Astrid Orcesi et al

01-04-2022

DenseTact: Optical Tactile Sensor for Dense Shape Reconstruction
by Won Kyung Do et al

01-05-2022

Learning Semantic Ambiguities for Zero-Shot Learning
by Celina Hanouti et al

01-06-2022

CitySurfaces: City-Scale Semantic Segmentation of Sidewalk Materials
by Maryam Hosseini et al

01-07-2022

Uncertainty-Aware Cascaded Dilation Filtering for High-Efficiency Deraining
by Qing Guo et al

01-07-2022

Amplitude SAR Imagery Splicing Localization
by Edoardo Daniele Cannas et al

01-06-2022

A Keypoint Detection and Description Network Based on the Vessel Structure for Multi-Modal Retinal Image Registration
by Aline Sindel et al

01-07-2022

Equalized Focal Loss for Dense Long-Tailed Object Detection
by Bo Li et al

01-06-2022

ITSA: An Information-Theoretic Approach to Automatic Shortcut Avoidance and Domain Generalization in Stereo Matching Networks
by WeiQin Chuah et al

01-05-2022

3D Intracranial Aneurysm Classification and Segmentation via Unsupervised Dual-branch Learning
by Di Shao et al

 
Craig Smith