2022.4.18 Vision papers

 

04-12-2022

GARF: Gaussian Activated Radiance Fields for High Fidelity Reconstruction and Pose Estimation
by Shin-Fang Chng et al

04-14-2022

DeiT III: Revenge of the ViT
by Hugo Touvron et al

04-14-2022

Neighborhood Attention Transformer
by Ali Hassani et al

04-14-2022

Masked Siamese Networks for Label-Efficient Learning
by Mahmoud Assran et al

04-14-2022

Sim-to-Real 6D Object Pose Estimation via Iterative Self-training for Robotic Bin-picking
by Kai Chen et al

04-13-2022

COAP: Compositional Articulated Occupancy of People
by Marko Mihajlovic et al

04-14-2022

Any-resolution Training for High-resolution Image Synthesis
by Lucy Chai et al

04-13-2022

DL4SciVis: A State-of-the-Art Survey on Deep Learning for Scientific Visualization
by Chaoli Wang et al

04-14-2022

Whats in your hands? 3D Reconstruction of Generic Objects in Hands
by Yufei Ye et al

04-13-2022

Geometric Understanding of Sketches
by Raghav Brahmadesam Venkataramaiyer

04-12-2022

Machine Learning Security against Data Poisoning: Are We There Yet?
by Antonio Emanuele Cinà et al

04-14-2022

MiniViT: Compressing Vision Transformers with Weight Multiplexing
by Jinnian Zhang et al

04-14-2022

GIFS: Neural Implicit Function for General Shape Representation
by Jianglong Ye et al

04-12-2022

ReCLIP: A Strong Zero-Shot Baseline for Referring Expression Comprehension
by Sanjay Subramanian et al

04-13-2022

Multi-View Consistent Generative Adversarial Networks for 3D-aware Image Synthesis
by Xuanmeng Zhang et al

04-14-2022

Ensuring accurate stain reproduction in deep generative networks for virtual immunohistochemistry
by Christopher D. Walsh et al

04-13-2022

Wassmap: Wasserstein Isometric Mapping for Image Manifold Learning
by Keaton Hamm et al

04-14-2022

BEHAVE: Dataset and Method for Tracking Human Object Interactions
by Bharat Lal Bhatnagar et al

04-13-2022

Controllable Video Generation through Global and Local Motion Dynamics
by Aram Davtyan et al

04-12-2022

Open-World Instance Segmentation: Exploiting Pseudo Ground Truth From Learned Pairwise Affinity
by Weiyao Wang et al

04-13-2022

Towards Metrical Reconstruction of Human Faces
by Wojciech Zielonka et al

04-14-2022

A Level Set Theory for Neural Implicit Evolution under Explicit Flows
by Ishit Mehta et al

04-13-2022

Deep Learning-based Framework for Automatic Cranial Defect Reconstruction and Implant Modeling
by Marek Wodzinski et al

04-12-2022

VisCUIT: Visual Auditor for Bias in CNN Image Classifier
by Seongmin Lee et al

04-14-2022

Deformable Sprites for Unsupervised Video Decomposition
by Vickie Ye et al

04-14-2022

Geometric Deep Learning to Identify the Critical 3D Structural Features of the Optic Nerve Head for Glaucoma Diagnosis
by Fabian A. Braeu et al

04-13-2022

What Matters in Language Conditioned Robotic Imitation Learning
by Oier Mees et al

04-13-2022

Reuse your features: unifying retrieval and feature-metric alignment
by Javier Morlana et al

04-12-2022

Generative Negative Replay for Continual Learning
by Gabriele Graffieti et al

04-14-2022

Interpretability of Machine Learning Methods Applied to Neuroimaging
by Elina Thibeau-Sutre et al

04-13-2022

Deep Learning Model with GA based Feature Selection and Context Integration
by Ranju Mandal et al

04-14-2022

From Environmental Sound Representation to Robustness of 2D CNN Models Against Adversarial Attacks
by Mohammad Esmaeilpour et al

04-15-2022

MVSTER: Epipolar Transformer for Efficient Multi-View Stereo
by Xiaofeng Wang et al

04-13-2022

TIB-VA at SemEval-2022 Task 5: A Multimodal Architecture for the Detection and Classification of Misogynous Memes
by Sherzod Hakimov et al

04-14-2022

HyDe: The First Open-Source, Python-Based, GPU-Accelerated Hyperspectral Denoising Package
by Daniel Coquelin et al

04-14-2022

Modeling Indirect Illumination for Inverse Rendering
by Yuanqing Zhang et al

04-12-2022

X-DETR: A Versatile Architecture for Instance-wise Vision-Language Tasks
by Zhaowei Cai et al

04-12-2022

Back to the Roots: Reconstructing Large and Complex Cranial Defects using an Image-based Statistical Shape Model
by Jianning Li et al

04-13-2022

Active Diffusion and VCA-Assisted Image Segmentation of Hyperspectral Images
by Sam L. Polk et al

04-14-2022

Medical Application of Geometric Deep Learning for the Diagnosis of Glaucoma
by Alexandre H. Thiery et al

04-14-2022

Guided Co-Modulated GAN for 360{\deg} Field of View Extrapolation
by Mohammad Reza Karimi Dastjerdi et al

04-14-2022

Unsupervised Deep Learning Meets Chan-Vese Model
by Dihan Zheng et al

04-12-2022

Examining the Proximity of Adversarial Examples to Class Manifolds in Deep Networks
by Štefan Pócoš et al

04-13-2022

Dynamic Neural Textures: Generating Talking-Face Videos with Continuously Controllable Expressions
by Zipeng Ye et al

04-12-2022

Multi-View Breast Cancer Classification via Hypercomplex Neural Networks
by Eleonora Lopez et al

04-12-2022

LifeLonger: A Benchmark for Continual Disease Classification
by Mohammad Mahdi Derakhshani et al

04-12-2022

GORDA: Graph-based ORientation Distribution Analysis of SLI scatterometry Patterns of Nerve Fibres
by Esteban Vaca et al

04-12-2022

Continual Predictive Learning from Videos
by Geng Chen et al

04-12-2022

RL-CoSeg : A Novel Image Co-Segmentation Algorithm with Deep Reinforcement Learning
by Xin Duan et al

04-14-2022

The multi-modal universe of fast-fashion: the Visuelle 2.0 benchmark
by Geri Skenderi et al

04-12-2022

Unsupervised Anomaly and Change Detection with Multivariate Gaussianization
by José A. Padrón-Hidalgo et al

04-13-2022

Estimating Structural Disparities for Face Models
by Shervin Ardeshir et al

04-12-2022

Automatic detection of glaucoma via fundus imaging and artificial intelligence: A review
by Lauren Coan et al

04-14-2022

Sketch guided and progressive growing GAN for realistic and editable ultrasound image synthesis
by Jiamin Liang et al

04-14-2022

LEFM-Nets: Learnable Explicit Feature Map Deep Networks for Segmentation of Histopathological Images of Frozen Sections
by Dario Sitnik et al

04-12-2022

Adaptive Cross-Attention-Driven Spatial-Spectral Graph Convolutional Network for Hyperspectral Image Classification
by Jin-Yu Yang et al

04-13-2022

Context-based Deep Learning Architecture with Optimal Integration Layer for Image Parsing
by Ranju Mandal et al

04-12-2022

Towards Open-Set Object Detection and Discovery
by Jiyang Zheng et al

04-15-2022

Deep CardioSound: An Ensembled Deep Learning Model for Heart Sound MultiLabelling
by Li Guo et al

04-14-2022

Learning Spatially Varying Pixel Exposures for Motion Deblurring
by Cindy M. Nguyen et al

04-13-2022

Learning Convolutional Neural Networks in the Frequency Domain
by Hengyue Pan et al

04-12-2022

Arch-Graph: Acyclic Architecture Relation Predictor for Task-Transferable Neural Architecture Search
by Minbin Huang et al

04-13-2022

DMCNet: Diversified Model Combination Network for Understanding Engagement from Video Screengrabs
by Sarthak Batra et al

04-14-2022

Q-TART: Quickly Training for Adversarial Robustness and in-Transferability
by Madan Ravi Ganesh et al

04-12-2022

Compact Model Training by Low-Rank Projection with Energy Transfer
by Kailing Guo et al

04-14-2022

Atmospheric Turbulence Removal with Complex-Valued Convolutional Neural Network
by Nantheera Anantrasirichai

04-14-2022

Cross-Image Relational Knowledge Distillation for Semantic Segmentation
by Chuanguang Yang et al

04-12-2022

TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation
by Wenqiang Zhang et al

04-13-2022

ViViD++: Vision for Visibility Dataset
by Alex Junho Lee et al

04-13-2022

Defensive Patches for Robust Recognition in the Physical World
by Jiakai Wang et al

04-12-2022

Video Captioning: a comparative review of where we are and which could be the route
by Daniela Moctezuma et al

04-12-2022

Probabilistic Compositional Embeddings for Multimodal Image Retrieval
by Andrei Neculai et al

04-13-2022

Receding Neuron Importances for Structured Pruning
by Mihai Suteu et al

04-13-2022

HASA: Hybrid Architecture Search with Aggregation Strategy for Echinococcosis Classification and Ovary Segmentation in Ultrasound Images
by Jikuan Qian et al

04-14-2022

Detection of Degraded Acacia tree species using deep neural networks on uav drone imagery
by Anne Achieng Osio et al

04-12-2022

NightLab: A Dual-level Architecture with Hardness Detection for Segmentation at Night
by Xueqing Deng et al

04-13-2022

WSSS4LUAD: Grand Challenge on Weakly-supervised Tissue Semantic Segmentation for Lung Adenocarcinoma
by Chu Han et al

04-15-2022

Vision-and-Language Pretrained Models: A Survey
by Siqu Long et al

04-12-2022

SwinNet: Swin Transformer drives edge-aware RGB-D and RGB-T salient object detection
by Zhengyi Liu et al

04-13-2022

Deep learning based automatic detection of offshore oil slicks using SAR data and contextual information
by Emna Amri et al

04-12-2022

Hierarchical Text-Conditional Image Generation with CLIP Latents
by Aditya Ramesh et al

04-12-2022

Regression or Classification? Reflection on BP prediction from PPG data using Deep Neural Networks in the scope of practical applications
by Fabian Schrumpf et al

04-12-2022

On the Equity of Nuclear Norm Maximization in Unsupervised Domain Adaptation
by Wenju Zhang et al

04-12-2022

HyperDet3D: Learning a Scene-conditioned 3D Object Detector
by Yu Zheng et al

04-14-2022

High-performance Evolutionary Algorithms for Online Neuron Control
by Binxu Wang et al

04-12-2022

Towards Reliable Image Outpainting: Learning Structure-Aware Multimodal Fusion with Depth Guidance
by Lei Zhang et al

04-12-2022

Undoing the Damage of Label Shift for Cross-domain Semantic Segmentation
by Yahao Liu et al

04-13-2022

5G Features and Standards for Vehicle Data Exploitation
by Gorka Velez et al

04-14-2022

Semi-Supervised Training to Improve Player and Ball Detection in Soccer
by Renaud Vandeghen et al

04-12-2022

Open-set Text Recognition via Character-Context Decoupling
by Chang Liu et al

04-14-2022

Activation Regression for Continuous Domain Generalization with Applications to Crop Classification
by Samar Khanna et al

04-12-2022

Exploring Event Camera-based Odometry for Planetary Robots
by Florian Mahlknecht et al

04-12-2022

Content and Style Aware Generation of Text-line Images for Handwriting Recognition
by Lei Kang et al

04-12-2022

Neural Texture Extraction and Distribution for Controllable Person Image Synthesis
by Yurui Ren et al

04-13-2022

Calibrating Class Weights with Multi-Modal Information for Partial Video Domain Adaptation
by Xiyu Wang et al

04-12-2022

DAIR-V2X: A Large-Scale Dataset for Vehicle-Infrastructure Cooperative 3D Object Detection
by Haibao Yu et al

04-12-2022

Malceiver: Perceiver with Hierarchical and Multi-modal Features for Android Malware Detection
by Niall McLaughlin

04-14-2022

YOLO-Pose: Enhancing YOLO for Multi Person Pose Estimation Using Object Keypoint Similarity Loss
by Debapriya Maji et al

04-14-2022

Clothes-Changing Person Re-identification with RGB Modality Only
by Xinqian Gu et al

04-14-2022

SemiMultiPose: A Semi-supervised Multi-animal Pose Estimation Framework
by Ari Blau et al

04-13-2022

3D-SPS: Single-Stage 3D Visual Grounding via Referred Point Progressive Selection
by Junyu Luo et al

04-12-2022

Super-Resolution for Selfie Biometrics: Introduction and Application to Face and Iris
by Fernando Alonso-Fernandez et al

04-12-2022

3DeformRS: Certifying Spatial Deformations on Point Clouds
by Gabriel Pérez S. et al

04-15-2022

COTS: Collaborative Two-Stream Vision-Language Pre-Training Model for Cross-Modal Retrieval
by Haoyu Lu et al

04-14-2022

Look Back and Forth: Video Super-Resolution with Explicit Temporal Difference Modeling
by Takashi Isobe et al

04-13-2022

Rapid model transfer for medical image segmentation via iterative human-in-the-loop update: from labelled public to unlabelled clinical datasets for multi-organ segmentation in CT
by Wenao Ma et al

04-13-2022

Transparent Shape from Single Polarization Images
by Shao Mingqi et al

04-14-2022

Improving Cross-Modal Understanding in Visual Dialog via Contrastive Learning
by Feilong Chen et al

04-14-2022

Explainable Analysis of Deep Learning Methods for SAR Image Classification
by Shenghan Su et al

04-13-2022

Recognition of Freely Selected Keypoints on Human Limbs
by Katja Ludwig et al

04-12-2022

EVOPS Benchmark: Evaluation of Plane Segmentation from RGBD and LiDAR Data
by Anastasiia Kornilova et al

04-14-2022

SoccerNet-Tracking: Multiple Object Tracking Dataset and Benchmark in Soccer Videos
by Anthony Cioppa et al

04-13-2022

MINSU (Mobile Inventory And Scanning Unit):Computer Vision and AI
by Jihoon Ryoo et al

04-14-2022

Implicit Sample Extension for Unsupervised Person Re-Identification
by Xinyu Zhang et al

04-12-2022

Unidirectional Video Denoising by Mimicking Backward Recurrent Modules with Look-ahead Forward Ones
by Junyi Li et al

04-12-2022

Three-Stream Joint Network for Zero-Shot Sketch-Based Image Retrieval
by Yu-Wei Zhan et al

04-12-2022

DistPro: Searching A Fast Knowledge Distillation Process via Meta Optimization
by Xueqing Deng et al

04-14-2022

Invisible-to-Visible: Privacy-Aware Human Instance Segmentation using Airborne Ultrasound via Collaborative Learning Variational Autoencoder
by Risako Tanigawa et al

04-12-2022

Semantic keypoint-based pose estimation from single RGB frames
by Karl Schmeckpeper et al

04-13-2022

Mitigating Bias in Facial Analysis Systems by Incorporating Label Diversity
by Camila Kolling et al

04-13-2022

A deep learning algorithm for reducing false positives in screening mammography
by Stefano Pedemonte et al

04-15-2022

Synthesizing Informative Training Samples with GAN
by Bo Zhao et al

04-12-2022

DCMS: Motion Forecasting with Dual Consistency and Multi-Pseudo-Target Supervision
by Maosheng Ye et al

04-14-2022

Human Identity-Preserved Motion Retargeting in Video Synthesis by Feature Disentanglement
by Jingzhe Ma et al

04-12-2022

SRMD: Sparse Random Mode Decomposition
by Nicholas Richardson et al

04-14-2022

OmniPD: One-Step Person Detection in Top-View Omnidirectional Indoor Scenes
by Jingrui Yu et al

04-12-2022

Unsupervised Anomaly Detection in 3D Brain MRI using Deep Learning with impured training data
by Finn Behrendt et al

04-15-2022

SSR-HEF: Crowd Counting with Multi-Scale Semantic Refining and Hard Example Focusing
by Jiwei Chen et al

04-15-2022

Towards PAC Multi-Object Detection and Tracking
by Shuo Li et al

04-14-2022

Autonomous Satellite Detection and Tracking using Optical Flow
by David Zuehlke et al

04-12-2022

Localization Distillation for Object Detection
by Zhaohui Zheng et al

04-15-2022

Crowd counting with segmentation attention convolutional neural network
by Jiwei Chen et al

04-13-2022

Out-of-distribution Detection with Deep Nearest Neighbors
by Yiyou Sun et al

04-14-2022

CroCo: Cross-Modal Contrastive learning for localization of Earth Observation data
by Wei-Hsin Tseng et al

04-15-2022

Crowd counting with crowd attention convolutional neural network
by Jiwei Chen et al

04-14-2022

Joint Forecasting of Panoptic Segmentations with Difference Attention
by Colin Graber et al

04-14-2022

Spatial Likelihood Voting with Self-Knowledge Distillation for Weakly Supervised Object Detection
by Ze Chen et al

04-14-2022

ViTOL: Vision Transformer for Weakly Supervised Object Localization
by Saurav Gupta et al

04-14-2022

End-to-end Learning for Joint Depth and Image Reconstruction from Diffracted Rotation
by Mazen Mel et al

04-14-2022

Weakly Supervised Attended Object Detection Using Gaze Data as Annotations
by Michele Mazzamuto et al

04-14-2022

Pyramidal Attention for Saliency Detection
by Tanveer Hussain et al

04-14-2022

Residual Swin Transformer Channel Attention Network for Image Demosaicing
by Wenzhu Xing et al

04-13-2022

Do You Really Mean That? Content Driven Audio-Visual Deepfake Dataset and Multimodal Method for Temporal Forgery Localization
by Zhixi Cai et al

04-14-2022

Egocentric Human-Object Interaction Detection Exploiting Synthetic Data
by Rosario Leonardi et al

04-15-2022

INSTA-BNN: Binary Neural Network with INSTAnce-aware Threshold
by Changhun Lee et al

04-12-2022

Baseline Computation for Attribution Methods Based on Interpolated Inputs
by Miguel Lerma et al

04-14-2022

Visual-Inertial Odometry with Online Calibration of Velocity-Control Based Kinematic Motion Models
by Haolong Li et al

04-12-2022

How to Register a Live onto a Liver ? Partial Matching in the Space of Varifolds
by Pierre-Louis Antonsanti et al

04-15-2022

Unconditional Image-Text Pair Generation with Multimodal Cross Quantizer
by Hyungyung Lee et al

04-13-2022

Assessing cloudiness in nonwovens
by Michael Godehardt et al

04-12-2022

Label Distribution Learning for Generalizable Multi-source Person Re-identification
by Lei Qi et al

04-12-2022

Few-shot Forgery Detection via Guided Adversarial Interpolation
by Haonan Qiu et al

04-15-2022

Transfer Learning for Instance Segmentation of Waste Bottles using Mask R-CNN Algorithm
by Punitha Jaikumar et al

04-14-2022

RecurSeed and CertainMix for Weakly Supervised Semantic Segmentation
by Sang Hyun Jo et al

04-14-2022

Deep Vehicle Detection in Satellite Video
by Roman Pflugfelder et al

04-14-2022

3D Shuffle-Mixer: An Efficient Context-Aware Vision Learner of Transformer-MLP Paradigm for Dense Prediction in Medical Volume
by Jianye Pang et al

04-14-2022

Panoptic Segmentation using Synthetic and Real Data
by Camillo Quattrocchi et al

04-13-2022

Neural Vector Fields for Surface Representation and Inference
by Edoardo Mello Rella et al

04-15-2022

Revisiting the Adversarial Robustness-Accuracy Tradeoff in Robot Learning
by Mathias Lechner et al

04-13-2022

A9-Dataset: Multi-Sensor Infrastructure-Based Dataset for Mobility Research
by Christian Creß et al

04-13-2022

Does depth estimation help object detection?
by Bedrettin Cetinkaya et al

04-14-2022

Pushing the Limits of Simple Pipelines for Few-Shot Learning: External Data and Fine-Tuning Make a Difference
by Shell Xu Hu et al

04-15-2022

2D Human Pose Estimation: A Survey
by Haoming Chen et al

04-14-2022

MetaSets: Meta-Learning on Point Sets for Generalizable Representations
by Chao Huang et al

04-15-2022

End-to-End Sensitivity-Based Filter Pruning
by Zahra Babaiee et al

04-14-2022

Unsupervised Domain Adaptation with Implicit Pseudo Supervision for Semantic Segmentation
by Wanyu Xu et al

04-14-2022

Interpretable Vertebral Fracture Quantification via Anchor-Free Landmarks Localization
by Alexey Zakharov et al

04-13-2022

Character-focused Video Thumbnail Retrieval
by Shervin Ardeshir et al

04-15-2022

ResT V2: Simpler, Faster and Stronger
by Qing-Long Zhang et al

04-13-2022

SpoofGAN: Synthetic Fingerprint Spoof Images
by Steven A. Grosz et al

04-15-2022

Image Captioning In the Transformer Age
by Yang Xu et al

04-15-2022

Detecting Violence in Video Based on Deep Features Fusion Technique
by Heyam M. Bin Jahlan et al

04-15-2022

Scalable and Real-time Multi-Camera Vehicle Detection, Re-Identification, and Tracking
by Pirazh Khorramshahi et al

04-15-2022

Learning Multi-View Aggregation In the Wild for Large-Scale 3D Semantic Segmentation
by Damien Robert et al

04-13-2022

A Novel Approach for Optimum-Path Forest Classification Using Fuzzy Logic
by Renato W. R. de Souza et al

04-14-2022

Information fusion approach for biomass estimation in a plateau mountainous forest using a synergistic system comprising UAS-based digital camera and LiDAR
by Rong Huang et al

04-12-2022

AGQA 2.0: An Updated Benchmark for Compositional Spatio-Temporal Reasoning
by Madeleine Grunde-McLaughlin et al

04-15-2022

ORCNet: A context-based network to simultaneously segment the ocular region components
by Diego Rafael Lucio et al

04-15-2022

Patch-wise Contrastive Style Learning for Instagram Filter Removal
by Furkan Kınlı et al

04-15-2022

A Keypoint-based Global Association Network for Lane Detection
by Jinsheng Wang et al

04-13-2022

OccAMs Laser: Occlusion-based Attribution Maps for 3D Object Detectors on LiDAR Data
by David Schinagl et al

04-15-2022

Guiding Attention using Partial-Order Relationships for Image Captioning
by Murad Popattia et al

04-14-2022

Model-agnostic Multi-Domain Learning with Domain-Specific Adapters for Action Recognition
by Kazuki Omi et al

04-14-2022

Dense Learning based Semi-Supervised Object Detection
by Binghui Chen et al

04-13-2022

Illumination-Invariant Active Camera Relocalization for Fine-Grained Change Detection in the Wild
by Nan Li et al

04-15-2022

Sensitivity of sparse codes to image distortions
by Kyle Luther et al

04-14-2022

Feature Compression for Rate Constrained Object Detection on the Edge
by Zhongzheng Yuan et al

04-15-2022

Semi-supervised atmospheric component learning in low-light image problem
by Masud An Nur Islam Fahim et al

04-15-2022

FasterVideo: Efficient Online Joint Object Detection And Tracking
by Issa Mouawad et al

04-15-2022

SOTVerse: A User-defined Task Space of Single Object Tracking
by Shiyu Hu et al

04-13-2022

Adaptive Memory Management for Video Object Segmentation
by Ali Pourganjalikhan et al

04-15-2022

Condition-Invariant and Compact Visual Place Description by Convolutional Autoencoder
by Hanjing Ye et al

04-15-2022

CAiD: Context-Aware Instance Discrimination for Self-supervised Learning in Medical Imaging
by Mohammad Reza Hosseinzadeh Taher et al

04-14-2022

Interactive Object Segmentation in 3D Point Clouds
by Theodora Kontogianni et al

04-14-2022

Early Myocardial Infarction Detection with One-Class Classification over Multi-view Echocardiography
by Aysen Degerli et al

04-14-2022

Imposing Consistency for Optical Flow Estimation
by Jisoo Jeong et al

04-13-2022

Semantic-Aware Pretraining for Dense Video Captioning
by Teng Wang et al

04-13-2022

Deep Relation Learning for Regression and Its Application to Brain Age Estimation
by Sheng He et al

04-14-2022

Measuring Compositional Consistency for Video Question Answering
by Mona Gandhi et al

04-14-2022

Robotic and Generative Adversarial Attacks in Offline Writer-independent Signature Verification
by Jordan J. Bird

04-14-2022

PLGAN: Generative Adversarial Networks for Power-Line Segmentation in Aerial Images
by Rabab Abdelfattah et al

 
Craig Smith