2022.2.14 Vision papers

 

02-10-2022

Block-NeRF: Scalable Large Scene Neural View Synthesis
by Matthew Tancik et al

02-08-2022

MaskGIT: Masked Generative Image Transformer
by Huiwen Chang et al

02-08-2022

Bingham Policy Parameterization for 3D Rotations in Reinforcement Learning
by Stephen James et al

02-08-2022

The EMory BrEast imaging Dataset (EMBED): A Racially Diverse, Granular Dataset of 3.5M Screening and Diagnostic Mammograms
by Jiwoong J. Jeong et al

02-09-2022

Conditional Motion In-betweening
by Jihoon Kim et al

02-08-2022

DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generative Transformers
by Jaemin Cho et al

02-11-2022

CLIPasso: Semantically-Aware Object Sketching
by Yael Vinker et al

02-09-2022

The Abduction of Sherlock Holmes: A Dataset for Visual Abductive Reasoning
by Jack Hessel et al

02-10-2022

NÜWA-LIP: Language Guided Image Inpainting with Defect-free VQGAN
by Minheng Ni et al

02-08-2022

Causal Scene BERT: Improving object detection by searching for challenging groups of data
by Cinjon Resnick et al

02-09-2022

Point-Level Region Contrast for Object Detection Pre-Training
by Yutong Bai et al

02-10-2022

FILM: Frame Interpolation for Large Motion
by Fitsum Reda et al

02-09-2022

PINs: Progressive Implicit Networks for Multi-Scale Neural Representations
by Zoe Landgraf et al

02-08-2022

GiraffeDet: A Heavy-Neck Paradigm for Object Detection
by Yiqi Jiang et al

02-10-2022

Image-to-Image Regression with Distribution-Free Uncertainty Quantification and Applications in Imaging
by Anastasios N Angelopoulos et al

02-09-2022

Object-Guided Day-Night Visual Localization in Urban Scenes
by Assia Benbihi et al

02-08-2022

Self-Conditioned Generative Adversarial Networks for Image Editing
by Yunzhe Liu et al

02-08-2022

Results and findings of the 2021 Image Similarity Challenge
by Zoë Papakipos et al

02-09-2022

Estimation of Clinical Workload and Patient Activity using Deep Learning and Optical Flow
by Thanh Nguyen-Duc et al

02-09-2022

Can Open Domain Question Answering Systems Answer Visual Knowledge Questions?
by Jiawen Zhang et al

02-10-2022

Equivariance Regularization for Image Reconstruction
by Junqi Tang

02-08-2022

What's Cracking? A Review and Analysis of Deep Learning Methods for Structural Crack Segmentation, Detection and Quantification
by Jacob König et al

02-09-2022

Image Difference Captioning with Pre-training and Contrastive Learning
by Linli Yao et al

02-11-2022

Towards Weakly-Supervised Text Spotting using a Multi-Task Transformer
by Yair Kittenplon et al

02-10-2022

Monotonically Convergent Regularization by Denoising
by Yuyang Hu et al

02-08-2022

Quality Metric Guided Portrait Line Drawing Generation from Unpaired Training Data
by Ran Yi et al

02-08-2022

How to Understand Masked Autoencoders
by Shuhao Cao et al

02-08-2022

Motion-Aware Transformer For Occluded Person Re-identification
by Mi Zhou et al

02-10-2022

Visual Servoing for Pose Control of Soft Continuum Arm in a Structured Environment
by Shivani Kamtikar et al

02-08-2022

Trained Model in Supervised Deep Learning is a Conditional Risk Minimizer
by Yutong Xie et al

02-09-2022

Predicting the intended action using internal simulation of perception
by Zahra Gharaee

02-10-2022

Motion Puzzle: Arbitrary Motion Style Transfer by Body Part
by Deok-Kyeong Jang et al

02-09-2022

Can Humans Do Less-Than-One-Shot Learning?
by Maya Malaviya et al

02-10-2022

OWL (Observe, Watch, Listen): Localizing Actions in Egocentric Video via Audiovisual Temporal Context
by Merey Ramazanova et al

02-09-2022

NIMBLE: A Non-rigid Hand Model with Bones and Muscles
by Yuwei Li et al

02-08-2022

Latent gaze information in highly dynamic decision-tasks
by Benedikt Hosp

02-09-2022

Multi-modal unsupervised brain image registration using edge maps
by Vasiliki Sideri-Lampretsa et al

02-09-2022

Anchor Graph Structure Fusion Hashing for Cross-Modal Similarity Search
by Lu Wang et al

02-08-2022

Self-supervised Contrastive Learning for Cross-domain Hyperspectral Image Representation
by Hyungtae Lee et al

02-09-2022

FCM-DNN: diagnosing coronary artery disease by deep accuracy Fuzzy C-Means clustering model
by Javad Hassannataj Joloudari et al

02-10-2022

Improving performance of aircraft detection in satellite imagery while limiting the labelling effort: Hybrid active learning
by Julie Imbert et al

02-08-2022

SCR: Smooth Contour Regression with Geometric Priors
by Gaetan Bahl et al

02-10-2022

Including Facial Expressions in Contextual Embeddings for Sign Language Generation
by Carla Viegas et al

02-08-2022

Hair Color Digitization through Imaging and Deep Inverse Graphics
by Robin Kips et al

02-11-2022

ACORT: A Compact Object Relation Transformer for Parameter Efficient Image Captioning
by Jia Huei Tan et al

02-10-2022

Deep Learning for Computational Cytology: A Survey
by Hao Jiang et al

02-10-2022

F8Net: Fixed-Point 8-bit Only Multiplication for Network Quantization
by Qing Jin et al

02-10-2022

Memory-based gaze prediction in deep imitation learning for robot manipulation
by Heecheol Kim et al

02-10-2022

Towards the automated large-scale reconstruction of past road networks from historical maps
by Johannes H. Uhl et al

02-08-2022

Equivariance versus Augmentation for Spherical Images
by Jan E. Gerken et al

02-09-2022

Semantic Segmentation of Anaemic RBCs Using Multilevel Deep Convolutional Encoder-Decoder Network
by Muhammad Shahzad et al

02-09-2022

Learning to Bootstrap for Combating Label Noise
by Yuyin Zhou et al

02-10-2022

Adults as Augmentations for Children in Facial Emotion Recognition with Contrastive Learning
by Marco Virgolin et al

02-10-2022

Domain Adversarial Training: A Game Perspective
by David Acuna et al

02-10-2022

A Human-Centered Machine-Learning Approach for Muscle-Tendon Junction Tracking in Ultrasound Images
by Christoph Leitner et al

02-09-2022

CRAT-Pred: Vehicle Trajectory Prediction with Crystal Graph Convolutional Neural Networks and Multi-Head Self-Attention
by Julian Schmidt et al

02-09-2022

Decreasing Annotation Burden of Pairwise Comparisons with Human-in-the-Loop Sorting: Application in Medical Image Artifact Rating
by Ikbeom Jang et al

02-10-2022

Give me a knee radiograph, I will tell you where the knee joint area is: a deep convolutional neural network adventure
by Shi Yan et al

02-08-2022

A Survey of Breast Cancer Screening Techniques: Thermography and Electrical Impedance Tomography
by Juan Zuluaga-Gomez et al

02-10-2022

Feature-level augmentation to improve robustness of deep neural networks to affine transformations
by Adrian Sandru et al

02-08-2022

Uncertainty Modeling for Out-of-Distribution Generalization
by Xiaotong Li et al

02-09-2022

Amplitude Spectrum Transformation for Open Compound Domain Adaptive Semantic Segmentation
by Jogendra Nath Kundu et al

02-08-2022

CAD-RADS Scoring using Deep Learning and Task-Specific Centerline Labeling
by Felix Denzinger et al

02-09-2022

End-to-End Blind Quality Assessment for Laparoscopic Videos using Neural Networks
by Zohaib Amjad Khan et al

02-08-2022

Adversarial Detection without Model Information
by Abhishek Moitra et al

02-08-2022

Class Density and Dataset Quality in High-Dimensional, Unstructured Data
by Adam Byerly et al

02-08-2022

A Unified Multi-Task Learning Framework of Real-Time Drone Supervision for Crowd Counting
by Siqi Gu et al

02-08-2022

Social-DualCVAE: Multimodal Trajectory Forecasting Based on Social Interactions Pattern Aware and Dual Conditional Variational Auto-Encoder
by Jiashi Gao et al

02-11-2022

Unsupervised HDR Imaging: What Can Be Learned from a Single 8-bit Video?
by Francesco Banterle et al

02-08-2022

A multiscale spatiotemporal approach for smallholder irrigation detection
by Terence Conlon et al

02-08-2022

Real-Time Event-Based Tracking and Detection for Maritime Environments
by Stephanie Aelmore et al

02-09-2022

Exploring Structural Sparsity in Neural Image Compression
by Shanzhi Yin et al

02-09-2022

Discovering Concepts in Learned Representations using Statistical Inference and Interactive Visualization
by Adrianna Janik et al

02-10-2022

Describing image focused in cognitive and visual details for visually impaired people: An approach to generating inclusive paragraphs
by Daniel Louzada Fernandes et al

02-10-2022

Real-Time Siamese Multiple Object Tracker with Enhanced Proposals
by Lorenzo Vaquero et al

02-08-2022

Consistency-Regularized Region-Growing Network for Semantic Segmentation of Urban Scenes with Point-Level Annotations
by Yonghao Xu et al

02-08-2022

GLPU: A Geometric Approach For Lidar Pointcloud Upsampling
by George Eskandar et al

02-09-2022

Bias-Eliminated Semantic Refinement for Any-Shot Learning
by Liangjun Feng et al

02-10-2022

Spherical Transformer
by Sungmin Cho et al

02-08-2022

Segmentation by Test-Time Optimization (TTO) for CBCT-based Adaptive Radiation Therapy
by Xiao Liang et al

02-08-2022

Learning Robust Convolutional Neural Networks with Relevant Feature Focusing via Explanations
by Kazuki Adachi et al

02-09-2022

Deep Feature Rotation for Multimodal Image Style Transfer
by Son Truong Nguyen et al

02-08-2022

Self-Paced Imbalance Rectification for Class Incremental Learning
by Zhiheng Liu et al

02-08-2022

Towards Compositional Adversarial Robustness: Generalizing Adversarial Training to Composite Semantic Perturbations
by Yun-Yun Tsai et al

02-08-2022

A Novel Image Descriptor with Aggregated Semantic Skeleton Representation for Long-term Visual Place Recognition
by Nie Jiwei et al

02-10-2022

PVSeRF: Joint Pixel-, Voxel- and Surface-Aligned Radiance Field for Single-Image Novel View Synthesis
by Xianggang Yu et al

02-10-2022

Consistency and Diversity induced Human Motion Segmentation
by Tao Zhou et al

02-08-2022

Exploring Inter-Channel Correlation for Diversity-preserved Knowledge Distillation
by Li Liu et al

02-10-2022

Towards Predicting Fine Finger Motions from Ultrasound Images via Kinematic Representation
by Dean Zadok et al

02-08-2022

Distillation with Contrast is All You Need for Self-Supervised Point Cloud Representation Learning
by Kexue Fu et al

02-11-2022

Multi-Modal Knowledge Graph Construction and Application: A Survey
by Xiangru Zhu et al

02-09-2022

Multiclass histogram-based thresholding using kernel density estimation and scale-space representations
by S. Korneev et al

02-09-2022

Adversarial Attack and Defense of YOLO Detectors in Autonomous Driving Scenarios
by Jung Im Choi et al

02-11-2022

Meta-learning with GANs for anomaly detection, with deployment in high-speed rail inspection system
by Haoyang Cao et al

02-10-2022

Towards Assessing and Characterizing the Semantic Robustness of Face Recognition
by Juan C. Pérez et al

02-09-2022

Weakly-Supervised Semantic Segmentation with Visual Words Learning and Hybrid Pooling
by Lixiang Ru et al

02-08-2022

STC: Spatio-Temporal Contrastive Learning for Video Instance Segmentation
by Zhengkai Jiang et al

02-08-2022

Learning Optical Flow with Adaptive Graph Reasoning
by Ao Luo et al

02-08-2022

Self-supervised Contrastive Learning for Volcanic Unrest Detection
by Nikolaos Ioannis Bountos et al

02-08-2022

Edge-based fever screening system over private 5G
by Murugan Sankaradas et al

02-08-2022

NEWSKVQA: Knowledge-Aware News Video Question Answering
by Pranay Gupta et al

02-09-2022

Geometric Digital Twinning of Industrial Facilities: Retrieval of Industrial Shapes
by Eva Agapaki et al

02-09-2022

Reducing Redundancy in the Bottleneck Representation of the Autoencoders
by Firas Laakom et al

02-08-2022

Navigating to Objects in Unseen Environments by Distance Prediction
by Minzhao Zhu et al

02-08-2022

If a Human Can See It, So Should Your System: Reliability Requirements for Machine Vision Components
by Boyue Caroline Hu et al

02-08-2022

Residual Aligned: Gradient Optimization for Non-Negative Image Synthesis
by Flora Yu Shen et al

02-08-2022

On the Pitfalls of Using the Residual Error as Anomaly Score
by Felix Meissen et al

02-08-2022

Binary Neural Networks as a general-propose compute paradigm for on-device computer vision
by Guhong Nie et al

02-08-2022

Ada-NETS: Face Clustering via Adaptive Neighbour Discovery in the Structure Space
by Yaohua Wang et al

02-09-2022

Distance Estimation and Animal Tracking for Wildlife Camera Trapping
by Peter Johanns et al

02-08-2022

TransformNet: Self-supervised representation learning through predicting geometric transformations
by Sayed Hashim et al

02-10-2022

Exploiting Spatial Sparsity for Event Cameras with Visual Transformers
by Zuowen Wang et al

02-08-2022

BIQ2021: A Large-Scale Blind Image Quality Assessment Database
by Nisar Ahmed et al

02-08-2022

Network Comparison Study of Deep Activation Feature Discriminability with Novel Objects
by Michael Karnes et al

02-08-2022

Cross-level Contrastive Learning and Consistency Constraint for Semi-supervised Medical Image Segmentation
by Xinkai Zhao et al

02-11-2022

Towards Adversarially Robust Deepfake Detection: An Ensemble Approach
by Ashish Hooda et al

02-11-2022

SuperCon: Supervised Contrastive Learning for Imbalanced Skin Lesion Classification
by Keyu Chen et al

02-08-2022

Untrimmed Action Anticipation
by Ivan Rodin et al

02-08-2022

A Novel Plug-in Module for Fine-Grained Visual Classification
by Po-Yung Chou et al

02-11-2022

Vehicle and License Plate Recognition with Novel Dataset for Toll Collection
by Muhammad Usama et al

02-11-2022

A Wasserstein GAN for Joint Learning of Inpainting and its Spatial Optimisation
by Pascal Peter

02-11-2022

Artemis: Articulated Neural Pets with Appearance and Motion synthesis
by Haimin Luo et al

02-08-2022

Disentangle Saliency Detection into Cascaded Detail Modeling and Body Filling
by Yue Song et al

02-08-2022

Addressing Data Scarcity in Multimodal User State Recognition by Combining Semi-Supervised and Supervised Learning
by Hendric Voß et al

02-11-2022

Assessing Privacy Risks from Feature Vector Reconstruction Attacks
by Emily Wenger et al

02-08-2022

Federated Learning of Generative Image Priors for MRI Reconstruction
by Gokberk Elmas et al

02-09-2022

A Joint Variational Multichannel Multiphase Segmentation Framework
by Nadja Gruber et al

02-09-2022

Sampling Strategy for Fine-Tuning Segmentation Models to Crisis Area under Scarcity of Data
by Adrianna Janik et al

02-11-2022

Entroformer: A Transformer-based Entropy Model for Learned Image Compression
by Yichen Qian et al

02-11-2022

Exemplar-free Online Continual Learning
by Jiangpeng He et al

02-11-2022

Deep soccer captioning with transformer: dataset, semantics-related losses, and multi-level evaluation
by Ahmad Hammoudeh et al

02-08-2022

Face2PPG: An unsupervised pipeline for blood volume pulse extraction from faces
by Constantino Álvarez Casado et al

02-10-2022

Learning the Pedestrian-Vehicle Interaction for Pedestrian Trajectory Prediction
by Chi Zhang et al

02-09-2022

Graph Neural Network for Cell Tracking in Microscopy Videos
by Tal Ben-Haim et al

02-11-2022

SafePicking: Learning Safe Object Extraction via Object-Level Mapping
by Kentaro Wada et al

02-10-2022

Incremental Learning of Structured Memory via Closed-Loop Transcription
by Shengbang Tong et al

02-10-2022

Characterizing and overcoming the greedy nature of learning in multi-modal deep neural networks
by Nan Wu et al

02-11-2022

Tiny Object Tracking: A Large-scale Dataset and A Baseline
by Yabin Zhu et al

02-11-2022

WAD-CMSN: Wasserstein Distance based Cross-Modal Semantic Network for Zero-Shot Sketch-Based Image Retrieval
by Guanglong Xu et al

02-09-2022

On Real-time Image Reconstruction with Neural Networks for MRI-guided Radiotherapy
by David E. J. Waddington et al

02-10-2022

Mining the manifolds of deep generative models for multiple data-consistent solutions of ill-posed tomographic imaging problems
by Sayantan Bhadra et al

02-08-2022

Wireless Transmission of Images With The Assistance of Multi-level Semantic Information
by Zhenguo Zhang et al

02-11-2022

Borrowing from yourself: Faster future video segmentation with partial channel update
by Evann Courdier et al

02-10-2022

Dynamic Background Subtraction by Generative Neural Networks
by Fateme Bahri et al

02-11-2022

Multi-Modal Fusion for Sensorimotor Coordination in Steering Angle Prediction
by Farzeen Munir et al

02-10-2022

Face Beneath the Ink: Synthetic Data and Tattoo Removal with Application to Face Recognition
by Mathias Ibsen et al

02-10-2022

Coded ResNeXt: a network for designing disentangled information paths
by Apostolos Avranas et al

02-11-2022

Dilated convolutional neural network-based deep reference picture generation for video compression
by Haoyue Tian et al

02-10-2022

Optimal Transport for Super Resolution Applied to Astronomy Imaging
by Michael Rawson et al

02-10-2022

The MeLa BitChute Dataset
by Milo Trujillo et al

02-09-2022

Class Distance Weighted Cross-Entropy Loss for Ulcerative Colitis Severity Estimation
by Gorkem Polat et al

02-11-2022

Patch-NetVLAD+: Learned patch descriptor and weighted matching strategy for place recognition
by Yingfeng Cai et al

02-11-2022

Video-driven Neural Physically-based Facial Asset for Production
by Longwen Zhang et al

02-11-2022

Bench-Marking And Improving Arabic Automatic Image Captioning Through The Use Of Multi-Task Learning Paradigm
by Muhy Eddin Za'ter et al

02-08-2022

Joint-bone Fusion Graph Convolutional Network for Semi-supervised Skeleton Action Recognition
by Zhigang Tu et al

02-10-2022

Towards a Guideline for Evaluation Metrics in Medical Image Segmentation
by Dominik Müller et al

02-10-2022

A Deep Learning Approach for Digital Color Reconstruction of Lenticular Films
by Stefano D'Aronco et al

02-10-2022

A Plug-and-Play Approach to Multiparametric Quantitative MRI: Image Reconstruction using Pre-Trained Deep Denoisers
by Ketan Fatania et al

02-10-2022

HNF-Netv2 for Brain Tumor Segmentation using multi-modal MR Imaging
by Haozhe Jia et al

02-10-2022

A Field of Experts Prior for Adapting Neural Networks at Test Time
by Neerav Karani et al

 
Craig Smith