2022.3.21 Vision papers

 

03-15-2022

Can Neural Nets Learn the Same Model Twice? Investigating Reproducibility and Double Descent from the Decision Boundary Perspective
by Gowthami Somepalli et al

03-16-2022

Object discovery and representation networks
by Olivier J. Hénaff et al

03-15-2022

An explainability framework for cortical surface-based deep learning
by Fernanda L. Ribeiro et al

03-17-2022

TensoRF: Tensorial Radiance Fields
by Anpei Chen et al

03-16-2022

HybridNets: End-to-End Perception Network
by Dat Vu et al

03-16-2022

Latent Image Animator: Learning to Animate Images via Latent Space Navigation
by Yaohui Wang et al

03-17-2022

One-Shot Adaptation of GAN in Just One CLIP
by Gihyun Kwon et al

03-15-2022

DeepFusion: Lidar-Camera Deep Fusion for Multi-Modal 3D Object Detection
by Yingwei Li et al

03-15-2022

OcclusionFusion: Occlusion-aware Motion Estimation for Real-time Dynamic 3D Reconstruction
by Wenbin Lin et al

03-17-2022

Look Outside the Room: Synthesizing A Consistent Long-Term 3D Scene Video from A Single Image
by Xuanchi Ren et al

03-15-2022

Animatable Neural Implicit Surfaces for Creating Avatars from Videos
by Sida Peng et al

03-17-2022

Transframer: Arbitrary Frame Prediction with Generative Models
by Charlie Nash et al

03-17-2022

Community-Driven Comprehensive Scientific Paper Summarization: Insight from cvpaper.challenge
by Shintaro Yamamoto et al

03-16-2022

Integrating Language Guidance into Vision-based Deep Metric Learning
by Karsten Roth et al

03-15-2022

One Network Doesn't Rule Them All: Moving Beyond Handcrafted Architectures in Self-Supervised Learning
by Sharath Girish et al

03-16-2022

Non-isotropy Regularization for Proxy-based Deep Metric Learning
by Karsten Roth et al

03-16-2022

Dual Diffusion Implicit Bridges for Image-to-Image Translation
by Xuan Su et al

03-18-2022

Three things everyone should know about Vision Transformers
by Hugo Touvron et al

03-17-2022

Fine Detailed Texture Learning for 3D Meshes with Generative Models
by Aysegul Dundar et al

03-15-2022

MotionCLIP: Exposing Human Motion Generation to CLIP Space
by Guy Tevet et al

03-17-2022

Finding Structural Knowledge in Multimodal-BERT
by Victor Milewski et al

03-16-2022

Deep vanishing point detection: Geometric priors make dataset variations vanish
by Yancong Lin et al

03-15-2022

Diffusion Probabilistic Modeling for Video Generation
by Ruihan Yang et al

03-16-2022

CtlGAN: Few-shot Artistic Portraits Generation with Contrastive Transfer Learning
by Yue Wang et al

03-17-2022

Semantic-aligned Fusion Transformer for One-shot Object Detection
by Yizhou Zhao et al

03-16-2022

Unsupervised Semantic Segmentation by Distilling Feature Correspondences
by Mark Hamilton et al

03-17-2022

How Many Data Samples is an Additional Instruction Worth?
by Ravsehaj Singh Puri et al

03-16-2022

UNIMO-2: End-to-End Unified Vision-Language Grounded Learning
by Wei Li et al

03-17-2022

AutoSDF: Shape Priors for 3D Completion, Reconstruction and Generation
by Paritosh Mittal et al

03-15-2022

Unified Visual Transformer Compression
by Shixing Yu et al

03-17-2022

An Interactive Explanatory AI System for Industrial Quality Control
by Dennis Müller et al

03-16-2022

DU-VLG: Unifying Vision-and-Language Generation via Dual Sequence-to-Sequence Pre-training
by Luyang Huang et al

03-16-2022

Capturing Humans in Motion: Temporal-Attentive 3D Human Pose and Shape Estimation from Monocular Video
by Wen-Li Wei et al

03-16-2022

FormNet: Structural Encoding beyond Sequential Modeling in Form Document Information Extraction
by Chen-Yu Lee et al

03-16-2022

Creating Multimedia Summaries Using Tweets and Videos
by Anietie Andy et al

03-15-2022

Active Exploration for Neural Global Illumination of Variable Scenes
by Stavros Diolatzis et al

03-15-2022

Deep learning for radar data exploitation of autonomous vehicle
by Arthur Ouaknine

03-16-2022

Fantastic Style Channels and Where to Find Them: A Submodular Framework for Discovering Diverse Directions in GANs
by Enis Simsar et al

03-18-2022

Do Deep Networks Transfer Invariances Across Classes?
by Allan Zhou et al

03-15-2022

Object Manipulation via Visual Target Localization
by Kiana Ehsani et al

03-16-2022

Attacking deep networks with surrogate-based adversarial black-box methods is easy
by Nicholas A. Lord et al

03-15-2022

Spot the Difference: A Cooperative Object-Referring Game in Non-Perfectly Co-Observable Scene
by Duo Zheng et al

03-15-2022

Learning Spatio-Temporal Downsampling for Effective Video Upscaling
by Xiaoyu Xiang et al

03-15-2022

Relative Pose from SIFT Features
by Daniel Barath et al

03-15-2022

CryoAI: Amortized Inference of Poses for Ab Initio Reconstruction of 3D Molecular Volumes from Real Cryo-EM Images
by Axel Levy et al

03-15-2022

Interspace Pruning: Using Adaptive Filter Representations to Improve Training of Sparse CNNs
by Paul Wimmer et al

03-17-2022

CoGS: Controllable Generation and Search from Sketch and Style
by Cusuh Ham et al

03-15-2022

From 2D to 3D: Re-thinking Benchmarking of Monocular Depth Prediction
by Evin Pınar Örnek et al

03-17-2022

Continual Learning Based on OOD Detection and Task Masking
by Gyuhak Kim et al

03-15-2022

S2F2: Self-Supervised High Fidelity Face Reconstruction from Monocular Image
by Abdallah Dib et al

03-15-2022

Securing the Classification of COVID-19 in Chest X-ray Images: A Privacy-Preserving Deep Learning Approach
by Wadii Boulila et al

03-15-2022

Object Detection as Probabilistic Set Prediction
by Georg Hess et al

03-16-2022

PPCD-GAN: Progressive Pruning and Class-Aware Distillation for Large-Scale Conditional GANs Compression
by Duc Minh Vo et al

03-16-2022

Robustness through Cognitive Dissociation Mitigation in Contrastive Adversarial Training
by Adir Rahamim et al

03-16-2022

Towards Practical Certifiable Patch Defense with Vision Transformer
by Zhaoyu Chen et al

03-17-2022

Global Convergence of MAML and Theory-Inspired Neural Architecture Search for Few-Shot Learning
by Haoxiang Wang et al

03-17-2022

A workflow for segmenting soil and plant X-ray CT images with deep learning in Google's Colaboratory
by Devin A. Rippner et al

03-15-2022

Interactive Portrait Harmonization
by Jeya Maria Jose Valanarasu et al

03-17-2022

Are Vision Transformers Robust to Spurious Correlations?
by Soumya Suvra Ghosal et al

03-16-2022

The Devil Is in the Details: Window-based Attention for Image Compression
by Renjie Zou et al

03-15-2022

Things not Written in Text: Exploring Spatial Commonsense from Visual Signals
by Xiao Liu et al

03-15-2022

Disparities in Dermatology AI Performance on a Diverse, Curated Clinical Image Set
by Roxana Daneshjou et al

03-17-2022

Image Super-Resolution With Deep Variational Autoencoders
by Darius Chira et al

03-15-2022

A Noise-level-aware Framework for PET Image Denoising
by Ye Li et al

03-17-2022

deepNIR: Datasets for generating synthetic NIR images and improved fruit detection system using deep learning techniques
by Inkyu Sa et al

03-15-2022

Domain Adaptive Hand Keypoint and Pixel Localization in the Wild
by Takehiko Ohkawa et al

03-17-2022

POSTER: Diagnosis of COVID-19 through Transfer Learning Techniques on CT Scans: A Comparison of Deep Learning Models
by Aeyan Ashraf et al

03-15-2022

Generalized but not Robust? Comparing the Effects of Data Modification Methods on Out-of-Domain Generalization and Adversarial Robustness
by Tejas Gokhale et al

03-15-2022

Intrinsic Neural Fields: Learning Functions on Manifolds
by Lukas Koestler et al

03-16-2022

Topology-Preserving Shape Reconstruction and Registration via Neural Diffeomorphic Flow
by Shanlin Sun et al

03-15-2022

Bamboo: Building Mega-Scale Vision Dataset Continually with Human-Machine Synergy
by Yuanhan Zhang et al

03-16-2022

DePS: An improved deep learning model for de novo peptide sequencing
by Cheng Ge et al

03-16-2022

A Continual Learning Framework for Adaptive Defect Classification and Inspection
by Wenbo Sun et al

03-17-2022

Neural Compression-Based Feature Learning for Video Restoration
by Cong Huang et al

03-15-2022

Self-Supervised Deep Learning to Enhance Breast Cancer Detection on Screening Mammography
by John D. Miller et al

03-17-2022

Mutual Learning for Domain Adaptation: Self-distillation Image Dehazing Network with Sample-cycle
by Tian Ye et al

03-15-2022

K-VQG: Knowledge-aware Visual Question Generation for Common-sense Acquisition
by Kohei Uehara et al

03-16-2022

Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding
by Haojun Jiang et al

03-15-2022

Seeking Commonness and Inconsistencies: A Jointly Smoothed Approach to Multi-view Subspace Clustering
by Xiaosha Cai et al

03-16-2022

Graph Flow: Cross-layer Graph Flow Distillation for Dual-Efficient Medical Image Segmentation
by Wenxuan Zou et al

03-17-2022

Vox2Cortex: Fast Explicit Reconstruction of Cortical Surfaces from 3D MRI Scans with Geometric Deep Neural Networks
by Fabian Bongratz et al

03-15-2022

Unpaired Deep Image Dehazing Using Contrastive Disentanglement Learning
by Xiang Chen et al

03-16-2022

A Survey on Infrared Image and Video Sets
by Kevser Irem Danaci et al

03-16-2022

Fusing Local Similarities for Retrieval-based 3D Orientation Estimation of Unseen Objects
by Chen Zhao et al

03-16-2022

What Do Adversarially trained Neural Networks Focus: A Fourier Domain-based Study
by Binxiao Huang et al

03-16-2022

Tangles and Hierarchical Clustering
by Eva Fluck

03-16-2022

Learning Where To Look -- Generative NAS is Surprisingly Efficient
by Jovita Lukasik et al

03-15-2022

Style Transformer for Image Inversion and Editing
by Xueqi Hu et al

03-15-2022

Neural Radiance Projection
by Pham Ngoc Huy et al

03-15-2022

Mixed-Precision Neural Network Quantization via Learned Layer-wise Importance
by Chen Tang et al

03-16-2022

Efficient conditioned face animation using frontally-viewed embedding
by Maxime Oquab et al

03-16-2022

Is it all a cluster game? -- Exploring Out-of-Distribution Detection based on Clustering in the Embedding Space
by Poulami Sinhamahapatra et al

03-17-2022

DetMatch: Two Teachers are Better Than One for Joint 2D and 3D Semi-Supervised Object Detection
by Jinhyung Park et al

03-16-2022

Attribute Surrogates Learning and Spectral Tokens Pooling in Transformers for Few-shot Learning
by Yangji He et al

03-17-2022

A Text Attention Network for Spatial Deformation Robust Scene Text Image Super-resolution
by Jianqi Ma et al

03-16-2022

EDTER: Edge Detection with Transformer
by Mengyang Pu et al

03-15-2022

Gradient Correction beyond Gradient Descent
by Zefan Li et al

03-18-2022

Analyzing EEG Data with Machine and Deep Learning: A Benchmark
by Danilo Avola et al

03-15-2022

ActFormer: A GAN Transformer Framework towards General Action-Conditioned 3D Human Motion Generation
by Ziyang Song et al

03-15-2022

LiP-Flow: Learning Inference-time Priors for Codec Avatars via Normalizing Flows in Latent Space
by Emre Aksan et al

03-15-2022

Represent, Compare, and Learn: A Similarity-Aware Framework for Class-Agnostic Counting
by Min Shi et al

03-17-2022

On the Properties of Adversarially-Trained CNNs
by Mattia Carletti et al

03-17-2022

Localizing Visual Sounds the Easy Way
by Shentong Mo et al

03-16-2022

Learning video retrieval models with relevance-aware online mining
by Alex Falcon et al

03-17-2022

Progressive Subsampling for Oversampled Data -- Application to Quantitative MRI
by Stefano B. Blumberg et al

03-16-2022

Occlusion Fields: An Implicit Representation for Non-Line-of-Sight Surface Reconstruction
by Javier Grau et al

03-15-2022

CODA: A Real-World Road Corner Case Dataset for Object Detection in Autonomous Driving
by Kaican Li et al

03-15-2022

Rich CNN-Transformer Feature Aggregation Networks for Super-Resolution
by Jinsu Yoo et al

03-16-2022

PosePipe: Open-Source Human Pose Estimation Pipeline for Clinical Research
by R. James Cotton

03-16-2022

Zero Pixel Directional Boundary by Vector Transform
by Edoardo Mello Rella et al

03-15-2022

Distribution-Aware Single-Stage Models for Multi-Person 3D Pose Estimation
by Zitian Wang et al

03-15-2022

Implicit field supervision for robust non-rigid shape matching
by Ramana Sundararaman et al

03-17-2022

On Multi-Domain Long-Tailed Recognition, Generalization and Beyond
by Yuzhe Yang et al

03-16-2022

Relational Self-Supervised Learning
by Mingkai Zheng et al

03-15-2022

What's in the Black Box? The False Negative Mechanisms Inside Object Detectors
by Dimity Miller et al

03-16-2022

Complexity Reduction of Learned In-Loop Filtering in Video Coding
by Woody Bayliss et al

03-16-2022

PointAttN: You Only Need Attention for Point Cloud Completion
by Jun Wang et al

03-16-2022

QS-Attn: Query-Selected Attention for Contrastive Learning in I2I Translation
by Xueqi Hu et al

03-16-2022

A Survey of Historical Document Image Datasets
by Konstantina Nikolaidou et al

03-16-2022

Data Efficient 3D Learner via Knowledge Transferred from 2D Model
by Ping-Chung Yu et al

03-16-2022

Conditional Measurement Density Estimation in Sequential Monte Carlo via Normalizing Flow
by Xiongjie Chen et al

03-17-2022

Synthetic-to-Real Domain Adaptation using Contrastive Unpaired Translation
by Benedikt T. Imbusch et al

03-15-2022

On Hyperbolic Embeddings in 2D Object Detection
by Christopher Lang et al

03-15-2022

A multi-organ point cloud registration algorithm for abdominal CT registration
by Samuel Joutard et al

03-15-2022

SISL: Self-Supervised Image Signature Learning for Splicing Detection and Localization
by Susmit Agrawal et al

03-16-2022

DeciWatch: A Simple Baseline for 10x Efficient 2D and 3D Pose Estimation
by Ailing Zeng et al

03-17-2022

Do We Really Need a Learnable Classifier at the End of Deep Neural Network?
by Yibo Yang et al

03-16-2022

DiFT: Differentiable Differential Feature Transform for Multi-View Stereo
by Kaizhang Kang et al

03-16-2022

X-Learner: Learning Cross Sources and Tasks for Universal Visual Representation
by Yinan He et al

03-17-2022

One-Stage Deep Edge Detection Based on Dense-Scale Feature Fusion and Pixel-Level Imbalance Learning
by Dawei Dai et al

03-16-2022

A Real-Time Region Tracking Algorithm Tailored to Endoscopic Video with Open-Source Implementation
by Jonathan P. Epperlein et al

03-15-2022

SATS: Self-Attention Transfer for Continual Semantic Segmentation
by Yiqiao Qiu et al

03-16-2022

Meta-Learning of NAS for Few-shot Learning in Medical Image Applications
by Viet-Khoa Vo-Ho et al

03-16-2022

WegFormer: Transformers for Weakly Supervised Semantic Segmentation
by Chunmeng Liu et al

03-16-2022

Attribute Group Editing for Reliable Few-shot Image Generation
by Guanqi Ding et al

03-16-2022

Know your sensORs -- A Modality Study For Surgical Action Classification
by Lennart Bastian et al

03-17-2022

Generalized Classification of Satellite Image Time Series with Thermal Positional Encoding
by Joachim Nyborg et al

03-18-2022

Cross-Modal Perceptionist: Can Face Geometry be Gleaned from Voices?
by Cho-Ying Wu et al

03-16-2022

Coverage Optimization of Camera Network for Continuous Deformable Object
by Chang Li et al

03-16-2022

MonoJSG: Joint Semantic and Geometric Cost Volume for Monocular 3D Object Detection
by Qing Lian et al

03-16-2022

PMAL: Open Set Recognition via Robust Prototype Mining
by Jing Lu et al

03-17-2022

CYBORGS: Contrastively Bootstrapping Object Representations by Grounding in Segmentation
by Renhao Wang et al

03-16-2022

Multi-Scale Context-Guided Lumbar Spine Disease Identification with Coarse-to-fine Localization and Classification
by Zifan Chen et al

03-15-2022

Parking Analytics Framework using Deep Learning
by Bilel Benjdira et al

03-17-2022

Interacting Attention Graph for Single Image Two-Hand Reconstruction
by Mengcheng Li et al

03-17-2022

Fine-tuning Global Model via Data-Free Knowledge Distillation for Non-IID Federated Learning
by Lin Zhang et al

03-15-2022

Motif Mining: Finding and Summarizing Remixed Image Content
by William Theisen et al

03-16-2022

UnseenNet: Fast Training Detector for Any Unseen Concept
by Asra Aslam et al

03-15-2022

CrowdMLP: Weakly-Supervised Crowd Counting via Multi-Granularity MLP
by Mingjie Wang et al

03-16-2022

Multi-focus thermal image fusion
by Radek Benes et al

03-16-2022

An Active Contour Model with Local Variance Force Term and Its Efficient Minimization Solver for Multi-phase Image Segmentation
by Chaoyu Liu et al

03-15-2022

Scalable Penalized Regression for Noise Detection in Learning with Noisy Labels
by Yikai Wang et al

03-16-2022

Decoupled Knowledge Distillation
by Borui Zhao et al

03-17-2022

Video Prediction at Multiple Scales with Hierarchical Recurrent Networks
by Ani Karapetyan et al

03-15-2022

ARTEMIS: Attention-based Retrieval with Text-Explicit Matching and Implicit Similarity
by Ginger Delmas et al

03-17-2022

PreTR: Spatio-Temporal Non-Autoregressive Trajectory Prediction Transformer
by Lina Achaji et al

03-17-2022

Modeling Intensification for Sign Language Generation: A Computational Approach
by Mert İnan et al

03-17-2022

HSC4D: Human-centered 4D Scene Capture in Large-scale Indoor-outdoor Space Using Wearable IMUs and LiDAR
by Yudi Dai et al

03-15-2022

Can you even tell left from right? Presenting a new challenge for VQA
by Sai Raam Venkatraman et al

03-15-2022

Progressive End-to-End Object Detection in Crowded Scenes
by Anlin Zheng et al

03-17-2022

Visualizing Global Explanations of Point Cloud DNNs
by Hanxiao Tan

03-17-2022

Towards Data-Efficient Detection Transformers
by Wen Wang et al

03-17-2022

Details or Artifacts: A Locally Discriminative Learning Approach to Realistic Image Super-Resolution
by Jie Liang et al

03-17-2022

TO-Scene: A Large-scale Dataset for Understanding 3D Tabletop Scenes
by Mutian Xu et al

03-17-2022

Optimal Rejection Function Meets Character Recognition Tasks
by Xiaotong Ji et al

03-15-2022

APRNet: Attention-based Pixel-wise Rendering Network for Photo-Realistic Text Image Generation
by Yangming Shi et al

03-15-2022

Magnification Prior: A Self-Supervised Method for Learning Representations on Breast Cancer Histopathological Images
by Prakash Chandra Chhipa et al

03-16-2022

Layer Ensembles: A Single-Pass Uncertainty Estimation in Deep Learning for Segmentation
by Kaisar Kushibar et al

03-15-2022

HUMUS-Net: Hybrid unrolled multi-scale network architecture for accelerated MRI reconstruction
by Zalan Fabian et al

03-17-2022

A Novel End-To-End Network for Reconstruction of Non-Regularly Sampled Image Data Using Locally Fully Connected Layers
by Simon Grosche et al

03-17-2022

Surgical Workflow Recognition: from Analysis of Challenges to Architectural Study
by Tobias Czempiel et al

03-15-2022

Image Quality Assessment for Magnetic Resonance Imaging
by Segrey Kastryulin et al

03-15-2022

2-speed network ensemble for efficient classification of incremental land-use/land-cover satellite image chips
by Michael James Horry et al

03-17-2022

Deep Unsupervised Hashing with Latent Semantic Components
by Qinghong Lin et al

03-16-2022

Robust Table Detection and Structure Recognition from Heterogeneous Document Images
by Chixiang Ma et al

03-15-2022

Adversarial Counterfactual Augmentation: Application in Alzheimer's Disease Classification
by Tian Xia et al

03-15-2022

Breast Cancer Molecular Subtypes Prediction on Pathological Images with Discriminative Patch Selecting and Multi-Instance Learning
by Hong Liu et al

03-15-2022

Wave-SAN: Wavelet based Style Augmentation Network for Cross-Domain Few-Shot Learning
by Yuqian Fu et al

03-15-2022

Revitalize Region Feature for Democratizing Video-Language Pre-training
by Guanyu Cai et al

03-18-2022

Robot peels banana with goal-conditioned dual-action deep imitation learning
by Heecheol Kim et al

03-16-2022

Panini-Net: GAN Prior Based Degradation-Aware Feature Interpolation for Face Restoration
by Yinhuai Wang et al

03-16-2022

DATA: Domain-Aware and Task-Aware Pre-training
by Qing Chang et al

03-17-2022

Using the Order of Tomographic Slices as a Prior for Neural Networks Pre-Training
by Yaroslav Zharov et al

03-17-2022

Neural Part Priors: Learning to Optimize Part-Based Object Completion in RGB-D Scans
by Alexey Bokhovkin et al

03-17-2022

A Differentiable Two-stage Alignment Scheme for Burst Image Reconstruction with Large Shift
by Shi Guo et al

03-15-2022

Meta Ordinal Regression Forest for Medical Image Classification with Ordinal Labels
by Yiming Lei et al

03-17-2022

Medium Transmission Map Matters for Learning to Restore Real-World Underwater Images
by Yan Kai et al

03-17-2022

ZebraPose: Coarse to Fine Surface Encoding for 6DoF Object Pose Estimation
by Yongzhi Su et al

03-16-2022

RBC: Rectifying the Biased Context in Continual Semantic Segmentation
by Hanbin Zhao et al

03-18-2022

Diffusion and Volume Maximization-Based Clustering of Highly Mixed Hyperspectral Images
by Sam L. Polk et al

03-16-2022

Patch-Fool: Are Vision Transformers Always Robust Against Adversarial Perturbations?
by Yonggan Fu et al

03-16-2022

Privacy-preserving Online AutoML for Domain-Specific Face Detection
by Chenqian Yan et al

03-15-2022

Implicit Feature Decoupling with Depthwise Quantization
by Iordanis Fostiropoulos et al

03-17-2022

HybridCap: Inertia-aid Monocular Capture of Challenging Human Motions
by Han Liang et al

03-18-2022

Beyond a Video Frame Interpolator: A Space Decoupled Learning Approach to Continuous Image Transition
by Tao Yang et al

03-15-2022

WeakM3D: Towards Weakly Supervised Monocular 3D Object Detection
by Liang Peng et al

03-17-2022

Bi-directional Object-context Prioritization Learning for Saliency Ranking
by Xin Tian et al

03-17-2022

MotionAug: Augmentation with Physical Correction for Human Motion Prediction
by Takahiro Maeda et al

03-18-2022

Ultra-low Latency Spiking Neural Networks with Spatio-Temporal Compression and Synaptic Convolutional Block
by Changqing Xu et al

03-16-2022

Semantic-diversity transfer network for generalized zero-shot learning via inner disagreement based OOD detector
by Bo Liu et al

03-17-2022

FERV39k: A Large-Scale Multi-Scene Dataset for Facial Expression Recognition in Videos
by Yan Wang et al

03-16-2022

SC2: Supervised Compression for Split Computing
by Yoshitomo Matsubara et al

03-15-2022

UNet Architectures in Multiplanar Volumetric Segmentation -- Validated on Three Knee MRI Cohorts
by Sandeep Singh Sengara et al

03-18-2022

AutoAdversary: A Pixel Pruning Method for Sparse Adversarial Attack
by Jinqiao Li et al

03-17-2022

Modulated Contrast for Versatile Image Synthesis
by Fangneng Zhan et al

03-17-2022

Object Localization under Single Coarse Point Supervision
by Xuehui Yu et al

03-15-2022

Auto-Gait: Automatic Ataxia Risk Assessment with Computer Vision on Gait Task Videos
by Wasifur Rahman et al

03-17-2022

Deterministic Bridge Regression for Compressive Classification
by Kar-Ann Toh et al

03-17-2022

Contrastive Learning for Cross-Domain Open World Recognition
by Francesco Cappio Borlino et al

03-16-2022

Open Set Recognition using Vision Transformer with an Additional Detection Head
by Feiyang Cai et al

03-15-2022

Fast Autofocusing using Tiny Networks for Digital Holographic Microscopy
by Stéphane Cuenat et al

03-15-2022

P-STMO: Pre-Trained Spatial Temporal Many-to-One Model for 3D Human Pose Estimation
by Wenkang Shan et al

03-17-2022

ART-SS: An Adaptive Rejection Technique for Semi-Supervised restoration for adverse weather-affected images
by Rajeev Yasarla et al

03-15-2022

InsCon: Instance Consistency Feature Representation via Self-Supervised Learning
by Junwei Yang et al

03-18-2022

Learning Consistency from High-quality Pseudo-labels for Weakly Supervised Object Localization
by Kangbo Sun et al

03-16-2022

Towards True Detail Restoration for Super-Resolution: A Benchmark and a Quality Metric
by Eugene Lyapustin et al

03-17-2022

Simulation-Driven Training of Vision Transformers Enabling Metal Segmentation in X-Ray Images
by Fuxin Fan et al

03-16-2022

Example Perplexity
by Nevin L. Zhang et al

03-15-2022

A Deep Dive into Dataset Imbalance and Bias in Face Identification
by Valeriia Cherepanova et al

03-17-2022

PanoFormer: Panorama Transformer for Indoor 360° Depth Estimation
by Zhijie Shen et al

03-15-2022

GPV-Pose: Category-level Object Pose Estimation via Geometry-guided Point-wise Voting
by Yan Di et al

03-18-2022

Multi-Modal Masked Pre-Training for Monocular Panoramic Depth Completion
by Zhiqiang Yan et al

03-18-2022

Lunar Rover Localization Using Craters as Landmarks
by Larry Matthies et al

03-15-2022

Panoptic SwiftNet: Pyramidal Fusion for Real-time Panoptic Segmentation
by Josip Šarić et al

03-15-2022

Smoothing Matters: Momentum Transformer for Domain Adaptive Semantic Segmentation
by Runfa Chen et al

03-15-2022

On the focusing of thermal images
by Marcos Faundez-Zanuy et al

03-16-2022

Automated Grading of Radiographic Knee Osteoarthritis Severity Combined with Joint Space Narrowing
by Hanxue Gu et al

03-15-2022

Driving Anomaly Detection Using Conditional Generative Adversarial Network
by Yuning Qiu et al

03-15-2022

Simultaneous Localisation and Mapping with Quadric Surfaces
by Tristan Laidlow et al

03-15-2022

Pose-MUM: Reinforcing Key Points Relationship for Semi-Supervised Human Pose Estimation
by JongMok Kim et al

03-15-2022

DialogueNeRF: Towards Realistic Avatar Face-to-face Conversation Video Generation
by Zanwei Zhou et al

03-16-2022

Hybrid Pixel-Unshuffled Network for Lightweight Image Super-Resolution
by Bin Sun et al

03-17-2022

Rethinking the optimization process for self-supervised model-driven MRI reconstruction
by Weijian Huang et al

03-18-2022

REALY: Rethinking the Evaluation of 3D Face Reconstruction
by Zenghao Chai et al

03-15-2022

Multi-Curve Translator for Real-Time High-Resolution Image-to-Image Translation
by Yuda Song et al

03-16-2022

Scribble-Supervised LiDAR Semantic Segmentation
by Ozan Unal et al

03-16-2022

STPLS3D: A Large-Scale Synthetic and Real Aerial Photogrammetry 3D Point Cloud Dataset
by Meida Chen et al

03-17-2022

Deep Point Cloud Simplification for High-quality Surface Reconstruction
by Yuanqi Li et al

03-17-2022

UWED: Unsigned Distance Field for Accurate 3D Scene Representation and Completion
by Jean Pierre Richa et al

03-17-2022

Transforming Gait: Video-Based Spatiotemporal Gait Analysis
by R. James Cotton et al

03-17-2022

Biasing Like Human: A Cognitive Bias Framework for Scene Graph Generation
by Xiaoguang Chang et al

03-16-2022

CapsNet for Medical Image Segmentation
by Minh Tran et al

03-16-2022

ABN: Agent-Aware Boundary Networks for Temporal Action Proposal Generation
by Khoa Vo et al

03-16-2022

Sat-NeRF: Learning Multi-View Satellite Photogrammetry With Transient Objects and Shadow Modeling Using RPC Cameras
by Roger Marí et al

03-16-2022

Gate-Shift-Fuse for Video Action Recognition
by Swathikiran Sudhakaran et al

03-17-2022

Active Visuo-Haptic Object Shape Completion
by Lukas Rustler et al

03-15-2022

Inverted Pyramid Multi-task Transformer for Dense Scene Understanding
by Hanrong Ye et al

03-17-2022

Mutual Generative Transformer Learning for Cross-view Geo-localization
by Jianwei Zhao et al

03-18-2022

Transferable Class-Modelling for Decentralized Source Attribution of GAN-Generated Images
by Brandon B. G. Khoo et al

03-18-2022

Convolutional Simultaneous Sparse Approximation with Applications to RGB-NIR Image Fusion
by Farshad G. Veshki et al

03-16-2022

Point-Unet: A Context-aware Point-based Neural Network for Volumetric Segmentation
by Ngoc-Vuong Ho et al

03-16-2022

3D-UCaps: 3D Capsules Unet for Volumetric Image Segmentation
by Tan Nguyen et al

03-18-2022

Application of Top-hat Transformation for Enhanced Blood Vessel Extraction
by Tithi Parna Das et al

03-18-2022

Parametric Scaling of Preprocessing assisted U-net Architecture for Improvised Retinal Vessel Segmentation
by Kundan Kumar et al

03-16-2022

Extensive Threat Analysis of Vein Attack Databases and Attack Detection by Fusion of Comparison Scores
by Johannes Schuiki et al

03-15-2022

MOBDrone: a Drone Video Dataset for Man OverBoard Rescue
by Donato Cafarelli et al

03-15-2022

On the Pitfalls of Batch Normalization for End-to-End Video Learning: A Study on Surgical Workflow Analysis
by Dominik Rivoir et al

03-17-2022

Depth-aware Neural Style Transfer using Instance Normalization
by Eleftherios Ioannou et al

03-18-2022

Pseudo Bias-Balanced Learning for Debiased Chest X-ray Classification
by Luyang Luo et al

03-18-2022

Nonnegative-Constrained Joint Collaborative Representation with Union Dictionary for Hyperspectral Anomaly Detection
by Shizhen Chang et al

03-15-2022

SocialVAE: Human Trajectory Prediction using Timewise Latents
by Pei Xu et al

03-17-2022

MuKEA: Multimodal Knowledge Extraction and Accumulation for Knowledge-based Visual Question Answering
by Yang Ding et al

03-16-2022

Hyperbolic Uncertainty Aware Semantic Segmentation
by Bike Chen et al

03-17-2022

Improving the Transferability of Targeted Adversarial Examples through Object-Based Diverse Input
by Junyoung Byun et al

03-17-2022

DRAG: Dynamic Region-Aware GCN for Privacy-Leaking Image Detection
by Guang Yang et al

03-17-2022

Novel Consistency Check For Fast Recursive Reconstruction Of Non-Regularly Sampled Video Data
by Simon Grosche et al

03-15-2022

End-to-End Modeling via Information Tree for One-Shot Natural Language Spatial Video Grounding
by Mengze Li et al

03-17-2022

Facial Geometric Detail Recovery via Implicit Representation
by Xingyu Ren et al

03-17-2022

Group Contextualization for Video Recognition
by Yanbin Hao et al

03-18-2022

Elastica Models for Color Image Regularization
by Hao Liu et al

03-18-2022

Unbiased Subclass Regularization for Semi-Supervised Semantic Segmentation
by Dayan Guan et al

03-18-2022

Sparse Fuse Dense: Towards High Quality 3D Detection with Depth Completion
by Xiaopei Wu et al

03-15-2022

CSN: Component-Supervised Network for Few-Shot Classification
by Shuai Shao et al

03-18-2022

Bayesian Inversion for Nonlinear Imaging Models using Deep Generative Priors
by Pakshal Bohra et al

03-18-2022

Distortion-Tolerant Monocular Depth Estimation On Omnidirectional Images Using Dual-cubemap
by Zhijie Shen et al

03-18-2022

A Dual Weighting Label Assignment Scheme for Object Detection
by Shuai Li et al

03-18-2022

Enhancement of Novel View Synthesis Using Omnidirectional Image Completion
by Takayuki Hara et al

03-17-2022

Multi-similarity based Hyperrelation Network for few-shot segmentation
by Xiangwen Shi et al

03-15-2022

Exact Feature Distribution Matching for Arbitrary Style Transfer and Domain Generalization
by Yabin Zhang et al

03-17-2022

Human Gait Analysis using Gait Energy Image
by Sagor Chandro Bakchy et al

03-15-2022

SPA-VAE: Similar-Parts-Assignment for Unsupervised 3D Point Cloud Generation
by Shidi Li et al

03-18-2022

Stacked Hybrid-Attention and Group Collaborative Learning for Unbiased Scene Graph Generation
by Xingning Dong et al

03-18-2022

CodedVTR: Codebook-based Sparse Voxel Transformer with Geometric Guidance
by Tianchen Zhao et al

03-18-2022

SynthStrip: Skull-Stripping for Any Brain Image
by Andrew Hoopes et al

03-17-2022

VISTA: Boosting 3D Object Detection via Dual Cross-VIew SpaTial Attention
by Shengheng Deng et al

03-17-2022

Video-based Formative and Summative Assessment of Surgical Tasks using Deep Learning
by Erim Yanik et al

03-18-2022

DTA: Physical Camouflage Attacks using Differentiable Transformation Network
by Naufal Suryanto et al

03-18-2022

ContrastMask: Contrastive Learning to Segment Every Thing
by Xuehui Wang et al

03-18-2022

Completing Partial Point Clouds with Outliers by Collaborative Completion and Segmentation
by Changfeng Ma et al

03-18-2022

Local-Global Context Aware Transformer for Language-Guided Video Segmentation
by Chen Liang et al

03-18-2022

Fourier Document Restoration for Robust Document Dewarping and Recognition
by Chuhui Xue et al

03-17-2022

Delta Distillation for Efficient Video Processing
by Amirhossein Habibian et al

03-18-2022

Grasp Pre-shape Selection by Synthetic Training: Eye-in-hand Shared Control on the Hannes Prosthesis
by Federico Vasile et al

03-18-2022

Learning Affordance Grounding from Exocentric Images
by Hongchen Luo et al

03-17-2022

Surface Defect Detection and Evaluation for Marine Vessels using Multi-Stage Deep Learning
by Li Yu et al

03-18-2022

Laneformer: Object-aware Row-Column Transformers for Lane Detection
by Jianhua Han et al

03-18-2022

Perspective Flow Aggregation for Data-Limited 6D Object Pose Estimation
by Yinlin Hu et al

03-18-2022

Series Photo Selection via Multi-view Graph Learning
by Jin Huang et al

03-18-2022

ESS: Learning Event-based Semantic Segmentation from Still Images
by Zhaoning Sun et al

03-17-2022

SepTr: Separable Transformer for Audio Spectrogram Processing
by Nicolae-Catalin Ristea et al

03-18-2022

Location-Free Camouflage Generation Network
by Yangyang Li et al

03-18-2022

Class-Balanced Pixel-Level Self-Labeling for Domain Adaptive Semantic Segmentation
by Ruihuang Li et al

03-18-2022

Semi-Supervised Learning with Mutual Distillation for Monocular Depth Estimation
by Jongbeom Baek et al

03-18-2022

Deepfake Style Transfer Mixture: a First Forensic Ballistics Study on Synthetic Images
by Luca Guarnera et al

03-18-2022

SHREC 2021: Classification in cryo-electron tomograms
by Ilja Gubins et al

03-18-2022

Multi-input segmentation of damaged brain in acute ischemic stroke patients using slow fusion with skip connection
by Luca Tomasetti et al

03-18-2022

Towards Robust 2D Convolution for Reliable Visual Recognition
by Lida Li et al

03-18-2022

GiNGR: Generalized Iterative Non-Rigid Point Cloud and Surface Registration Using Gaussian Process Regression
by Dennis Madsen et al

03-16-2022

Understanding robustness and generalization of artificial neural networks through Fourier masks
by Nikos Karantzas et al

03-18-2022

Imaging-based histological features are predictive of MET alterations in Non-Small Cell Lung Cancer
by Rohan P. Joshi et al

03-15-2022

An Annotation-free Restoration Network for Cataractous Fundus Images
by Heng Li et al

03-17-2022

Regional Semantic Contrast and Aggregation for Weakly Supervised Semantic Segmentation
by Tianfei Zhou et al

03-16-2022

Neural Enhanced Belief Propagation for Data Association in Multiobject Tracking
by Mingchao Liang et al

03-16-2022

On the sensitivity of pose estimation neural networks: rotation parameterizations, Lipschitz constants, and provable bounds
by Trevor Avant et al

03-17-2022

Unified Line and Paragraph Detection by Graph Convolutional Networks
by Shuang Liu et al

03-17-2022

Revealing Reliable Signatures by Learning Top-Rank Pairs
by Xiaotong Ji et al

03-17-2022

MatchFormer: Interleaving Attention in Transformers for Feature Matching
by Qing Wang et al

03-17-2022

Cascade Transformers for End-to-End Person Search
by Rui Yu et al

03-17-2022

3DAC: Learning Attribute Compression for Point Clouds
by Guangchi Fang et al

03-15-2022

CaRTS: Causality-driven Robot Tool Segmentation from Vision and Kinematics Data
by Hao Ding et al

03-15-2022

Self-Normalized Density Map (SNDM) for Counting Microbiological Objects
by Krzysztof M. Graczyk et al

03-16-2022

Computer Vision Algorithm for Predicting the Welding Efficiency of Friction Stir Welded Copper Joints from its Microstructures
by Akshansh Mishra et al

 
Craig Smith