2022.3.7 Vision papers

 

03-10-2022

Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time
by Mitchell Wortsman et al

03-09-2022

On the surprising tradeoff between ImageNet accuracy and perceptual similarity
by Manoj Kumar et al

03-08-2022

EdgeFormer: Improving Light-weight ConvNets by Learning from Vision Transformers
by Haokui Zhang et al

03-10-2022

Cluttered Food Grasping with Adaptive Fingers and Synthetic-Data Trained Object Detection
by Avinash Ummadisingu et al

03-11-2022

The Role of ImageNet Classes in Fréchet Inception Distance
by Tuomas Kynkäänniemi et al

03-10-2022

LoopITR: Combining Dual and Cross Encoder Architectures for Image-Text Retrieval
by Jie Lei et al

03-09-2022

Leveling Down in Computer Vision: Pareto Inefficiencies in Fair Deep Classifiers
by Dominik Zietlow et al

03-10-2022

Conditional Prompt Learning for Vision-Language Models
by Kaiyang Zhou et al

03-08-2022

Dynamic Dual-Output Diffusion Models
by Yaniv Benny et al

03-10-2022

BEAT: A Large-Scale Semantic and Emotional Multi-Modal Dataset for Conversational Gestures Synthesis
by Haiyang Liu et al

03-10-2022

StyleBabel: Artistic Style Tagging and Captioning
by Dan Ruta et al

03-08-2022

RC-MVSNet: Unsupervised Multi-View Stereo with Neural Rendering
by Di Chang et al

03-09-2022

NLX-GPT: A Model for Natural Language Explanations in Vision and Vision-Language Tasks
by Fawaz Sammani et al

03-10-2022

MORE: Multi-Order RElation Mining for Dense Captioning in 3D Scenes
by Yang Jiao et al

03-08-2022

ART-Point: Improving Rotation Robustness of Point Cloud Classifiers via Adversarial Rotation
by Robin Wang et al

03-10-2022

A Closer Look at Debiased Temporal Sentence Grounding in Videos: Dataset, Metric, and Approach
by Xiaohan Lan et al

03-10-2022

Iterative Corresponding Geometry: Fusing Region and Depth for Highly Efficient 3D Tracking of Textureless Objects
by Manuel Stoiber et al

03-08-2022

StyleHEAT: One-Shot High-Resolution Editable Talking Face Generation via Pretrained StyleGAN
by Fei Yin et al

03-11-2022

ActiveMLP: An MLP-like Architecture with Active Token Mixer
by Guoqiang Wei et al

03-10-2022

Knowledge-enriched Attention Network with Group-wise Semantic for Visual Storytelling
by Tengpeng Li et al

03-11-2022

FLAG: Flow-based 3D Avatar Generation from Sparse Observations
by Sadegh Aliakbarian et al

03-11-2022

Masked Visual Pre-training for Motor Control
by Tete Xiao et al

03-08-2022

Multi-Modal Mixup for Robust Fine-tuning
by Junhyuk So et al

03-08-2022

Tuning-free multi-coil compressed sensing MRI with Parallel Variable Density Approximate Message Passing (P-VDAMP)
by Charles Millard et al

03-08-2022

Visual-Language Navigation Pretraining via Prompt-based Environmental Self-exploration
by Xiwen Liang et al

03-09-2022

Pose Guided Multi-person Image Generation From Text
by Soon Yau Cheong et al

03-09-2022

A Unified Transformer Framework for Group-based Segmentation: Co-Segmentation, Co-Saliency Detection and Video Salient Object Detection
by Yukun Su et al

03-09-2022

FlexIT: Towards Flexible Semantic Image Translation
by Guillaume Couairon et al

03-08-2022

Semantic Distillation Guided Salient Object Detection
by Bo Xu et al

03-08-2022

Where Does the Performance Improvement Come From? - A Reproducibility Concern about Image-Text Retrieval
by Jun Rao et al

03-09-2022

Mapping global dynamics of benchmark creation and saturation in artificial intelligence
by Adriano Barbosa-Silva et al

03-10-2022

Hyperspectral Imaging for cherry tomato
by Yun Xiang et al

03-08-2022

On Generalizing Beyond Domains in Cross-Domain Continual Learning
by Christian Simon et al

03-08-2022

Analyzing General-Purpose Deep-Learning Detection and Segmentation Models with Images from a Lidar as a Camera Sensor
by Yu Xianjia et al

03-09-2022

Model-Agnostic Multitask Fine-tuning for Few-shot Vision-Language Transfer Learning
by Zhenhailong Wang et al

03-08-2022

Motron: Multimodal Probabilistic Human Motion Forecasting
by Tim Salzmann et al

03-09-2022

Coarse-to-Fine Sparse Transformer for Hyperspectral Image Reconstruction
by Jing Lin et al

03-11-2022

Democratizing Contrastive Language-Image Pre-training: A CLIP Benchmark of Data, Model, and Supervision
by Yufeng Cui et al

03-08-2022

Source-free Domain Adaptation for Multi-site and Lifespan Brain Skull Stripping
by Yunxiang Li et al

03-08-2022

Efficient and Accurate Hyperspectral Pansharpening Using 3D VolumeNet and 2.5D Texture Transfer
by Yinao Li et al

03-10-2022

Synopses of Movie Narratives: a Video-Language Dataset for Story Understanding
by Yidan Sun et al

03-08-2022

A New 27 Class Sign Language Dataset Collected from 173 Individuals
by Arda Mavi et al

03-09-2022

Low-light Image and Video Enhancement via Selective Manipulation of Chromaticity
by Sumit Shekhar et al

03-09-2022

Cross-modal Map Learning for Vision and Language Navigation
by Georgios Georgakis et al

03-08-2022

Breast cancer detection using artificial intelligence techniques: A systematic literature review
by Ali Bou Nassif et al

03-10-2022

Zero-Shot Action Recognition with Transformer-based Video Semantic Embedding
by Keval Doshi et al

03-10-2022

Online Deep Metric Learning via Mutual Distillation
by Gao-Dong Liu et al

03-09-2022

HDL: Hybrid Deep Learning for the Synthesis of Myocardial Velocity Maps in Digital Twins for Cardiac Analysis
by Xiaodan Xing et al

03-10-2022

Autofocusing+: Noise-Resilient Motion Correction in Magnetic Resonance Imaging
by Ekaterina Kuzmina et al

03-10-2022

An Audio-Visual Attention Based Multimodal Network for Fake Talking Face Videos Detection
by Ganglai Wang et al

03-09-2022

The Transitive Information Theory and its Application to Deep Generative Models
by Trung Ngo et al

03-10-2022

Toward Efficient Hyperspectral Image Processing inside Camera Pixels
by Gourav Datta et al

03-10-2022

Back to Reality: Weakly-supervised 3D Object Detection with Shape-guided Label Enhancement
by Xiuwei Xu et al

03-10-2022

ReF -- Rotation Equivariant Features for Local Feature Matching
by Abhishek Peri et al

03-08-2022

Understanding person identification via gait
by Simon Hanisch et al

03-10-2022

Representation Compensation Networks for Continual Semantic Segmentation
by Chang-Bin Zhang et al

03-10-2022

AGCN: Augmented Graph Convolutional Network for Lifelong Multi-label Image Recognition
by Kaile Du et al

03-09-2022

What Matters For Meta-Learning Vision Regression Tasks?
by Ning Gao et al

03-10-2022

Towards Less Constrained Macro-Neural Architecture Search
by Vasco Lopes et al

03-09-2022

Triangular Character Animation Sampling with Motion, Emotion, and Relation
by Yizhou Zhao et al

03-10-2022

Learning-based Localizability Estimation for Robust LiDAR Localization
by Julian Nubert et al

03-11-2022

Multi-modal Graph Learning for Disease Prediction
by Shuai Zheng et al

03-11-2022

Graph Neural Networks for Relational Inductive Bias in Vision-based Deep Reinforcement Learning of Robot Control
by Marco Oliva et al

03-09-2022

Adaptive Trajectory Prediction via Transferable GNN
by Yi Xu et al

03-08-2022

End-to-end Multiple Instance Learning with Gradient Accumulation
by Axel Andersson et al

03-08-2022

VoViT: Low Latency Graph-based Audio-Visual Voice Separation Transformer
by Juan F. Montesinos et al

03-09-2022

CEU-Net: Ensemble Semantic Segmentation of Hyperspectral Images Using Clustering
by Nicholas Soucy et al

03-11-2022

Flexible Amortized Variational Inference in qBOLD MRI
by Ivor J. A. Simpson et al

03-08-2022

Trustable Co-label Learning from Multiple Noisy Annotators
by Shikun Li et al

03-09-2022

Ray Tracing-Guided Design of Plenoptic Cameras
by Tim Michels et al

03-08-2022

Selective-Supervised Contrastive Learning with Noisy Labels
by Shikun Li et al

03-11-2022

WLASL-LEX: a Dataset for Recognising Phonological Properties in American Sign Language
by Federico Tavella et al

03-08-2022

Sharing Generative Models Instead of Private Data: A Simulation Study on Mammography Patch Classification
by Zuzanna Szafranowska et al

03-10-2022

Membership Privacy Protection for Image Translation Models via Adversarial Knowledge Distillation
by Saeed Ranjbar Alvar et al

03-08-2022

Easy Ensemble: Simple Deep Ensemble Learning for Sensor-Based Human Activity Recognition
by Tatsuhito Hasegawa et al

03-09-2022

A Tree-Structured Multi-Task Model Recommender
by Lijun Zhang et al

03-09-2022

Frequency-driven Imperceptible Adversarial Attack on Semantic Similarity
by Cheng Luo et al

03-11-2022

ROOD-MRI: Benchmarking the robustness of deep learning segmentation models to out-of-distribution and corrupted data in MRI
by Lyndon Boone et al

03-09-2022

A Neuro-vector-symbolic Architecture for Solving Raven's Progressive Matrices
by Michael Hersche et al

03-10-2022

Domain Generalization via Shuffled Style Assembly for Face Anti-Spoofing
by Zhuo Wang et al

03-10-2022

Suspected Object Matters: Rethinking Model's Prediction for One-stage Visual Grounding
by Yang Jiao et al

03-08-2022

The Flag Median and FlagIRLS
by Nathan Mankovich et al

03-09-2022

Anti-Oversmoothing in Deep Vision Transformers via the Fourier Domain Analysis: From Theory to Practice
by Peihao Wang et al

03-10-2022

A Survey of Surface Defect Detection of Industrial Products Based on A Small Number of Labeled Data
by Qifan Jin et al

03-10-2022

Prediction-Guided Distillation for Dense Object Detection
by Chenhongyi Yang et al

03-10-2022

EyeLoveGAN: Exploiting domain-shifts to boost network learning with cycleGANs
by Josefine Vilsbøll Sundgaard et al

03-08-2022

YouTube-GDD: A challenging gun detection dataset with rich contextual information
by Yongxiang Gu et al

03-10-2022

Information-Theoretic Odometry Learning
by Sen Zhang et al

03-08-2022

DeltaCNN: End-to-End CNN Inference of Sparse Frame Differences in Videos
by Mathias Parger et al

03-10-2022

Domain Generalisation for Object Detection
by Karthik Seemakurthy et al

03-09-2022

OpenTAL: Towards Open Set Temporal Action Localization
by Wentao Bao et al

03-10-2022

TrueType Transformer: Character and Font Style Recognition in Outline Format
by Yusuke Nagata et al

03-08-2022

ClearPose: Large-scale Transparent Object Dataset and Benchmark
by Xiaotong Chen et al

03-11-2022

Detection of multiple retinal diseases in ultra-widefield fundus images using deep learning: data-driven identification of relevant regions
by Justin Engelmann et al

03-09-2022

Practical Evaluation of Adversarial Robustness via Adaptive Auto Attack
by Ye Liu et al

03-10-2022

Learning Distinctive Margin toward Active Domain Adaptation
by Ming Xie et al

03-09-2022

Improving Neural ODEs via Knowledge Distillation
by Haoyu Chu et al

03-09-2022

Align-Deform-Subtract: An Interventional Framework for Explaining Object Differences
by Cian Eastwood et al

03-08-2022

A Gating Model for Bias Calibration in Generalized Zero-shot Learning
by Gukyeong Kwon et al

03-09-2022

Simulation of Plenoptic Cameras
by Tim Michels et al

03-09-2022

Inadequately Pre-trained Models are Better Feature Extractors
by Andong Deng et al

03-09-2022

Manifold Modeling in Quotient Space: Learning An Invariant Mapping with Decodability of Image Patches
by Tatsuya Yokota et al

03-10-2022

An Empirical Investigation of 3D Anomaly Detection and Segmentation
by Eliahu Horwitz et al

03-11-2022

Deep AutoAugment
by Yu Zheng et al

03-08-2022

Data augmentation with mixtures of max-entropy transformations for filling-level classification
by Apostolos Modas et al

03-08-2022

Dynamic Group Transformer: A General Vision Transformer Backbone with Dynamic Group Attention
by Kai Liu et al

03-10-2022

GrainSpace: A Large-scale Dataset for Fine-grained and Domain-adaptive Recognition of Cereal Grains
by Lei Fan et al

03-08-2022

Robust Multi-Task Learning and Online Refinement for Spacecraft Pose Estimation across Domain Gap
by Tae Ha Park et al

03-08-2022

Learning to Erase the Bayer-Filter to See in the Dark
by Xingbo Dong et al

03-08-2022

MICDIR: Multi-scale Inverse-consistent Deformable Image Registration using UNetMSS with Self-Constructing Graph Latent
by Soumick Chatterjee et al

03-11-2022

AI-enabled Automatic Multimodal Fusion of Cone-Beam CT and Intraoral Scans for Intelligent 3D Tooth-Bone Reconstruction and Clinical Applications
by Jin Hao et al

03-09-2022

Intention-aware Feature Propagation Network for Interactive Segmentation
by Chuyu Zhang et al

03-09-2022

Multiscale Convolutional Transformer with Center Mask Pretraining for Hyperspectral Image Classification
by Yifan Wang et al

03-10-2022

MVP: Multimodality-guided Visual Pre-training
by Longhui Wei et al

03-08-2022

Evolutionary Neural Cascade Search across Supernetworks
by Alexander Chebykin et al

03-10-2022

NeRFocus: Neural Radiance Field for 3D Synthetic Defocus
by Yinhuai Wang et al

03-08-2022

Mutual Contrastive Learning to Disentangle Whole Slide Image Representations for Glioma Grading
by Lipei Zhang et al

03-10-2022

QDrop: Randomly Dropping Quantization for Extremely Low-bit Post-Training Quantization
by Xiuying Wei et al

03-09-2022

Rethinking data-driven point spread function modeling with a differentiable optical model
by Tobias Liaudat et al

03-11-2022

PseudoProp: Robust Pseudo-Label Generation for Semi-Supervised Object Detection in Autonomous Driving Systems
by Shu Hu et al

03-11-2022

Deep Class Incremental Learning from Decentralized Data
by Xiaohan Zhang et al

03-10-2022

Crowd Source Scene Change Detection and Local Map Update
by Itzik Wilf et al

03-10-2022

Knowledge Distillation as Efficient Pre-training: Faster Convergence, Higher Data-efficiency, and Better Transferability
by Ruifei He et al

03-08-2022

Universal Prototype Transport for Zero-Shot Action Recognition and Localization
by Pascal Mettes

03-08-2022

Tune your Place Recognition: Self-Supervised Domain Calibration via Robust SLAM
by Pierre-Yves Lajoie et al

03-08-2022

DuMLP-Pin: A Dual-MLP-dot-product Permutation-invariant Network for Set Feature Extraction
by Jiajun Fei et al

03-11-2022

Active Phase-Encode Selection for Slice-Specific Fast MR Scanning Using a Transformer-Based Deep Reinforcement Learning Framework
by Yiming Liu et al

03-11-2022

Federated Remote Physiological Measurement with Imperfect Data
by Xin Liu et al

03-10-2022

A Screen-Shooting Resilient Document Image Watermarking Scheme using Deep Neural Network
by Sulong Ge et al

03-09-2022

Efficient Image Representation Learning with Federated Sampled Softmax
by Sagar M. Waghmare et al

03-08-2022

Generative Cooperative Learning for Unsupervised Video Anomaly Detection
by Muhammad Zaigham Zaheer et al

03-11-2022

Embedding Earth: Self-supervised contrastive pre-training for dense land cover classification
by Michail Tarasiou et al

03-10-2022

Evaluating U-net Brain Extraction for Multi-site and Longitudinal Preclinical Stroke Imaging
by Erendiz Tarakci et al

03-09-2022

Object-Based Visual Camera Pose Estimation From Ellipsoidal Model and 3D-Aware Ellipse Prediction
by Matthieu Zins et al

03-09-2022

Uni4Eye: Unified 2D and 3D Self-supervised Pre-training via Masked Image Modeling Transformer for Ophthalmic Image Classification
by Zhiyuan Cai et al

03-11-2022

TAPE: Task-Agnostic Prior Embedding for Image Restoration
by Lin Liu et al

03-09-2022

Domain Generalization using Pretrained Models without Fine-tuning
by Ziyue Li et al

03-08-2022

A Simple Multi-Modality Transfer Learning Baseline for Sign Language Translation
by Yutong Chen et al

03-08-2022

Quantification of Occlusion Handling Capability of a 3D Human Pose Estimation Framework
by Mehwish Ghafoor et al

03-10-2022

Annotation Efficient Person Re-Identification with Diverse Cluster-Based Pair Selection
by Lantian Xue et al

03-08-2022

End-to-End Semi-Supervised Learning for Video Action Detection
by Akash Kumar et al

03-08-2022

Gait Recognition with Mask-based Regularization
by Chuanfu Shen et al

03-10-2022

Spatial Commonsense Graph for Object Localisation in Partial Scenes
by Francesco Giuliari et al

03-08-2022

Skating-Mixer: Multimodal MLP for Scoring Figure Skating
by Jingfei Xia et al

03-10-2022

Non-generative Generalized Zero-shot Learning via Task-correlated Disentanglement and Controllable Samples Synthesis
by Yaogong Feng et al

03-08-2022

NeReF: Neural Refractive Field for Fluid Surface Reconstruction and Implicit Representation
by Ziyu Wang et al

03-10-2022

Towards Scale Consistent Monocular Visual Odometry by Learning from the Virtual World
by Sen Zhang et al

03-10-2022

Image-based Stroke Assessment for Multi-site Preclinical Evaluation of Cerebroprotectants
by Ryan P. Cabeen et al

03-08-2022

Attention-Based Lip Audio-Visual Synthesis for Talking Face Generation in the Wild
by Ganglai Wang et al

03-08-2022

Globally-Optimal Event Camera Motion Estimation
by Xin Peng et al

03-10-2022

Towards Bi-directional Skip Connections in Encoder-Decoder Architectures and Beyond
by Tiange Xiang et al

03-09-2022

Joint Learning of Salient Object Detection, Depth Estimation and Contour Extraction
by Xiaoqi Zhao et al

03-10-2022

Contrastive Boundary Learning for Point Cloud Segmentation
by Liyao Tang et al

03-08-2022

Contrastive Enhancement Using Latent Prototype for Few-Shot Segmentation
by Xiaoyu Zhao et al

03-09-2022

Attention-effective multiple instance learning on weakly stem cell colony segmentation
by Novanto Yudistira et al

03-08-2022

Probabilistic Warp Consistency for Weakly-Supervised Semantic Correspondences
by Prune Truong et al

03-10-2022

Geometric Synthesis: A Free lunch for Large-scale Palmprint Recognition Model Pretraining
by Kai Zhao et al

03-08-2022

Part-level Action Parsing via a Pose-guided Coarse-to-Fine Framework
by Xiaodong Chen et al

03-08-2022

Image Steganography based on Style Transfer
by Donghui Hu et al

03-08-2022

An Efficient Polyp Segmentation Network
by Tugberk Erol et al

03-09-2022

CMX: Cross-Modal Fusion for RGB-X Semantic Segmentation with Transformers
by Huayao Liu et al

03-08-2022

SimpleTrack: Rethinking and Improving the JDE Approach for Multi-Object Tracking
by Jiaxin Li et al

03-10-2022

Real-time Scene Text Detection Based on Global Level and Word Level Features
by Fuqiang Zhao et al

03-11-2022

Automatic Fine-grained Glomerular Lesion Recognition in Kidney Pathology
by Yang Nan et al

03-09-2022

Normal and Visibility Estimation of Human Face from a Single Image
by Fuzhi Zhong et al

03-09-2022

Semi-supervision semantic segmentation with uncertainty-guided self cross supervision
by Yunyang Zhang et al

03-09-2022

MetAug: Contrastive Learning via Meta Feature Augmentation
by Jiangmeng Li et al

03-08-2022

The Combinatorial Brain Surgeon: Pruning Weights That Cancel One Another in Neural Networks
by Xin Yu et al

03-10-2022

Deep Learning-Based Perceptual Stimulus Encoder for Bionic Vision
by Lucas Relic et al

03-10-2022

SelfTune: Metrically Scaled Monocular Depth Estimation through Self-Supervised Learning
by Jaehoon Choi et al

03-08-2022

Discriminability-Transferability Trade-Off: An Information-Theoretic Perspective
by Quan Cui et al

03-09-2022

Learning the Degradation Distribution for Blind Image Super-Resolution
by Zhengxiong Luo et al

03-09-2022

DEER: Detection-agnostic End-to-End Recognizer for Scene Text Spotting
by Seonghyeon Kim et al

03-08-2022

E2EC: An End-to-End Contour-based Method for High-Quality High-Speed Instance Segmentation
by Tao Zhang et al

03-08-2022

Part-Aware Self-Supervised Pre-Training for Person Re-Identification
by Kuan Zhu et al

03-08-2022

MLSeg: Image and Video Segmentation as Multi-Label Classification and Selected-Label Pixel Classification
by Haodi He et al

03-09-2022

Structure-Aware Flow Generation for Human Body Reshaping
by Jianqiang Ren et al

03-11-2022

Multi-sensor large-scale dataset for multi-view 3D reconstruction
by Oleg Voynov et al

03-10-2022

Backbone is All Your Need: A Simplified Architecture for Visual Object Tracking
by Boyu Chen et al

03-08-2022

CIDER: Exploiting Hyperspherical Embeddings for Out-of-Distribution Detection
by Yifei Ming et al

03-08-2022

Counting with Adaptive Auxiliary Learning
by Yanda Meng et al

03-08-2022

Lightweight Monocular Depth Estimation through Guided Decoding
by Michael Rudolph et al

03-08-2022

AssistQ: Affordance-centric Question-driven Task Completion for Egocentric Assistant
by Benita Wong et al

03-10-2022

Transfer of Representations to Video Label Propagation: Implementation Factors Matter
by Daniel McKee et al

03-09-2022

Fast Road Segmentation via Uncertainty-aware Symmetric Network
by Yicong Chang et al

03-08-2022

Robust Local Preserving and Global Aligning Network for Adversarial Domain Adaptation
by Wenwen Qiang et al

03-08-2022

BEVSegFormer: Bird's Eye View Semantic Segmentation From Arbitrary Camera Rigs
by Lang Peng et al

03-08-2022

UENAS: A Unified Evolution-based NAS Framework
by Zimian Wei et al

03-08-2022

A Lightweight and Detector-free 3D Single Object Tracker on Point Clouds
by Yan Xia et al

03-11-2022

PD-Flow: A Point Cloud Denoising Framework with Normalizing Flows
by Aihua Mao et al

03-11-2022

Peng Cheng Object Detection Benchmark for Smart City
by Yaowei Wang et al

03-08-2022

GaitStrip: Gait Recognition via Effective Strip-based Feature Representations and Multi-Level Framework
by Ming Wang et al

03-10-2022

6-DoF Pose Estimation of Household Objects for Robotic Manipulation: An Accessible Dataset and Benchmark
by Stephen Tyree et al

03-08-2022

Graph Attention Transformer Network for Multi-Label Image Classification
by Jin Yuan et al

03-08-2022

Shape-invariant 3D Adversarial Point Clouds
by Qidong Huang et al

03-08-2022

PyNET-QxQ: A Distilled PyNET for QxQ Bayer Pattern Demosaicing in CMOS Image Sensor
by Minhyeok Cho et al

03-08-2022

Neural Face Identification in a 2D Wireframe Projection of a Manifold Object
by Kehan Wang et al

03-08-2022

Towards Universal Texture Synthesis by Combining Texton Broadcasting with Noise Injection in StyleGAN-2
by Jue Lin et al

03-10-2022

Transferring Dual Stochastic Graph Convolutional Network for Facial Micro-expression Recognition
by Hui Tang et al

03-09-2022

SkinningNet: Two-Stream Graph Convolutional Neural Network for Skinning Prediction of Synthetic Characters
by Albert Mosella-Montoro et al

03-08-2022

Stage-Aware Feature Alignment Network for Real-Time Semantic Segmentation of Street Scenes
by Xi Weng et al

03-09-2022

Neural Data-Dependent Transform for Learned Image Compression
by Dezhao Wang et al

03-10-2022

Attack Analysis of Face Recognition Authentication Systems Using Fast Gradient Sign Method
by Arbena Musa et al

03-09-2022

NeRF-Pose: A First-Reconstruct-Then-Regress Approach for Weakly-supervised 6D Object Pose Estimation
by Fu Li et al

03-11-2022

Physics-informed Reinforcement Learning for Perception and Reasoning about Fluids
by Beatriz Moya et al

03-10-2022

Dual-Domain Reconstruction Networks with V-Net and K-Net for fast MRI
by Xiaohan Liu et al

03-10-2022

Towards Open-Set Text Recognition via Label-to-Prototype Learning
by Chang Liu et al

03-10-2022

Temporal Context for Robust Maritime Obstacle Detection
by Lojze Žust et al

03-10-2022

Adaptive Background Matting Using Background Matching
by Jinlin Liu

03-10-2022

Two-stream Hierarchical Similarity Reasoning for Image-text Matching
by Ran Chen et al

03-08-2022

GaitEdge: Beyond Plain End-to-end Gait Recognition for Better Practicality
by Junhao Liang et al

03-11-2022

WiCV 2021: The Eighth Women In Computer Vision Workshop
by Arushi Goel et al

03-09-2022

Creating Realistic Ground Truth Data for the Evaluation of Calibration Methods for Plenoptic and Conventional Cameras
by Tim Michels et al

03-09-2022

Using Human Gaze For Surgical Activity Recognition
by Abdishakour Awale et al

03-11-2022

LFW-Beautified: A Dataset of Face Images with Beautification and Augmented Reality Filters
by Pontus Hedman et al

03-10-2022

PETR: Position Embedding Transformation for Multi-View 3D Object Detection
by Yingfei Liu et al

03-09-2022

Resource-Efficient Invariant Networks: Exponential Gains by Unrolled Optimization
by Sam Buchanan et al

03-08-2022

An Online Semantic Mapping System for Extending and Enhancing Visual SLAM
by Thorsten Hempel et al

03-10-2022

The Overlooked Classifier in Human-Object Interaction Recognition
by Ying Jin et al

03-09-2022

Practical No-box Adversarial Attacks with Training-free Hybrid Image Transformation
by Qilong Zhang et al

03-11-2022

Hyperbolic Image Segmentation
by Mina GhadimiAtigh et al

03-11-2022

DRTAM: Dual Rank-1 Tensor Attention Module
by Hanxing Chi et al

03-11-2022

Spatial Consistency Loss for Training Multi-Label Classifiers from Single-Label Annotations
by Thomas Verelst et al

03-08-2022

Lane Detection with Versatile AtrousFormer and Local Semantic Guidance
by Jiaxing Yang et al

03-10-2022

Point Density-Aware Voxels for LiDAR 3D Object Detection
by Jordan S. K. Hu et al

03-09-2022

Defending Black-box Skeleton-based Human Activity Classifiers
by He Wang et al

03-09-2022

Controllable Evaluation and Generation of Physical Adversarial Patch on Face Recognition
by Xiao Yang et al

03-08-2022

Language Matters: A Weakly Supervised Pre-training Approach for Scene Text Detection and Spotting
by Chuhui Xue et al

03-09-2022

VGQ-CNN: Moving Beyond Fixed Cameras and Top-Grasps for Grasp Quality Prediction
by A. Konrad et al

03-08-2022

3SD: Self-Supervised Saliency Detection With No Labels
by Rajeev Yasarla et al

03-09-2022

Evaluating Proposed Fairness Models for Face Recognition Algorithms
by John J. Howard et al

03-09-2022

SynWoodScape: Synthetic Surround-view Fisheye Camera Dataset for Autonomous Driving
by Ahmed Rida Sekkat et al

03-08-2022

Weakly Supervised Semantic Segmentation using Out-of-Distribution Data
by Jungbeom Lee et al

03-08-2022

Boosting Mask R-CNN Performance for Long, Thin Forensic Traces with Pre-Segmentation and IoU Region Merging
by Moritz Zink et al

03-08-2022

Semi-Supervised Semantic Segmentation Using Unreliable Pseudo-Labels
by Yuchao Wang et al

03-10-2022

High Definition, Inexpensive, Underwater Mapping
by Bharat Joshi et al

03-09-2022

3D Dense Face Alignment with Fused Features by Aggregating CNNs and GCNs
by Yanda Meng et al

03-08-2022

Autonomous Mosquito Habitat Detection Using Satellite Imagery and Convolutional Neural Networks for Disease Risk Mapping
by Sriram Elango et al

03-10-2022

Gesture based Arabic Sign Language Recognition for Impaired People based on Convolution Neural Network
by Rady El Rwelli et al

03-10-2022

Human Face Recognition from Part of a Facial Image based on Image Stitching
by Osama R. Shahin et al

03-09-2022

Dynamic Instance Domain Adaptation
by Zhongying Deng et al

03-08-2022

Update Compression for Deep Neural Networks on the Edge
by Bo Chen et al

03-08-2022

Visual anomaly detection in video by variational autoencoder
by Faraz Waseem et al

03-10-2022

City-wide Street-to-Satellite Image Geolocalization of a Mobile Ground Agent
by Lena M. Downes et al

03-11-2022

Visualizing and Understanding Patch Interactions in Vision Transformer
by Jie Ma et al

03-11-2022

TFCNet: Temporal Fully Connected Networks for Static Unbiased Temporal Reasoning
by Shiwen Zhang

03-08-2022

Probabilistic Rotation Representation With an Efficiently Computable Bingham Loss Function and Its Application to Pose Estimation
by Hiroya Sato et al

03-11-2022

aiWave: Volumetric Image Compression with 3-D Trained Affine Wavelet-like Transform
by Dongmei Xue et al

03-08-2022

Self-Supervision, Remote Sensing and Abstraction: Representation Learning Across 3 Million Locations
by Sachith Seneviratne et al

03-08-2022

Pointillism: Accurate 3D bounding box estimation with multi-radars
by Kshitiz Bansal et al

03-09-2022

Monocular Depth Distribution Alignment with Low Computation
by Fei Sheng et al

03-08-2022

Unrolled Primal-Dual Networks for Lensless Cameras
by Oliver Kingshott et al

03-11-2022

REX: Reasoning-aware and Grounded Explanation
by Shi Chen et al

03-08-2022

Diffusion Models for Medical Anomaly Detection
by Julia Wolleb et al

03-08-2022

SuperPoint features in endoscopy
by O. L. Barbed et al

03-11-2022

Neuromorphic Data Augmentation for Training Spiking Neural Networks
by Yuhang Li et al

03-11-2022

Font Shape-to-Impression Translation
by Masaya Ueda et al

03-09-2022

A high-precision underwater object detection based on joint self-supervised deblurring and improved spatial transformer network
by Xiuyuan Li et al

03-08-2022

Multi-Scale Adaptive Network for Single Image Denoising
by Yuanbiao Gou et al

03-10-2022

Self Pre-training with Masked Autoencoders for Medical Image Analysis
by Lei Zhou et al

03-11-2022

Towards Self-Supervised Learning of Global and Object-Centric Representations
by Federico Baldassarre et al

03-09-2022

Optical Flow Training under Limited Label Budget via Active Learning
by Shuai Yuan et al

03-08-2022

Deep Multi-Branch Aggregation Network for Real-Time Semantic Segmentation in Street Scenes
by Xi Weng et al

03-11-2022

Democracy Does Matter: Comprehensive Feature Mining for Co-Salient Object Detection
by Siyue Yu et al

03-09-2022

How many Observations are Enough? Knowledge Distillation for Trajectory Forecasting
by Alessio Monti et al

03-09-2022

Metastatic Cancer Outcome Prediction with Injective Multiple Instance Pooling
by Jianan Chen et al

03-09-2022

UNeXt: MLP-based Rapid Medical Image Segmentation Network
by Jeya Maria Jose Valanarasu et al

03-09-2022

MLNav: Learning to Safely Navigate on Martian Terrains
by Shreyansh Daftry et al

03-09-2022

All You Need is LUV: Unsupervised Collection of Labeled Images using Invisible UV Fluorescent Indicators
by Brijen Thananjeyan et al

03-11-2022

Improve Convolutional Neural Network Pruning by Maximizing Filter Variety
by Nathan Hubens et al

03-08-2022

Live Laparoscopic Video Retrieval with Compressed Uncertainty
by Tong Yu et al

03-09-2022

Artificial Intelligence Solution for Effective Treatment Planning for Glioblastoma Patients
by Vikram Goddla

03-11-2022

Video Coding for Machines with Feature-Based Rate-Distortion Optimization
by Kristian Fischer et al

03-09-2022

Evaluation of YOLO Models with Sliced Inference for Small Object Detection
by Muhammed Can Keles et al

03-09-2022

CP-ViT: Cascade Vision Transformer Pruning via Progressive Sparsity Prediction
by Zhuoran Song et al

03-10-2022

PC-SwinMorph: Patch Representation for Unsupervised Medical Image Registration and Segmentation
by Lihao Liu et al

03-10-2022

Leveraging Labeling Representations in Uncertainty-based Semi-supervised Segmentation
by Sukesh Adiga et al

03-10-2022

Deep Multimodal Guidance for Medical Image Classification
by Mayur Mallya et al

03-10-2022

Deep Convolutional Neural Networks for Molecular Subtyping of Gliomas Using Magnetic Resonance Imaging
by Dong Wei et al

03-08-2022

End-to-end system for object detection from sub-sampled radar data
by Madhumitha Sakthi et al

03-09-2022

Multi-modal Brain Tumor Segmentation via Missing Modality Synthesis and Modality-level Attention Fusion
by Ziqi Huang et al

03-08-2022

Predicting conversion of mild cognitive impairment to Alzheimer's disease
by Yiran Wei et al

03-09-2022

PHTrans: Parallelly Aggregating Global and Local Representations for Medical Image Segmentation
by Wentao Liu et al

03-08-2022

An error correction scheme for improved air-tissue boundary in real-time MRI video for speech production
by Anwesha Roy et al

03-11-2022

BabyNet: Reconstructing 3D faces of babies from uncalibrated photographs
by Araceli Morales et al

03-11-2022

Saliency-Driven Versatile Video Coding for Neural Object Detection
by Kristian Fischer et al

03-09-2022

ChiTransformer: Towards Reliable Stereo from Cues
by Qing Su et al

03-09-2022

Learning Temporal Consistency for Source-Free Video Domain Adaptation
by Yuecong Xu et al

03-09-2022

A high-precision self-supervised monocular visual odometry in foggy weather based on robust cycled generative adversarial networks and multi-task learning aided depth estimation
by Xiuyuan Li et al

03-09-2022

Text-DIAE: Degradation Invariant Autoencoders for Text Recognition and Document Enhancement
by Mohamed Ali Souibgui et al

03-09-2022

Recovering medical images from CT film photos
by Quan Quan et al

03-09-2022

Active Self-Semi-Supervised Learning for Few Labeled Samples Fast Training
by Ziting Wen et al

03-09-2022

Region-Aware Face Swapping
by Chao Xu et al

03-11-2022

Human Silhouette and Skeleton Video Synthesis through Wi-Fi signals
by Danilo Avola et al

03-10-2022

On-the-Fly Test-time Adaptation for Medical Image Segmentation
by Jeya Maria Jose Valanarasu et al

03-10-2022

Unfolded Deep Kernel Estimation for Blind Image Super-resolution
by Hongyi Zheng et al

03-10-2022

Multi-Channel Convolutional Analysis Operator Learning for Dual-Energy CT Reconstruction
by Alessandro Perelli et al

03-10-2022

Label-efficient Hybrid-supervised Learning for Medical Image Segmentation
by Junwen Pan et al

03-09-2022

LiftReg: Limited Angle 2D/3D Deformable Registration
by Lin Tian et al

 
Craig Smith