2022.3.28 Vision papers

03-24-2022	Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors by Oran Gafni et al
03-23-2022	R3M: A Universal Visual Representation for Robot Manipulation by Suraj Nair et al
03-22-2022	GradViT: Gradient Inversion of Vision Transformers by Ali Hatamizadeh et al
03-23-2022	Learning to generate line drawings that convey geometry and semantics by Caroline Chan et al
03-22-2022	Focal Modulation Networks by Jianwei Yang et al
03-22-2022	Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future Directions by Jing Gu et al
03-24-2022	Neural Neighbor Style Transfer by Nicholas Kolkin et al
03-24-2022	SharpContour: A Contour-based Boundary Refinement Approach for Efficient and Accurate Instance Segmentation by Chenming Zhu et al
03-23-2022	Learning Motion-Dependent Appearance for High-Fidelity Rendering of Dynamic Humans from a Single Camera by Jae Shin Yoon et al
03-23-2022	Revisiting Multi-Scale Feature Fusion for Semantic Segmentation by Tianjian Meng et al
03-22-2022	Self-supervision through Random Segments with Autoregressive Coding (RandSAC) by Tianyu Hua et al
03-22-2022	WuDaoMM: A large-scale Multi-Modal Dataset for Pre-training models by Sha Yuan et al
03-22-2022	Dataset Distillation by Matching Training Trajectories by George Cazenavette et al
03-23-2022	How Do You Do It? Fine-Grained Action Understanding with Pseudo-Adverbs by Hazel Doughty et al
03-24-2022	Hierarchical Nearest Neighbor Graph Embedding for Efficient Dimensionality Reduction by M. Saquib Sarfraz et al
03-24-2022	Text to Mesh Without 3D Supervision Using Limit Subdivision by Nasir Khalid et al
03-24-2022	NPBG++: Accelerating Neural Point-Based Graphics by Ruslan Rakhimov et al
03-22-2022	Open-Vocabulary DETR with Conditional Matching by Yuhang Zang et al
03-24-2022	Bailando: 3D Dance Generation by Actor-Critic GPT with Choreographic Memory by Li Siyao et al
03-22-2022	Visual Prompt Tuning by Menglin Jia et al
03-23-2022	NeuMan: Neural Human Radiance Field from a Single Video by Wei Jiang et al
03-22-2022	Improving Generalization in Federated Learning by Seeking Flat Minima by Debora Caldarola et al
03-22-2022	Generating natural images with direct Patch Distributions Matching by Ariel Elnekave et al
03-23-2022	VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training by Zhan Tong et al
03-23-2022	Industrial Style Transfer with Large-scale Geometric Warping and Content Preservation by Jinchao Yang et al
03-24-2022	Is Geometry Enough for Matching in Visual Localization? by Qunjie Zhou et al
03-24-2022	Learning Dense Correspondence from Synthetic Environments by Mithun Lal et al
03-24-2022	BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-training by Likun Cai et al
03-22-2022	Practical Stereo Matching via Cascaded Recurrent Network with Adaptive Correlation by Jiankun Li et al
03-22-2022	A Prompt Array Keeps the Bias Away: Debiasing Vision-Language Models with Adversarial Learning by Hugo Berg et al
03-22-2022	A Real-time Junk Food Recognition System based on Machine Learning by Sirajum Munira Shifat et al
03-25-2022	Efficient-VDVAE: Less is more by Louay Hazami et al
03-24-2022	Global Tracking Transformers by Xingyi Zhou et al
03-25-2022	3D GAN Inversion for Controllable Portrait Image Animation by Connor Z. Lin et al
03-24-2022	Pastiche Master: Exemplar-Based High-Resolution Portrait Style Transfer by Shuai Yang et al
03-23-2022	Interpretable Prediction of Lung Squamous Cell Carcinoma Recurrence With Self-supervised Learning by Weicheng Zhu et al
03-25-2022	AutoAvatar: Autoregressive Neural Fields for Dynamic Avatar Modeling by Ziqian Bai et al
03-23-2022	When Accuracy Meets Privacy: Two-Stage Federated Transfer Learning Framework in Classification of Medical Images on Limited Data: A COVID-19 Case Study by Alexandros Shikun Zhang et al
03-24-2022	CVF-SID: Cyclic multi-Variate Function for Self-Supervised Image Denoising by Disentangling Noise from Image by Reyhaneh Neshatavar et al
03-24-2022	Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation by Xian Liu et al
03-23-2022	Maximum Spatial Perturbation Consistency for Unpaired Image-to-Image Translation by Yanwu Xu et al
03-22-2022	HOP: History-and-Order Aware Pre-training for Vision-and-Language Navigation by Yanyuan Qiao et al
03-22-2022	CM-GAN: Image Inpainting with Cascaded Modulation GAN and Object-Aware Training by Haitian Zheng et al
03-23-2022	Random Forest Regression for continuous affect using Facial Action Units by Saurabh Hinduja et al
03-24-2022	Beyond Fixation: Dynamic Window Visual Transformer by Pengzhen Ren et al
03-23-2022	Physics-Driven Deep Learning for Computational Magnetic Resonance Imaging by Kerstin Hammernik et al
03-23-2022	Evaluation of Non-Invasive Thermal Imaging for detection of Viability of Onchocerciasis worms by Ronak Dedhiya et al
03-22-2022	Enabling faster and more reliable sonographic assessment of gestational age through machine learning by Chace Lee et al
03-22-2022	Lymphocyte Classification in Hyperspectral Images of Ovarian Cancer Tissue Biopsy Samples by Benjamin Paulson et al
03-23-2022	MR Image Denoising and Super-Resolution Using Regularized Reverse Diffusion by Hyungjin Chung et al
03-24-2022	NPC: Neuron Path Coverage via Characterizing Decision Logic of Deep Neural Networks by Xiaofei Xie et al
03-22-2022	Learning from All Vehicles by Dian Chen et al
03-22-2022	IDEA-Net: Dynamic 3D Point Cloud Interpolation via Deep Embedding Alignment by Yiming Zeng et al
03-24-2022	Open-set Recognition via Augmentation-based Similarity Learning by Sepideh Esmaeilpour et al
03-24-2022	A Representation Separation Perspective to Correspondences-free Unsupervised 3D Point Cloud Registration by Zhiyuan Zhang et al
03-23-2022	GriTS: Grid table similarity metric for table structure recognition by Brandon Smock et al
03-24-2022	Interpretable Prediction of Pulmonary Hypertension in Newborns using Echocardiograms by Hanna Ragnarsdottir et al
03-24-2022	EPro-PnP: Generalized End-to-End Probabilistic Perspective-n-Points for Monocular Object Pose Estimation by Hansheng Chen et al
03-24-2022	Dexterous Imitation Made Easy: A Learning-Based Framework for Efficient Dexterous Manipulation by Sridhar Pandian Arunachalam et al
03-24-2022	A Deep-Discrete Learning Framework for Spherical Surface Registration by Mohamed A. Suliman et al
03-24-2022	Direct evaluation of progression or regression of disease burden in brain metastatic disease with Deep Neuroevolution by Joseph Stember et al
03-24-2022	RayTran: 3D pose estimation and shape reconstruction of multiple objects from videos with ray-traced transformers by Michał J. Tyszkiewicz et al
03-23-2022	The Challenges of Continuous Self-Supervised Learning by Senthil Purushwalkam et al
03-22-2022	PlaneMVS: 3D Plane Reconstruction from Multi-View Stereo by Jiachen Liu et al
03-22-2022	Convolutional Neural Network to Restore Low-Dose Digital Breast Tomosynthesis Projections in a Variance Stabilization Domain by Rodrigo de Barros Vimieiro et al
03-24-2022	Towards Exemplar-Free Continual Learning in Vision Transformers: an Account of Attention, Functional and Weight Regularization by Francesco Pelosin et al
03-22-2022	Was that so hard? Estimating human classification difficulty by Morten Rieger Hannemose et al
03-22-2022	Fast on-line signature recognition based on VQ with time modeling by Juan-Manuel Pascual-Gaspar et al
03-23-2022	Self-Supervised Robust Scene Flow Estimation via the Alignment of Probability Density Functions by Pan He et al
03-22-2022	φ-SfT: Shape-from-Template with a Physics-Based Deformation Model by Navami Kairanda et al
03-23-2022	Towards Semi-Supervised Deep Facial Expression Recognition with An Adaptive Confidence Margin by Hangyu Li et al
03-23-2022	Binary Morphological Neural Network by Theodore Aouad et al
03-23-2022	A Deep Learning Framework to Reconstruct Face under Mask by Gourango Modak et al
03-22-2022	Meta-attention for ViT-backed Continual Learning by Mengqi Xue et al
03-24-2022	VRNet: Learning the Rectified Virtual Corresponding Points for 3D Point Cloud Registration by Zhiyuan Zhang et al
03-23-2022	Enhancing Classifier Conservativeness and Robustness by Polynomiality by Ziqi Wang et al
03-23-2022	Learning Scene Flow in 3D Point Clouds with Noisy Pseudo Labels by Bing Li et al
03-23-2022	Improving the Fairness of Chest X-ray Classifiers by Haoran Zhang et al
03-23-2022	Biceph-Net: A robust and lightweight framework for the diagnosis of Alzheimer's disease using 2D-MRI scans and deep similarity learning by A. H. Rashid et al
03-23-2022	Learning to Censor by Noisy Sampling by Ayush Chopra et al
03-24-2022	Multitask Emotion Recognition Model with Knowledge Distillation and Task Discriminator by Euiseok Jeong et al
03-22-2022	Look for the Change: Learning Object States and State-Modifying Actions from Untrimmed Web Videos by Tomáš Souček et al
03-24-2022	Multi-modal Emotion Estimation for in-the-wild Videos by Liyu Meng et al
03-24-2022	Coarse-to-Fine Cascaded Networks with Smooth Predicting for Video Facial Expression Recognition by Fanglei Xue et al
03-23-2022	Negative Selection by Clustering for Contrastive Learning in Human Activity Recognition by Jinqiang Wang et al
03-23-2022	A Hybrid Mesh-neural Representation for 3D Transparent Object Reconstruction by Jiamin Xu et al
03-24-2022	IA-FaceS: A Bidirectional Method for Semantic Face Editing by Wenjing Huang et al
03-25-2022	Chaos is a Ladder: A New Theoretical Understanding of Contrastive Learning via Augmentation Overlap by Yifei Wang et al
03-23-2022	Cell segmentation from telecentric bright-field transmitted light microscopic images using a Residual Attention U-Net: a case study on HeLa line by Ali Ghaznavi et al
03-23-2022	Q-FW: A Hybrid Classical-Quantum Frank-Wolfe for Quadratic Binary Optimization by Alp Yurtsever et al
03-22-2022	Generative Modeling Helps Weak Supervision (and Vice Versa) by Benedikt Boecking et al
03-24-2022	FitCLIP: Refining Large-Scale Pretrained Image-Text Models for Zero-Shot Video Understanding Tasks by Santiago Castro et al
03-23-2022	Sparse Instance Activation for Real-Time Instance Segmentation by Tianheng Cheng et al
03-22-2022	Pixel VQ-VAEs for Improved Pixel Art Representation by Akash Saravanan et al
03-24-2022	Facial Expression Recognition based on Multi-head Cross Attention Network by Jae-Yeop Jeong et al
03-23-2022	Computed Tomography Reconstruction using Generative Energy-Based Priors by Martin Zach et al
03-24-2022	Feature visualization for convolutional neural network models trained on neuroimaging data by Fabian Eitel et al
03-22-2022	A Broad Study of Pre-training for Domain Generalization and Adaptation by Donghyun Kim et al
03-23-2022	Unsupervised Simultaneous Learning for Camera Re-Localization and Depth Estimation from Video by Shun Taguchi et al
03-23-2022	A Method of Data Augmentation to Train a Small Area Fingerprint Recognition Deep Neural Network with a Normal Fingerprint Database by JuSong Kim
03-22-2022	Improving Neural Predictivity in the Visual Cortex with Gated Recurrent Connections by Simone Azeglio et al
03-23-2022	StructToken: Rethinking Semantic Segmentation with Structural Prior by Fangjian Lin et al
03-22-2022	Weakly-Supervised Salient Object Detection Using Point Supervision by Shuyong Gao et al
03-24-2022	Transformer Compressed Sensing via Global Image Tokens by Marlon Bran Lorenzana et al
03-22-2022	Channel Self-Supervision for Online Knowledge Distillation by Shixiao Fan et al
03-22-2022	A Novel Framework for Assessment of Learning-based Detectors in Realistic Conditions with Application to Deepfake Detection by Yuhang Lu et al
03-23-2022	SMEMO: Social Memory for Trajectory Forecasting by Francesco Marchetti et al
03-22-2022	AP-BSN: Self-Supervised Denoising for Real-World Images via Asymmetric PD and Blind-Spot Network by Wooseok Lee et al
03-24-2022	RNNPose: Recurrent 6-DoF Object Pose Refinement with Robust Correspondence Field Estimation and Pose Optimization by Yan Xu et al
03-23-2022	Event-Based Dense Reconstruction Pipeline by Kun Xiao et al
03-22-2022	Fine-Grained Scene Graph Generation with Data Transfer by Ao Zhang et al
03-23-2022	Activation-Based Sampling for Pixel- to Image-Level Aggregation in Weakly-Supervised Segmentation by Arvi Jonnarth et al
03-25-2022	Polarization Multiplexed Diffractive Computing: All-Optical Implementation of a Group of Linear Transformations Through a Polarization-Encoded Diffractive Network by Jingxi Li et al
03-24-2022	Deep learning for laboratory earthquake prediction and autoregressive forecasting of fault zone stress by Laura Laurenti et al
03-24-2022	Compound Domain Generalization via Meta-Knowledge Encoding by Chaoqi Chen et al
03-23-2022	Deep Frequency Filtering for Domain Generalization by Shiqi Lin et al
03-23-2022	Refine-Net: Normal Refinement Neural Network for Noisy Point Clouds by Haoran Zhou et al
03-22-2022	Multi-layer Clustering-based Residual Sparsifying Transform for Low-dose CT Image Reconstruction by Xikai Yang et al
03-25-2022	On the performance of preconditioned methods to solve Lp-norm phase unwrapping by Ricardo Legarda-Saenz et al
03-24-2022	Learning Disentangled Representation for One-shot Progressive Face Swapping by Qi Li et al
03-24-2022	R-DFCIL: Relation-Guided Representation Learning for Data-Free Class Incremental Learning by Qiankun Gao et al
03-25-2022	A Unified Contrastive Energy-based Model for Understanding the Generative Ability of Adversarial Training by Yifei Wang et al
03-25-2022	CNN LEGO: Disassembling and Assembling Convolutional Neural Network by Jiacong Hu et al
03-22-2022	Reinforcement-based frugal learning for satellite image change detection by Sebastien Deschamps et al
03-24-2022	Egocentric Prediction of Action Target in 3D by Yiming Li et al
03-24-2022	Facial Action Unit Recognition With Multi-models Ensembling by Wenqiang Jiang et al
03-24-2022	SIFT and SURF based feature extraction for the anomaly detection by Simon Bilik et al
03-22-2022	Deep Portrait Delighting by Joshua Weir et al
03-24-2022	Video Instance Segmentation via Multi-scale Spatio-temporal Split Attention Transformer by Omkar Thawakar et al
03-22-2022	Semi-Supervised Hybrid Spine Network for Segmentation of Spine MR Images by Meiyan Huang et al
03-23-2022	MT-UDA: Towards Unsupervised Cross-modality Medical Image Segmentation with Limited Source Labels by Ziyuan Zhao et al
03-24-2022	Moving Window Regression: A Novel Approach to Ordinal Regression by Nyeong-Ho Shin et al
03-23-2022	A Multi-Characteristic Learning Method with Micro-Doppler Signatures for Pedestrian Identification by Yu Xiang et al
03-23-2022	U-Boost NAS: Utilization-Boosted Differentiable Neural Architecture Search by Ahmet Caner Yüzügüler et al
03-22-2022	Mask Usage Recognition using Vision Transformer with Transfer Learning and Data Augmentation by Hensel Donato Jahja et al
03-24-2022	DyRep: Bootstrapping Training with Dynamic Re-parameterization by Tao Huang et al
03-24-2022	Compositional Temporal Grounding with Structured Variational Cross-Graph Correspondence Learning by Juncheng Li et al
03-24-2022	A Simulation Benchmark for Vision-based Autonomous Navigation by Lauri Suomela et al
03-22-2022	A New Approach to Improve Learning-based Deepfake Detection in Realistic Conditions by Yuhang Lu et al
03-24-2022	Physics-based Learning of Parameterized Thermodynamics from Real-time Thermography by Hamza El-Kebir et al
03-22-2022	CP2: Copy-Paste Contrastive Pretraining for Semantic Segmentation by Feng Wang et al
03-24-2022	AziNorm: Exploiting the Radial Symmetry of Point Cloud for Azimuth-Normalized 3D Perception by Shaoyu Chen et al
03-24-2022	Focus-and-Detect: A Small Object Detection Framework for Aerial Images by Onur Can Koyun et al
03-24-2022	A Preliminary Research on Space Situational Awareness Based on Event Cameras by Kun Xiao et al

03-24-2022	Steganalysis of Image with Adaptively Parametric Activation by Hai Su et al
03-24-2022	Self-supervised Video-centralised Transformer for Video Face Clustering by Yujiang Wang et al
03-23-2022	Self-supervised Learning of Adversarial Example: Towards Good Generalizations for Deepfake Detection by Liang Chen et al
03-24-2022	A Perturbation Constrained Adversarial Attack for Evaluating the Robustness of Optical Flow by Jenny Schmalfuss et al
03-23-2022	Transformer-based Multimodal Information Fusion for Facial Expression Analysis by Wei Zhang et al
03-22-2022	Unsupervised Anomaly Detection in Medical Images with a Memory-augmented Multi-level Cross-attentional Masked Autoencoder by Yu Tian et al
03-22-2022	Adaptive Patch Exiting for Scalable Single Image Super-Resolution by Shizun Wang et al
03-23-2022	UMT: Unified Multi-modal Transformers for Joint Video Moment Retrieval and Highlight Detection by Ye Liu et al
03-22-2022	DepthGAN: GAN-based Depth Generation of Indoor Scenes from Semantic Layouts by Yidi Li et al
03-23-2022	Subjective and Objective Analysis of Streamed Gaming Videos by Xiangxu Yu et al
03-22-2022	4D-OR: Semantic Scene Graphs for OR Domain Modeling by Ege Özsoy et al
03-22-2022	Unified Negative Pair Generation toward Well-discriminative Feature Space for Face Recognition by Junuk Jung et al
03-23-2022	HMFS: Hybrid Masking for Few-Shot Segmentation by Seonghyeon Moon et al
03-23-2022	ViT-FOD: A Vision Transformer based Fine-grained Object Discriminator by Zi-Chao Zhang et al
03-24-2022	Neural Reflectance for Shape Recovery with Shadow Handling by Junxuan Li et al
03-24-2022	Privileged Attribution Constrained Deep Networks for Facial Expression Recognition by Jules Bonnard et al
03-23-2022	Robust Text Line Detection in Historical Documents: Learning and Evaluation Methods by Mélodie Boillet et al
03-22-2022	Cross-View Panorama Image Synthesis by Songsong Wu et al
03-22-2022	Contrastive Transformer-based Multiple Instance Learning for Weakly Supervised Polyp Frame Detection by Yu Tian et al
03-23-2022	AIMusicGuru: Music Assisted Human Pose Correction by Snehesh Shrestha et al
03-22-2022	Remember Intentions: Retrospective-Memory-based Trajectory Prediction by Chenxin Xu et al
03-22-2022	Ray3D: ray-based 3D human pose estimation for monocular absolute 3D localization by Yu Zhan et al
03-22-2022	Under the Hood of Transformer Networks for Trajectory Forecasting by Luca Franco et al
03-23-2022	Real-time Object Detection for Streaming Perception by Jinrong Yang et al
03-23-2022	CroMo: Cross-Modal Learning for Monocular Depth Estimation by Yannick Verdié et al
03-23-2022	Scale-Equivalent Distillation for Semi-Supervised Object Detection by Qiushan Guo et al
03-23-2022	DR.VIC: Decomposition and Reasoning for Video Individual Counting by Tao Han et al
03-23-2022	3D Adapted Random Forest Vision (3DARFV) for Untangling Heterogeneous-Fabric Exceeding Deep Learning Semantic Segmentation Efficiency at the Utmost Accuracy by Omar Alfarisi et al
03-24-2022	Continuous Emotion Recognition using Visual-audio-linguistic information: A Technical Report for ABAW3 by Su Zhang et al
03-24-2022	Towards Escaping from Language Bias and OCR Error: Semantics-Centered Text Visual Question Answering by Chengyang Fang et al
03-23-2022	Autofocus for Event Cameras by Shijie Lin et al
03-23-2022	Ev-TTA: Test-Time Adaptation for Event-Based Object Recognition by Junho Kim et al
03-25-2022	Versatile Multi-Modal Pre-Training for Human-Centric Perception by Fangzhou Hong et al
03-23-2022	Self-supervised HDR Imaging from Motion and Exposure Cues by Michal Nazarczuk et al
03-24-2022	WarpingGAN: Warping Multiple Uniform Priors for Adversarial 3D Point Cloud Generation by Yingzhi Tang et al
03-23-2022	Domain-Generalized Textured Surface Anomaly Detection by Shang-Fu Chen et al
03-23-2022	Lane detection with Position Embedding by Jun Xie et al
03-23-2022	DAN: a Segmentation-free Document Attention Network for Handwritten Document Recognition by Denis Coquenet et al
03-22-2022	ProgressiveMotionSeg: Mutually Reinforced Framework for Event-Based Motion Segmentation by Jinze Chen et al
03-24-2022	Practical Blind Denoising via Swin-Conv-UNet and Data Synthesis by Kai Zhang et al
03-23-2022	Affective Feedback Synthesis Towards Multimodal Text and Image Data by Puneet Kumar et al
03-22-2022	Rebalanced Siamese Contrastive Mining for Long-Tailed Recognition by Zhisheng Zhong et al
03-23-2022	On the (Limited) Generalization of MasterFace Attacks and Its Relation to the Capacity of Face Representations by Philipp Terhörst et al
03-24-2022	Bayesian Nonparametric Submodular Video Partition for Robust Anomaly Detection by Hitesh Sapkota et al
03-24-2022	Multiple Emotion Descriptors Estimation at the ABAW3 Challenge by Didan Deng
03-24-2022	Keypoints Tracking via Transformer Networks by Oleksii Nasypanyi et al
03-24-2022	Semantic Image Manipulation with Background-guided Internal Learning by Zhongping Zhang et al
03-23-2022	An Attention-based Method for Action Unit Detection at the 3rd ABAW Competition by Duy Le Hoai et al
03-23-2022	Your Attention Deserves Attention: A Self-Diversified Multi-Channel Attention for Facial Action Analysis by Xiaotian Li et al
03-23-2022	Multidimensional Belief Quantification for Label-Efficient Meta-Learning by Deep Pandey et al
03-23-2022	Training-free Transformer Architecture Search by Qinqin Zhou et al
03-24-2022	Searching for fingerspelled content in American Sign Language by Bowen Shi et al
03-22-2022	QS-Craft: Learning to Quantize, Scrabble and Craft for Conditional Human Motion Animation by Yuxin Hong et al
03-24-2022	X-ray Dissectography Improves Lung Nodule Detection by Chuang Niu et al
03-24-2022	The Fixed Sub-Center: A Better Way to Capture Data Complexity by Zhemin Zhang et al
03-22-2022	High-resolution Iterative Feedback Network for Camouflaged Object Detection by Xiaobin Hu et al
03-23-2022	Multi-label Transformer for Action Unit Detection by Gauthier Tallec et al
03-22-2022	Exploring and Evaluating Image Restoration Potential in Dynamic Scenes by Cheng Zhang et al
03-22-2022	Mixed Differential Privacy in Computer Vision by Aditya Golatkar et al
03-23-2022	Towards Efficient and Elastic Visual Question Answering with Doubly Slimmable Transformer by Zhou Yu et al
03-22-2022	Dense Residual Networks for Gaze Mapping on Indian Roads by Chaitanya Kapoor et al
03-25-2022	Neural Networks with Divisive normalization for image segmentation with application in Cityscapes dataset by Pablo Hernández-Cámara et al
03-22-2022	Leveraging Textures in Zero-shot Understanding of Fine-Grained Domains by Chenyun Wu et al
03-22-2022	Unifying Motion Deblurring and Frame Interpolation with Events by Xiang Zhang et al
03-22-2022	Adaptive Transformers for Robust Few-shot Cross-domain Face Anti-spoofing by Hsin-Ping Huang et al
03-22-2022	Frugal Learning of Virtual Exemplars for Label-Efficient Satellite Image Change Detection by Hichem Sahbi et al
03-24-2022	Transformers Meet Visual Learning Understanding: A Comprehensive Review by Yuting Yang et al
03-22-2022	Comprehensive Benchmark Datasets for Amharic Scene Text Detection and Recognition by Wondimu Dikubab et al
03-22-2022	Detection, Recognition, and Tracking: A Survey by Shiyao Chen et al
03-23-2022	Unsupervised Salient Object Detection with Spectral Cluster Voting by Gyungin Shin et al
03-25-2022	Interpretation of Chest x-rays affected by bullets using deep transfer learning by Shaheer Khan et al
03-22-2022	GOSS: Towards Generalized Open-set Semantic Segmentation by Jie Hong et al
03-23-2022	What to Hide from Your Students: Attention-Guided Masked Image Modeling by Ioannis Kakogeorgiou et al
03-24-2022	Quantum Motion Segmentation by Federica Arrigoni et al
03-23-2022	DynamicEarthNet: Daily Multi-Spectral Satellite Dataset for Semantic Change Segmentation by Aysim Toker et al
03-25-2022	Stochastic Trajectory Prediction via Motion Indeterminacy Diffusion by Tianpei Gu et al
03-23-2022	Hyper-Spectral Imaging for Overlapping Plastic Flakes Segmentation by Guillem Martinez et al
03-25-2022	The TerraByte Client: providing access to terabytes of plant data by Michael A. Beck et al
03-25-2022	Non-Probability Sampling Network for Stochastic Human Trajectory Prediction by Inhwan Bae et al
03-25-2022	ST-FL: Style Transfer Preprocessing in Federated Learning for COVID-19 Segmentation by Antonios Georgiadis et al
03-24-2022	Making Heads or Tails: Towards Semantically Consistent Visual Counterfactuals by Simon Vandenhende et al
03-22-2022	End-to-End Learned Block-Based Image Compression with Block-Level Masked Convolutions and Asymptotic Closed Loop Training by Fatih Kamisli
03-24-2022	Expression Classification using Concatenation of Deep Neural Network for the 3rd ABAW3 Competition by Kim Ngan Phan et al
03-23-2022	Adaptively Re-weighting Multi-Loss Untrained Transformer for Sparse-View Cone-Beam CT Reconstruction by Minghui Wu et al
03-22-2022	Convolutional Neural Network-based Efficient Dense Point Cloud Generation using Unsigned Distance Fields by Abol Basher et al
03-24-2022	Probing Representation Forgetting in Supervised and Unsupervised Continual Learning by MohammadReza Davari et al
03-25-2022	MDsrv -- visual sharing and analysis of molecular dynamics simulations by Michelle Kampfrath et al
03-25-2022	Facial Expression Recognition with Swin Transformer by Jun-Hwa Kim et al
03-25-2022	Interactive Style Transfer: All is Your Palette by Zheng Lin et al
03-24-2022	Weakly-Supervised End-to-End CAD Retrieval to Scan Objects by Tim Beyer et al
03-25-2022	Compare learning: bi-attention network for few-shot learning by Li Ke et al
03-22-2022	Semantic State Estimation in Cloth Manipulation Tasks by Georgies Tzelepis et al
03-25-2022	PANDORA: Polarization-Aided Neural Decomposition Of Radiance by Akshat Dave et al
03-25-2022	SeCo: Separating Unknown Musical Visual Sounds with Consistency Guidance by Xinchi Zhou et al
03-22-2022	SSD-KD: A Self-supervised Diverse Knowledge Distillation Method for Lightweight Skin Lesion Classification Using Dermoscopic Images by Yongwei Wang et al
03-25-2022	Continual Test-Time Domain Adaptation by Qin Wang et al
03-23-2022	Efficient Few-Shot Object Detection via Knowledge Inheritance by Ze Yang et al
03-24-2022	Effectively leveraging Multi-modal Features for Movie Genre Classification by Zhongping Zhang et al
03-24-2022	Intrinsic Bias Identification on Medical Image Datasets by Shijie Zhang et al
03-22-2022	DTFD-MIL: Double-Tier Feature Distillation Multiple Instance Learning for Histopathology Whole Slide Image Classification by Hongrun Zhang et al
03-22-2022	Joint Feature Learning and Relation Modeling for Tracking: A One-Stream Framework by Botao Ye et al
03-22-2022	Unsupervised Deraining: Where Contrastive Learning Meets Self-similarity by Ye Yuntong et al
03-25-2022	Lightweight Graph Convolutional Networks with Topologically Consistent Magnitude Pruning by Hichem Sahbi
03-24-2022	Microstructure Surface Reconstruction from SEM Images: An Alternative to Digital Image Correlation (DIC) by Khalid El-Awady
03-24-2022	FAMLP: A Frequency-Aware MLP-Like Architecture For Domain Generalization by Kecheng Zheng et al
03-24-2022	An Ensemble Approach for Facial Expression Analysis in Video by Hong-Hai Nguyen et al
03-22-2022	FxP-QNet: A Post-Training Quantizer for the Design of Mixed Low-Precision DNNs with Dynamic Fixed-Point Representation by Ahmad Shawahna et al
03-22-2022	WayFAST: Traversability Predictive Navigation for Field Robots by Mateus Valverde Gasparino et al
03-25-2022	Navigable Proximity Graph-Driven Native Hybrid Queries with Structured and Unstructured Constraints by Mengzhao Wang et al
03-24-2022	Noisy Boundaries: Lemon or Lemonade for Semi-supervised Instance Segmentation? by Zhenyu Wang et al
03-25-2022	Deformable Butterfly: A Highly Structured and Sparse Linear Transform by Rui Lin et al
03-24-2022	Self-Supervised Predictive Learning: A Negative-Free Method for Sound Source Localization in Visual Scenes by Zengjie Song et al
03-22-2022	TransFusion: Robust LiDAR-Camera Fusion for 3D Object Detection with Transformers by Xuyang Bai et al
03-22-2022	FrameHopper: Selective Processing of Video Frames in Detection-driven Real-Time Video Analytics by Md Adnan Arefeen et al
03-25-2022	Vision Transformer Compression with Structured Pruning and Low Rank Approximation by Ankur Kumar
03-25-2022	StretchBEV: Stretching Future Instance Prediction Spatially and Temporally by Adil Kaan Akan et al
03-25-2022	Analysis of the Production Strategy of Mask Types in the COVID-19 Environment by Xiangri Lu et al
03-25-2022	Searching for Network Width with Bilaterally Coupled Network by Xiu Su et al
03-25-2022	A Visual Navigation Perspective for Category-Level Object Pose Estimation by Jiaxin Guo et al
03-24-2022	Multi-scale and Cross-scale Contrastive Learning for Semantic Segmentation by Theodoros Pissas et al
03-25-2022	Analysis of the use of color and its emotional relationship in visual creations based on experiences during the context of the COVID-19 pandemic by César González-Martín et al
03-25-2022	Implicit Neural Representations for Variable Length Human Motion Generation by Pablo Cervantes et al
03-25-2022	Unsupervised Image Deraining: Optimization Model Driven Deep CNN by Changfeng Yu et al
03-22-2022	Satellite Infrastructure/Mission Tradeoffs by Matthew Ciolino
03-22-2022	Learning Geodesic-Aware Local Features from RGB-D Images by Guilherme Potje et al
03-25-2022	Salt Detection Using Segmentation of Seismic Image by Mrinmoy Sarkar
03-25-2022	Efficient Visual Tracking via Hierarchical Cross-Attention Transformer by Xin Chen et al
03-25-2022	Multimodal Pre-training Based on Graph Attention Network for Document Understanding by Zhenrong Zhang et al
03-25-2022	High-Performance Transformer Tracking by Xin Chen et al
03-25-2022	Spatially Multi-conditional Image Generation by Ritika Chakraborty et al
03-25-2022	Visual-based Safe Landing for UAVs in Populated Areas: Real-time Validation in Virtual Environments by Hector Tovanche-Picon et al
03-25-2022	Playing Lottery Tickets in Style Transfer Models by Meihao Kong et al
03-25-2022	Continuous Dynamic-NeRF: Spline-NeRF by Julian Knodt
03-24-2022	Multi-modal Multi-label Facial Action Unit Detection with Transformer by Lingfeng Wang et al
03-24-2022	Weakly-Supervised Online Action Segmentation in Multi-View Instructional Videos by Reza Ghoddoosian et al
03-24-2022	Human Gait Recognition Using Bag of Words Feature Representation Method by Nasrin Bayat et al
03-24-2022	MonoDETR: Depth-aware Transformer for Monocular 3D Object Detection by Renrui Zhang et al
03-25-2022	CAD: Co-Adapting Discriminative Features for Improved Few-Shot Classification by Philip Chikontwe et al
03-25-2022	RD-Optimized Trit-Plane Coding of Deep Compressed Image Latent Tensors by Seungmin Jeon et al
03-25-2022	Improving Adversarial Transferability with Spatial Momentum by Guoqiu Wang et al
03-25-2022	Dense Continuous-Time Optical Flow from Events and Frames by Mathias Gehrig et al
03-25-2022	Semi-supervised and Deep learning Frameworks for Video Classification and Key-frame Identification by Sohini Roychowdhury
03-25-2022	PCA-Based Knowledge Distillation Towards Lightweight and Content-Style Balanced Photorealistic Style Transfer Models by Tai-Yin Chiu et al
03-25-2022	Unsupervised Pre-training for Temporal Action Localization Tasks by Can Zhang et al
03-25-2022	Adjacent Context Coordination Network for Salient Object Detection in Optical Remote Sensing Images by Gongyang Li et al
03-25-2022	MDAN: Multi-level Dependent Attention Network for Visual Emotion Analysis by Liwen Xu et al
03-25-2022	Fast Hybrid Image Retargeting by Daniel Valdez-Balderas et al
03-24-2022	Continuous-Time Audiovisual Fusion with Recurrence vs. Attention for In-The-Wild Affect Recognition by Vincent Karas et al
03-25-2022	Learning to Adapt to Unseen Abnormal Activities under Weak Supervision by Jaeyoo Park et al
03-25-2022	Class-Incremental Learning for Action Recognition in Videos by Jaeyoo Park et al
03-25-2022	Stabilizing Adversarially Learned One-Class Novelty Detection Using Pseudo Anomalies by Muhammad Zaigham Zaheer et al
03-24-2022	Frame-level Prediction of Facial Expressions, Valence, Arousal and Action Units for Mobile Devices by Andrey V. Savchenko
03-24-2022	BCOT: A Markerless High-Precision 3D Object Tracking Benchmark by Jiachen Li et al
03-25-2022	FReSCO: Flow Reconstruction and Segmentation for low latency Cardiac Output monitoring using deep artifact suppression and segmentation by Olivier Jaubert et al
03-25-2022	Rope3D: The Roadside Perception Dataset for Autonomous Driving and Monocular 3D Object Detection Task by Xiaoqing Ye et al
03-22-2022	Learning Patch-to-Cluster Attention in Vision Transformer by Ryan Grainger et al
03-25-2022	Clustering Aided Weakly Supervised Training to Detect Anomalous Events in Surveillance Videos by Muhammad Zaigham Zaheer et al
03-24-2022	Point2Seq: Detecting 3D Objects as Sequences by Yujing Xue et al
03-25-2022	Digital Fingerprinting of Microstructures by Michael D. White et al
03-25-2022	Contrastive learning of Class-agnostic Activation Map for Weakly Supervised Object Localization and Semantic Segmentation by Jinheng Xie et al
03-25-2022	Give Me Your Attention: Dot-Product Attention Considered Harmful for Adversarial Patch Robustness by Giulio Lovisotto et al
03-24-2022	CrossFormer: Cross Spatio-Temporal Transformer for 3D Human Pose Estimation by Mohammed Hassanin et al
03-24-2022	Occluded Human Mesh Recovery by Rawal Khirodkar et al
03-24-2022	Repairing Group-Level Errors for DNNs Using Weighted Regularization by Ziyuan Zhong et al

Craig Smith, March 28, 2022