2021.12.6 Vision papers

12-02-2021	Zero-Shot Text-Guided Object Generation with Dream Fields by Ajay Jain et al
11-30-2021	HyperStyle: StyleGAN Inversion with HyperNetworks for Real Image Editing by Yuval Alaluf et al
12-02-2021	SEAL: Self-supervised Embodied Active Learning using Exploration and 3D Consistency by Devendra Singh Chaplot et al
12-01-2021	RegNeRF: Regularizing Neural Radiance Fields for View Synthesis from Sparse Inputs by Michael Niemeyer et al
12-02-2021	Editing a classifier by rewriting its prediction rules by Shibani Santurkar et al
11-30-2021	Hallucinated Neural Radiance Fields in the Wild by Xingyu Chen et al
12-02-2021	Learning to Detect Every Thing in an Open World by Kuniaki Saito et al
12-01-2021	PartImageNet: A Large, High-Quality Dataset of Parts by Ju He et al
12-02-2021	BEVT: BERT Pretraining of Video Transformers by Rui Wang et al
12-01-2021	Object-Aware Cropping for Self-Supervised Learning by Shlok Mishra et al
11-30-2021	Diffusion Autoencoders: Toward a Meaningful and Decodable Representation by Konpat Preechakul et al
11-30-2021	Task2Sim: Towards Effective Pre-training and Transfer from Synthetic Data by Samarth Mishra et al
12-02-2021	Improved Multiscale Vision Transformers for Classification and Detection by Yanghao Li et al
11-30-2021	3D Photo Stylization: Learning to Generate Stylized Novel Views from a Single Image by Fangzhou Mu et al
12-02-2021	GLAMR: Global Occlusion-Aware Human Mesh Recovery with Dynamic Cameras by Ye Yuan et al
11-30-2021	DiffSDFSim: Differentiable Rigid-Body Dynamics With Implicit Shapes by Michael Strecke et al
12-01-2021	Translation-equivariant Image Quantizer for Bi-directional Image-Text Generation by Woncheol Shin et al
12-02-2021	FuseDream: Training-Free Text-to-Image Generation with Improved CLIP+GAN Space Optimization by Xingchao Liu et al
12-02-2021	DenseCLIP: Extract Free Dense Labels from CLIP by Chong Zhou et al
12-02-2021	DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting by Yongming Rao et al
11-30-2021	Sound-Guided Semantic Image Manipulation by Seung Hyun Lee et al
11-30-2021	AdaViT: Adaptive Vision Transformers for Efficient Image Recognition by Lingchen Meng et al
11-30-2021	NeuSample: Neural Sample Field for Efficient View Synthesis by Jiemin Fang et al
12-02-2021	Masked-attention Mask Transformer for Universal Image Segmentation by Bowen Cheng et al
12-01-2021	SegDiff: Image Segmentation with Diffusion Probabilistic Models by Tomer Amit et al
12-01-2021	Extrapolating from a Single Image to a Thousand Classes using Distillation by Yuki M. Asano et al
12-02-2021	Neural Head Avatars from Monocular RGB Videos by Philip-William Grassal et al
11-30-2021	ATS: Adaptive Token Sampling For Efficient Vision Transformers by Mohsen Fayyaz et al
12-01-2021	CLIPstyler: Image Style Transfer with a Single Text Condition by Gihyun Kwon et al
12-02-2021	StyleMesh: Style Transfer for Indoor 3D Scene Reconstructions by Lukas Höllein et al
12-01-2021	Improving GAN Equilibrium by Raising Spatial Awareness by Jianyuan Wang et al
12-01-2021	Routing with Self-Attention for Multimodal Capsule Networks by Kevin Duarte et al
12-01-2021	MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio Descriptions by Mattia Soldan et al
12-03-2021	Reduced, Reused and Recycled: The Life of a Dataset in Machine Learning Research by Bernard Koch et al
12-02-2021	Neural Weight Step Video Compression by Mikolaj Czerkawski et al
12-02-2021	Efficient Neural Radiance Fields with Learned Depth-Guided Sampling by Haotong Lin et al
12-01-2021	Object-aware Video-language Pre-training for Retrieval by Alex Jinpeng Wang et al
12-01-2021	Robustness in Deep Learning for Computer Vision: Mind the gap? by Nathan Drenkow et al
12-01-2021	Object-Centric Unsupervised Image Captioning by Zihang Meng et al
11-30-2021	CRIS: CLIP-Driven Referring Image Segmentation by Zhaoqing Wang et al
12-01-2021	Vision Pair Learning: An Efficient Training Framework for Image Classification by Bei Tong et al
11-30-2021	VoRTX: Volumetric 3D Reconstruction With Transformers for Voxelwise View Selection and Fusion by Noah Stier et al
12-01-2021	GANORCON: Are Generative Models Useful for Few-shot Segmentation? by Oindrila Saha et al
12-02-2021	Learning Neural Light Fields with Ray-Space Embedding Networks by Benjamin Attal et al
12-02-2021	Neural Point Light Fields by Julian Ost et al
11-30-2021	Unsupervised Domain Adaptation: A Reality Check by Kevin Musgrave et al
12-02-2021	Differentiable Spatial Planning using Transformers by Devendra Singh Chaplot et al
12-02-2021	Dimensions of Motion: Learning to Predict a Subspace of Optical Flow from a Single Image by Richard Strong Bowen et al
12-03-2021	NeRF-SR: High-Quality Neural Radiance Fields using Super-Sampling by Chen Wang et al
12-03-2021	CoNeRF: Controllable Neural Radiance Fields by Kacper Kania et al
12-01-2021	MonoScene: Monocular 3D Semantic Scene Completion by Anh-Quan Cao et al
12-03-2021	Coupling Vision and Proprioception for Navigation of Legged Robots by Zipeng Fu et al
12-01-2021	Consensus Graph Representation Learning for Better Grounded Image Captioning by Wenqiao Zhang et al
12-01-2021	HyperInverter: Improving StyleGAN Inversion via Hypernetwork by Tan M. Dinh et al
11-30-2021	NeRFReN: Neural Radiance Fields with Reflections by Yuan-Chen Guo et al
12-01-2021	Reference-guided Pseudo-Label Generation for Medical Semantic Segmentation by Constantin Seibold et al
12-01-2021	FaceTuneGAN: Face Autoencoder for Convolutional Expression Transfer Using Neural Generative Adversarial Networks by Nicolas Olivier et al
12-01-2021	Confidence Propagation Cluster: Unleash Full Potential of Object Detectors by Yichun Shen et al
12-01-2021	The Shape Part Slot Machine: Contact-based Reasoning for Generating 3D Shapes from Parts by Kai Wang et al
11-30-2021	Exponentially Tilted Gaussian Prior for Variational Autoencoder by Griffin Floto et al
12-03-2021	Class-agnostic Reconstruction of Dynamic Objects from Videos by Zhongzheng Ren et al
11-30-2021	Shunted Self-Attention via Multi-Scale Token Aggregation by Sucheng Ren et al
12-02-2021	Recognizing Scenes from Novel Viewpoints by Shengyi Qian et al
12-02-2021	Quantifying the uncertainty of neural networks using Monte Carlo dropout for deep learning based quantitative MRI by Mehmet Yigit Avci et al
12-02-2021	Controllable Video Captioning with an Exemplar Sentence by Yitian Yuan et al
11-30-2021	Shallow Network Based on Depthwise Over-Parameterized Convolution for Hyperspectral Image Classification by Hongmin Gao et al
12-02-2021	Syntax Customized Video Captioning by Imitating Exemplar Sentences by Yitian Yuan et al
12-02-2021	Inferring Prototypes for Multi-Label Few-Shot Image Classification with Word Vector Guided Attention by Kun Yan et al
11-30-2021	SpaceEdit: Learning a Unified Editing Space for Open-Domain Image Editing by Jing Shi et al
12-03-2021	Global Context with Discrete Diffusion in Vector Quantised Modelling for Image Generation by Minghui Hu et al
12-01-2021	Relational Graph Learning for Grounded Video Description Generation by Wenqiao Zhang et al
12-03-2021	Hierarchical Optimal Transport for Unsupervised Domain Adaptation by Mourad El Hamri et al
11-30-2021	FENeRF: Face Editing in Neural Radiance Fields by Jingxiang Sun et al
12-01-2021	The Norm Must Go On: Dynamic Unsupervised Domain Adaptation by Normalization by M. Jehanzeb Mirza et al
12-03-2021	Efficient Two-Stage Detection of Human-Object Interactions with a Novel Unary-Pairwise Transformer by Frederic Z. Zhang et al
12-01-2021	CYBORG: Blending Human Saliency Into the Loss Improves Deep Learning by Aidan Boyd et al
11-30-2021	Is the use of Deep Learning and Artificial Intelligence an appropriate means to locate debris in the ocean without harming aquatic wildlife? by Zoe Moorton et al
12-01-2021	Hierarchical Neural Implicit Pose Network for Animation and Motion Retargeting by Sourav Biswas et al
12-02-2021	D3Net: A Speaker-Listener Architecture for Semi-supervised Dense Captioning and Visual Grounding in RGB-D Scans by Dave Zhenyu Chen et al
12-02-2021	Altering Facial Expression Based on Textual Emotion by Mohammad Imrul Jubair et al
12-01-2021	Learning Transformer Features for Image Quality Assessment by Chao Zeng et al
12-01-2021	PreViTS: Contrastive Pretraining with Video Tracking Supervision by Brian Chen et al
12-01-2021	Incomplete Multi-view Clustering via Cross-view Relation Transfer by Yiming Wang et al
12-01-2021	Forward Operator Estimation in Generative Models with Kernel Transfer Operators by Zhichun Huang et al
12-01-2021	FCAF3D: Fully Convolutional Anchor-Free 3D Object Detection by Danila Rukhovich et al
12-01-2021	CondenseNeXt: An Ultra-Efficient Deep Neural Network for Embedded Systems by Priyank Kalgaonkar et al
12-01-2021	Weakly-Supervised Video Object Grounding via Causal Intervention by Wei Wang et al
12-01-2021	Federated Learning with Adaptive Batchnorm for Personalized Healthcare by Yiqiang Chen et al
12-01-2021	Total-Body Low-Dose CT Image Denoising using Prior Knowledge Transfer Technique with Contrastive Regularization Mechanism by Minghan Fu et al
11-30-2021	Revisiting Temporal Alignment for Video Restoration by Kun Zhou et al
11-30-2021	LossPlot: A Better Way to Visualize Loss Landscapes by Robert Bain et al
12-03-2021	Frame Averaging for Equivariant Shape Space Learning by Matan Atzmon et al
12-01-2021	Rethink, Revisit, Revise: A Spiral Reinforced Self-Revised Network for Zero-Shot Learning by Zhe Liu et al
12-02-2021	LMR-CBT: Learning Modality-fused Representations with CB-Transformer for Multimodal Emotion Recognition from Unaligned Multimodal Sequences by Ziwang Fu et al
12-01-2021	MDFM: Multi-Decision Fusing Model for Few-Shot Learning by Shuai Shao et al
11-30-2021	PoseKernelLifter: Metric Lifting of 3D Human Pose using Sound by Zhijian Yang et al
12-02-2021	Deep residential representations: Using unsupervised learning to unlock elevation data for geo-demographic prediction by Matthew Stevenson et al
11-30-2021	Open-Domain, Content-based, Multi-modal Fact-checking of Out-of-Context Images via Online Resources by Sahar Abdelnabi et al
12-01-2021	Neural Emotion Director: Speech-preserving semantic control of facial expressions in in-the-wild videos by Foivos Paraperas Papantoniou et al
12-01-2021	CDLNet: Noise-Adaptive Convolutional Dictionary Learning Network for Blind Denoising and Demosaicing by Nikola Janjušević et al
12-01-2021	Automatic tumour segmentation in H&E-stained whole-slide images of the pancreas by Pierpaolo Vendittelli et al
11-30-2021	NeeDrop: Self-supervised Shape Representation from Sparse Point Clouds using Needle Dropping by Alexandre Boulch et al
11-30-2021	Leveraging The Topological Consistencies of Learning in Deep Neural Networks by Stuart Synakowski et al
11-30-2021	Attentive Prototypes for Source-free Unsupervised Domain Adaptive 3D Object Detection by Deepti Hegde et al
12-02-2021	Uni-Perceiver: Pre-training Unified Architecture for Generic Perception for Zero-shot and Few-shot Tasks by Xizhou Zhu et al
12-01-2021	The Majority Can Help The Minority: Context-rich Minority Oversampling for Long-tailed Classification by Seulki Park et al
12-01-2021	Deep Measurement Updates for Bayes Filters by Johannes Pankert et al
12-03-2021	Data-Free Neural Architecture Search via Recursive Label Calibration by Zechun Liu et al
12-01-2021	Learning to automate cryo-electron microscopy data collection with Ptolemy by Paul T. Kim et al
12-01-2021	A Unified Benchmark for the Unknown Detection Capability of Deep Neural Networks by Jihyo Kim et al
11-30-2021	Semi-Local Convolutions for LiDAR Scan Processing by Larissa T. Triess et al
12-02-2021	Fast Neural Representations for Direct Volume Rendering by Sebastian Weiss et al
11-30-2021	MapReader: A Computer Vision Pipeline for the Semantic Exploration of Maps at Scale by Kasra Hosseini et al
12-03-2021	A Structured Dictionary Perspective on Implicit Neural Representations by Gizem Yüce et al
11-30-2021	Light Field Implicit Representation for Flexible Resolution Reconstruction by Paramanand Chandramouli et al
12-01-2021	Semi-Supervised Surface Anomaly Detection of Composite Wind Turbine Blades From Drone Imagery by Jack W. Barker et al
12-01-2021	DFTS2: Simulating Deep Feature Transmission Over Packet Loss Channels by Ashiv Dhondea et al
11-30-2021	Scalable Primitives for Generalized Sensor Fusion in Autonomous Vehicles by Sammy Sidhu et al
11-30-2021	The Devil is in the Margin: Margin-based Label Smoothing for Network Calibration by Bingyuan Liu et al
11-30-2021	PokeBNN: A Binary Pursuit of Lightweight Accuracy by Yichi Zhang et al
11-30-2021	A Highly Effective Low-Rank Compression of Deep Neural Networks with Modified Beam-Search and Modified Stable Rank by Moonjung Eo et al
12-01-2021	Using Deep Image Prior to Assist Variational Selective Segmentation Deep Learning Algorithms by Liam Burrows et al
12-02-2021	A Fast Knowledge Distillation Framework for Visual Recognition by Zhiqiang Shen et al
11-30-2021	MC-SSL0.0: Towards Multi-Concept Self-Supervised Learning by Sara Atito et al
11-30-2021	Improved sparse PCA method for face and image recognition by Loc Hoang Tran et al
12-02-2021	CSAW-M: An Ordinal Classification Dataset for Benchmarking Mammographic Masking of Cancer by Moein Sorkhei et al
12-01-2021	On-Device Spatial Attention based Sequence Learning Approach for Scene Text Script Identification by Rutika Moharir et al
12-02-2021	Co-domain Symmetry for Complex-Valued Deep Learning by Utkarsh Singhal et al
12-03-2021	ROCA: Robust CAD Model Retrieval and Alignment from a Single Image by Can Gümeli et al
12-01-2021	Saliency Enhancement using Superpixel Similarity by Leonardo de Melo Joao et al
11-30-2021	Ranking Distance Calibration for Cross-Domain Few-Shot Learning by Pan Li et al
12-02-2021	N-ImageNet: Towards Robust, Fine-Grained Object Recognition with Event Cameras by Junho Kim et al
12-03-2021	Adversarial Attacks against a Satellite-borne Multispectral Cloud Detector by Andrew Du et al
12-02-2021	Is RobustBench/AutoAttack a suitable Benchmark for Adversarial Robustness? by Peter Lorenz et al
12-02-2021	SCNet: A Generalized Attention-based Model for Crack Fault Segmentation by Hrishikesh Sharma et al
12-03-2021	SSDL: Self-Supervised Dictionary Learning by Shuai Shao et al
12-01-2021	Highly accelerated MR parametric mapping by undersampling the k-space and reducing the contrast number simultaneously with deep learning by Yanjie Zhu et al
12-02-2021	Sample Prior Guided Robust Model Learning to Suppress Noisy Labels by Wenkai Chen et al
11-30-2021	Voint Cloud: Multi-View Point Cloud Representation for 3D Understanding by Abdullah Hamdi et al
12-02-2021	Localized Feature Aggregation Module for Semantic Segmentation by Ryouichi Furukawa et al
12-01-2021	CLAWS: Contrastive Learning with hard Attention and Weak Supervision by Jansel Herrera-Gerena et al
11-30-2021	Training BatchNorm Only in Neural Architecture Search and Beyond by Yichen Zhu et al
12-02-2021	Structure-Aware Multi-Hop Graph Convolution for Graph Neural Networks by Yang Li et al
12-03-2021	Adaptive Poincaré Point to Set Distance for Few-Shot Classification by Rongkai Ma et al
12-01-2021	Point Cloud Segmentation Using Sparse Temporal Local Attention by Joshua Knights et al
12-03-2021	Geometric Feature Learning for 3D Meshes by Huan Lei et al
11-30-2021	ConDA: Unsupervised Domain Adaptation for LiDAR Segmentation via Regularized Domain Concatenation by Lingdong Kong et al
12-03-2021	Mind Your Clever Neighbours: Unsupervised Person Re-identification via Adaptive Clustering Relationship Modeling by Lianjie Jia et al
12-02-2021	Training Efficiency and Robustness in Deep Learning by Fartash Faghri
12-01-2021	Trimap-guided Feature Mining and Fusion Network for Natural Image Matting by Weihao Jiang et al
11-30-2021	EdiBERT, a generative model for image editing by Thibaut Issenhuth et al
12-03-2021	A Survey: Deep Learning for Hyperspectral Image Classification with Few Labeled Samples by Sen Jia et al
12-02-2021	Probabilistic Approach for Road-Users Detection by G. Melotti et al
11-30-2021	ZZ-Net: A Universal Rotation Equivariant Architecture for 2D Point Clouds by Georg Bökman et al
12-02-2021	Active Learning for Domain Adaptation: An Energy-based Approach by Binhui Xie et al
12-01-2021	3D Reconstruction Using a Linear Laser Scanner and a Camera by Rui Wang
12-01-2021	ℓ∞-Robustness and Beyond: Unleashing Efficient Adversarial Training by Hadi M. Dolatabadi et al

11-30-2021	Assessment of Data Consistency through Cascades of Independently Recurrent Inference Machines for fast and robust accelerated MRI reconstruction by D. Karkalousos et al
12-03-2021	Bridging the Gap: Point Clouds for Merging Neurons in Connectomics by Jules Berman et al
12-01-2021	Adv-4-Adv: Thwarting Changing Adversarial Perturbations via Adversarial Domain Adaptation by Tianyue Zheng et al
12-02-2021	Object-aware Monocular Depth Prediction with Instance Convolutions by Enis Simsar et al
11-30-2021	Benchmarking Deep Deblurring Algorithms: A Large-Scale Multi-Cause Dataset and A New Baseline Model by Kaihao Zhang et al
12-01-2021	A benchmark with decomposed distribution shifts for 360 monocular depth estimation by Georgios Albanis et al
11-30-2021	Automatic Synthesis of Diverse Weak Supervision Sources for Behavior Analysis by Albert Tseng et al
12-02-2021	TCTN: A 3D-Temporal Convolutional Transformer Network for Spatiotemporal Predictive Learning by Ziao Yang et al
12-02-2021	Batch Normalization Tells You Which Filter is Important by Junghun Oh et al
12-01-2021	Multi-View Stereo with Transformer by Jie Zhu et al
11-30-2021	Fully Automatic Deep Learning Framework for Pancreatic Ductal Adenocarcinoma Detection on Computed Tomography by Natália Alves et al
11-30-2021	Querying Labelled Data with Scenario Programs for Sim-to-Real Validation by Edward Kim et al
11-30-2021	3DVNet: Multi-View Depth Prediction and Volumetric Refinement by Alexander Rich et al
12-01-2021	Human-Object Interaction Detection via Weak Supervision by Mert Kilickaya et al
12-02-2021	Deep Depth from Focus with Differential Focus Volume by Fengting Yang et al
12-01-2021	Optimizing for In-memory Deep Learning with Emerging Memory Technology by Zhehui Wang et al
12-01-2021	Label-Free Model Evaluation with Semi-Structured Dataset Representations by Xiaoxiao Sun et al
12-01-2021	Transformer-based Network for RGB-D Saliency Detection by Yue Wang et al
12-01-2021	Background Activation Suppression for Weakly Supervised Object Localization by Pingyu Wu et al
11-30-2021	Regularized directional representations for medical image registration by Vincent Jaouen et al
12-01-2021	Information Theoretic Representation Distillation by Roy Miles et al
12-01-2021	On Salience-Sensitive Sign Classification in Autonomous Vehicle Path Planning: Experimental Explorations with a Novel Dataset by Ross Greer et al
12-01-2021	Dual Spoof Disentanglement Generation for Face Anti-spoofing with Depth Uncertainty Learning by Hangtong Wu et al
12-02-2021	Engineering AI Tools for Systematic and Scalable Quality Assessment in Magnetic Resonance Imaging by Yukai Zou et al
11-30-2021	GLocal: Global Graph Reasoning and Local Structure Transfer for Person Image Generation by Liyuan Ma et al
12-01-2021	Both Style and Fog Matter: Cumulative Domain Adaptation for Semantic Foggy Scene Understanding by Xianzheng Ma et al
12-02-2021	CloudWalker: 3D Point Cloud Learning by Random Walks for Shape Analysis by Adi Mesika et al
11-30-2021	Predicting Poverty Level from Satellite Imagery using Deep Neural Networks by Varun Chitturi et al
12-03-2021	Image-to-image Translation as a Unique Source of Knowledge by Alejandro D. Mousist
12-01-2021	Revisiting the Transferability of Supervised Pretraining: an MLP Perspective by Yizhou Wang et al
12-03-2021	Unsupervised Low-Light Image Enhancement via Histogram Equalization Prior by Feng Zhang et al
12-03-2021	MT-TransUNet: Mediating Multi-Task Tokens in Transformers for Skin Lesion Segmentation and Classification by Jingye Chen et al
12-03-2021	Boosting Unsupervised Domain Adaptation with Soft Pseudo-label and Curriculum Learning by Shengjia Zhang et al
12-02-2021	The Surprising Effectiveness of Representation Learning for Visual Imitation by Jyothish Pari et al
12-01-2021	Dyadic Human Motion Prediction by Isinsu Katircioglu et al
11-30-2021	Seeking Salient Facial Regions for Cross-Database Micro-Expression Recognition by Xingxun Jiang et al
11-30-2021	ARTSeg: Employing Attention for Thermal images Semantic Segmentation by Farzeen Munir et al
12-03-2021	MSP: Refine Boundary Segmentation via Multiscale Superpixel by Jie Zhu et al
12-01-2021	Subtask-dominated Transfer Learning for Long-tail Person Search by Chuang Liu et al
12-03-2021	AirDet: Few-Shot Detection without Fine-tuning for Autonomous Exploration by Bowen Li et al
12-02-2021	Self-Supervised Material and Texture Representation Learning for Remote Sensing Tasks by Peri Akiva et al
12-01-2021	FDA-GAN: Flow-based Dual Attention GAN for Human Pose Transfer by Liyuan Ma et al
12-01-2021	DeepSportLab: a Unified Framework for Ball Detection, Player Instance Segmentation and Pose Estimation in Team Sports Scenes by Seyed Abolfazl Ghasemzadeh et al
11-30-2021	Spatio-Temporal Multi-Flow Network for Video Frame Interpolation by Duolikun Danier et al
12-02-2021	Multi-modal application: Image Memes Generation by Zhiyuan Liu et al
12-02-2021	Just Drive: Colour Bias Mitigation for Semantic Segmentation in the Context of Urban Driving by Jack Stelling et al
11-30-2021	Affect-DML: Context-Aware One-Shot Recognition of Human Affect using Deep Metric Learning by Kunyu Peng et al
11-30-2021	The MIS Check-Dam Dataset for Object Detection and Instance Segmentation Tasks by Chintan Tundia et al
11-30-2021	Anonymization for Skeleton Action Recognition by Myeonghyeon Kim et al
11-30-2021	AdaAfford: Learning to Adapt Manipulation Affordance for 3D Articulated Objects via Few-shot Interactions by Yian Wang et al
11-30-2021	CT-block: a novel local and global features extractor for point cloud by Shangwei Guo et al
12-03-2021	Incremental Learning in Semantic Segmentation from Image Labels by Fabio Cermelli et al
12-03-2021	MFNet: Multi-filter Directive Network for Weakly Supervised Salient Object Detection by Yongri Piao et al
12-03-2021	Action Units That Constitute Trainable Micro-expressions (and A Large-scale Synthetic Dataset) by Yuchi Liu et al
12-03-2021	Gesture Recognition with a Skeleton-Based Keyframe Selection Module by Yunsoo Kim et al
12-03-2021	Music-to-Dance Generation with Optimal Transport by Shuang Wu et al
11-30-2021	Improving Differentiable Architecture Search with a Generative Model by Ruisi Zhang et al
11-30-2021	Two-stage Temporal Modelling Framework for Video-based Depression Recognition using Graph Representation by Jiaqi Xu et al
12-02-2021	Hybrid Instance-aware Temporal Fusion for Online Video Instance Segmentation by Xiang Li et al
12-01-2021	Camera Motion Agnostic 3D Human Pose Estimation by Seong Hyun Kim et al
12-01-2021	Automatic travel pattern extraction from visa page stamps using CNN models by Eimantas Ledinauskas et al
11-30-2021	Boosting Discriminative Visual Representation Learning with Scenario-Agnostic Mixup by Siyuan Li et al
12-01-2021	Learning Oriented Remote Sensing Object Detection via Naive Geometric Computing by Yanjie Wang et al
12-02-2021	TransZero: Attribute-guided Transformer for Zero-Shot Learning by Shiming Chen et al
12-03-2021	TRNR: Task-Driven Image Rain and Noise Removal with a Few Images Based on Patch Analysis by Wu Ran et al
12-02-2021	Make A Long Image Short: Adaptive Token Length for Vision Transformers by Yichen Zhu et al
12-03-2021	Detect Faces Efficiently: A Survey and Evaluations by Yuantao Feng et al
12-01-2021	Attribute Artifacts Removal for Geometry-based Point Cloud Compression by Xihua Sheng et al
12-01-2021	Maximum Consensus by Weighted Influences of Monotone Boolean Functions by Erchuan Zhang et al
12-03-2021	Detection of Large Vessel Occlusions using Deep Learning by Deforming Vessel Tree Segmentations by Florian Thamm et al
12-01-2021	Push Stricter to Decide Better: A Class-Conditional Feature Adaptive Framework for Improving Adversarial Robustness by Jia-Li Yin et al
12-01-2021	Graph Convolutional Module for Temporal Action Localization in Videos by Runhao Zeng et al
12-01-2021	Visual-Semantic Transformer for Scene Text Recognition by Xin Tang et al
11-30-2021	A Face Recognition System's Worst Morph Nightmare, Theoretically by Una M. Kelly et al
11-30-2021	Pattern-Aware Data Augmentation for LiDAR 3D Object Detection by Jordan S. K. Hu et al
12-02-2021	3D-Aware Semantic-Guided Generative Model for Human Synthesis by Jichao Zhang et al
12-02-2021	Attention based Occlusion Removal for Hybrid Telepresence Systems by Surabhi Gupta et al
11-30-2021	An implementation of the "Guess who?" game using CLIP by Arnau Martí Sarri et al
12-01-2021	Interpretable Deep Learning-Based Forensic Iris Segmentation and Recognition by Andrey Kuehlkamp et al
11-30-2021	Beyond Flatland: Pre-training with a Strong 3D Inductive Bias by Shubhaankar Gupta et al
11-30-2021	PlantStereo: A Stereo Matching Benchmark for Plant Surface Dense Reconstruction by Qingyu Wang et al
12-02-2021	TISE: A Toolbox for Text-to-Image Synthesis Evaluation by Tan M. Dinh et al
11-30-2021	Point Cloud Instance Segmentation with Semi-supervised Bounding-Box Mining by Yongbin Liao et al
12-01-2021	Generalized Closed-form Formulae for Feature-based Subpixel Alignment in Patch-based Matching by Laurent Valentin Jospin et al
12-02-2021	Video-Text Pre-training with Learned Regions by Rui Yan et al
12-03-2021	Semantic Map Injected GAN Training for Image-to-Image Translation by Balaram Singh Kshatriya et al
11-30-2021	ESL: Event-based Structured Light by Manasi Muglikar et al
11-30-2021	Contrastive Learning for Local and Global Learning MRI Reconstruction by Qiaosi Yi et al
11-30-2021	HRNET: AI on Edge for mask detection and social distancing by Kinshuk Sengupta et al
11-30-2021	TALISMAN: Targeted Active Learning for Object Detection with Rare Classes and Slices using Submodular Mutual Information by Suraj Kothawade et al
11-30-2021	Detecting Extratropical Cyclones of the Northern Hemisphere with Single Shot Detector by Minjing Shi et al
12-02-2021	TransFGU: A Top-down Approach to Fine-Grained Unsupervised Semantic Segmentation by Zhaoyuan Yin et al
12-02-2021	Machine Learning-Based Classification Algorithms for the Prediction of Coronary Heart Diseases by Kelvin Kwakye et al
12-01-2021	Generating Diverse 3D Reconstructions from a Single Occluded Face Image by Rahul Dey et al
12-02-2021	Learning Spatial-Temporal Graphs for Active Speaker Detection by Sourya Roy et al
11-30-2021	TridentAdapt: Learning Domain-invariance via Source-Target Confrontation and Self-induced Cross-domain Augmentation by Fengyi Shen et al
11-30-2021	RADU: Ray-Aligned Depth Update Convolutions for ToF Data Denoising by Michael Schelling et al
11-30-2021	FMD-cGAN: Fast Motion Deblurring using Conditional Generative Adversarial Networks by Jatin Kumar et al
12-02-2021	Self-supervised Video Transformer by Kanchana Ranasinghe et al
12-02-2021	Probabilistic Tracking with Deep Factors by Fan Jiang et al
12-02-2021	OW-DETR: Open-world Detection Transformer by Akshita Gupta et al
12-01-2021	Event Neural Networks by Matthew Dutson et al
12-03-2021	Panoptic-based Object Style-Align for Image-to-Image Translation by Liyun Zhang et al
11-30-2021	PolyWorld: Polygonal Building Extraction with Graph Neural Networks in Satellite Images by Stefano Zorzi et al
12-03-2021	Total Scale: Face-to-Body Detail Reconstruction from Sparse RGBD Sensors by Zheng Dong et al
12-01-2021	FaSS-MVS -- Fast Multi-View Stereo with Surface-Aware Semi-Global Matching from UAV-borne Monocular Imagery by Boitumelo Ruf et al
11-30-2021	360MonoDepth: High-Resolution 360° Monocular Depth Estimation by Manuel Rey-Area et al
12-02-2021	TransMEF: A Transformer-Based Multi-Exposure Image Fusion Framework using Self-Supervised Multi-Task Learning by Linhao Qu et al
12-03-2021	Lightweight Attentional Feature Fusion for Video Retrieval by Text by Fan Hu et al
11-30-2021	Automated Damage Inspection of Power Transmission Towers from UAV Images by Aleixo Cambeiro Barreiro et al
12-02-2021	3rd Place Solution for NeurIPS 2021 Shifts Challenge: Vehicle Motion Prediction by Ching-Yu Tseng et al
12-01-2021	Unsupervised Statistical Learning for Die Analysis in Ancient Numismatics by Andreas Heinecke et al
12-03-2021	Towards Super-Resolution CEST MRI for Visualization of Small Structures by Lukas Folle et al
12-03-2021	Novel Class Discovery in Semantic Segmentation by Yuyang Zhao et al
12-03-2021	The Box Size Confidence Bias Harms Your Object Detector by Johannes Gilg et al
12-01-2021	Multiple Fusion Adaptation: A Strong Framework for Unsupervised Semantic Segmentation Adaptation by Kai Zhang et al
11-30-2021	A Unified Pruning Framework for Vision Transformers by Hao Yu et al
11-30-2021	Probabilistic Estimation of 3D Human Shape and Pose with a Semantic Local Parametric Model by Akash Sengupta et al
11-30-2021	Boosting EfficientNets Ensemble Performance via Pseudo-Labels and Synthetic Images by pix2pixHD for Infection and Ischaemia Classification in Diabetic Foot Ulcers by Louise Bloch et al
12-02-2021	Open-set 3D Object Detection by Jun Cen et al
12-02-2021	Hamiltonian prior to Disentangle Content and Motion in Image Sequences by Asif Khan et al
12-02-2021	SwinTrack: A Simple and Strong Baseline for Transformer Tracking by Liting Lin et al
11-30-2021	Robust Partial-to-Partial Point Cloud Registration in a Full Range by Liang Pan et al
11-30-2021	Human Imperceptible Attacks and Applications to Improve Fairness by Xinru Hua et al
12-02-2021	Putting 3D Spatially Sparse Networks on a Diet by Junha Lee et al
12-02-2021	Unconstrained Face Sketch Synthesis via Perception-Adaptive Network and A New Benchmark by Lin Nie et al
11-30-2021	MEFNet: Multi-scale Event Fusion Network for Motion Deblurring by Lei Sun et al
11-30-2021	Large-Scale Video Analytics through Object-Level Consolidation by Daniel Rivas et al
12-02-2021	Iterative Frame-Level Representation Learning And Classification For Semi-Supervised Temporal Action Segmentation by Dipika Singhania et al
12-02-2021	FIBA: Frequency-Injection based Backdoor Attack in Medical Image Analysis by Yu Feng et al
12-02-2021	Video Frame Interpolation without Temporal Priors by Youjian Zhang et al
11-30-2021	CLIP Meets Video Captioners: Attribute-Aware Representation Learning Promotes Accurate Captioning by Bang Yang et al
12-02-2021	MTFNet: Mutual-Transformer Fusion Network for RGB-D Salient Object Detection by Xixi Wang et al
12-02-2021	Overcoming the Domain Gap in Neural Action Representations by Semih Günel et al
12-02-2021	MegBA: A High-Performance and Distributed Library for Large-Scale Bundle Adjustment by Jie Ren et al
12-02-2021	NeSF: Neural Shading Field for Image Harmonization by Zhongyun Hu et al
12-01-2021	Unleashing the Potential of Unsupervised Pre-Training with Intra-Identity Regularization for Person Re-Identification by Zizheng Yang et al
12-01-2021	Temporally Resolution Decrement: Utilizing the Shape Consistency for Higher Computational Efficiency by Tianshu Xie et al
11-30-2021	Low-light Image Enhancement via Breaking Down the Darkness by Qiming Hu et al
11-30-2021	ColibriDoc: An Eye-in-Hand Autonomous Trocar Docking System by Shervin Dehghani et al
11-30-2021	Reconstruction Student with Attention for Student-Teacher Pyramid Matching by Shinji Yamada et al
12-02-2021	Deep Learning-Based Carotid Artery Vessel Wall Segmentation in Black-Blood MRI Using Anatomical Priors by Dieuwertje Alblas et al
11-30-2021	Zero-Shot Semantic Segmentation via Spatial and Multi-Scale Aware Visual Class Embedding by Sungguk Cha et al
11-30-2021	SamplingAug: On the Importance of Patch Sampling Augmentation for Single Image Super-Resolution by Shizun Wang et al
12-02-2021	Semantic-Sparse Colorization Network for Deep Exemplar-based Colorization by Yunpeng Bai et al
11-30-2021	Semi-Supervised 3D Hand Shape and Pose Estimation with Label Propagation by Samira Kaviani et al
12-02-2021	Stronger Baseline for Person Re-Identification by Fengliang Qi et al
12-02-2021	The Second Place Solution for ICCV2021 VIPriors Instance Segmentation Challenge by Bo Yan et al
12-02-2021	Fast automatic deforestation detectors and their extensions for other spatial objects by Jesper Muren et al
12-02-2021	InsCLR: Improving Instance Retrieval with Self-Supervision by Zelu Deng et al
12-02-2021	Local Similarity Pattern and Cost Self-Reassembling for Deep Stereo Matching Networks by Biyang Liu et al
12-01-2021	Optimization of phase-only holograms calculated with scaled diffraction calculation through deep neural networks by Yoshiyuki Ishii et al
12-03-2021	SGM3D: Stereo Guided Monocular 3D Object Detection by Zheyuan Zhou et al
11-30-2021	Multi-modal Text Recognition Networks: Interactive Enhancements between Visual and Semantic Features by Byeonghu Na et al
12-02-2021	TBN-ViT: Temporal Bilateral Network with Vision Transformer for Video Scene Parsing by Bo Yan et al
12-02-2021	Leveraging Human Selective Attention for Medical Image Analysis with Limited Training Data by Yifei Huang et al
12-02-2021	GANSeg: Learning to Segment by Unsupervised Hierarchical Image Generation by Xingzhe He et al
12-02-2021	Stacked Temporal Attention: Improving First-person Action Recognition by Emphasizing Discriminative Clips by Lijin Yang et al
11-30-2021	Using a GAN to Generate Adversarial Examples to Facial Image Recognition by Andrew Merrigan et al
11-30-2021	HEAT: Holistic Edge Attention Transformer for Structured Reconstruction by Jiacheng Chen et al
11-30-2021	Robust 3D Garment Digitization from Monocular 2D Images for 3D Virtual Try-On Systems by Sahib Majithia et al
11-30-2021	MMPTRACK: Large-scale Densely Annotated Multi-camera Multiple People Tracking Benchmark by Xiaotian Han et al
11-30-2021	AirObject: A Temporally Evolving Graph Embedding for Object Identification by Nikhil Varma Keetha et al
11-30-2021	A Dataset-Dispersion Perspective on Reconstruction Versus Recognition in Single-View 3D Reconstruction Networks by Yefan Zhou et al
12-03-2021	Fully automatic integration of dental CBCT images and full-arch intraoral impressions with stitching error correction via individual tooth segmentation and identification by Tae Jun Jang et al
12-03-2021	A Systematic IoU-Related Method: Beyond Simplified Regression for Better Localization by Hanyang Peng et al
12-02-2021	Bio-inspired Polarization Event Camera by Germain Haessig et al
11-30-2021	Generative Convolution Layer for Image Generation by Seung Park et al
12-01-2021	Multi-task fusion for improving mammography screening data classification by Maria Wimmer et al
12-01-2021	Multi-Content Complementation Network for Salient Object Detection in Optical Remote Sensing Images by Gongyang Li et al

Craig Smith, December 7, 2021