2021.12.6 Vision papers

 

12-02-2021

Zero-Shot Text-Guided Object Generation with Dream Fields
by Ajay Jain et al

11-30-2021

HyperStyle: StyleGAN Inversion with HyperNetworks for Real Image Editing
by Yuval Alaluf et al

12-02-2021

SEAL: Self-supervised Embodied Active Learning using Exploration and 3D Consistency
by Devendra Singh Chaplot et al

12-01-2021

RegNeRF: Regularizing Neural Radiance Fields for View Synthesis from Sparse Inputs
by Michael Niemeyer et al

12-02-2021

Editing a classifier by rewriting its prediction rules
by Shibani Santurkar et al

11-30-2021

Hallucinated Neural Radiance Fields in the Wild
by Xingyu Chen et al

12-02-2021

Learning to Detect Every Thing in an Open World
by Kuniaki Saito et al

12-01-2021

PartImageNet: A Large, High-Quality Dataset of Parts
by Ju He et al

12-02-2021

BEVT: BERT Pretraining of Video Transformers
by Rui Wang et al

12-01-2021

Object-Aware Cropping for Self-Supervised Learning
by Shlok Mishra et al

11-30-2021

Diffusion Autoencoders: Toward a Meaningful and Decodable Representation
by Konpat Preechakul et al

11-30-2021

Task2Sim : Towards Effective Pre-training and Transfer from Synthetic Data
by Samarth Mishra et al

12-02-2021

Improved Multiscale Vision Transformers for Classification and Detection
by Yanghao Li et al

11-30-2021

3D Photo Stylization: Learning to Generate Stylized Novel Views from a Single Image
by Fangzhou Mu et al

12-02-2021

GLAMR: Global Occlusion-Aware Human Mesh Recovery with Dynamic Cameras
by Ye Yuan et al

11-30-2021

DiffSDFSim: Differentiable Rigid-Body Dynamics With Implicit Shapes
by Michael Strecke et al

12-01-2021

Translation-equivariant Image Quantizer for Bi-directional Image-Text Generation
by Woncheol Shin et al

12-02-2021

FuseDream: Training-Free Text-to-Image Generation with Improved CLIP+GAN Space Optimization
by Xingchao Liu et al

12-02-2021

DenseCLIP: Extract Free Dense Labels from CLIP
by Chong Zhou et al

12-02-2021

DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting
by Yongming Rao et al

11-30-2021

Sound-Guided Semantic Image Manipulation
by Seung Hyun Lee et al

11-30-2021

AdaViT: Adaptive Vision Transformers for Efficient Image Recognition
by Lingchen Meng et al

11-30-2021

NeuSample: Neural Sample Field for Efficient View Synthesis
by Jiemin Fang et al

12-02-2021

Masked-attention Mask Transformer for Universal Image Segmentation
by Bowen Cheng et al

12-01-2021

SegDiff: Image Segmentation with Diffusion Probabilistic Models
by Tomer Amit et al

12-01-2021

Extrapolating from a Single Image to a Thousand Classes using Distillation
by Yuki M. Asano et al

12-02-2021

Neural Head Avatars from Monocular RGB Videos
by Philip-William Grassal et al

11-30-2021

ATS: Adaptive Token Sampling For Efficient Vision Transformers
by Mohsen Fayyaz et al

12-01-2021

CLIPstyler: Image Style Transfer with a Single Text Condition
by Gihyun Kwon et al

12-02-2021

StyleMesh: Style Transfer for Indoor 3D Scene Reconstructions
by Lukas Höllein et al

12-01-2021

Improving GAN Equilibrium by Raising Spatial Awareness
by Jianyuan Wang et al

12-01-2021

Routing with Self-Attention for Multimodal Capsule Networks
by Kevin Duarte et al

12-01-2021

MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio Descriptions
by Mattia Soldan et al

12-03-2021

Reduced, Reused and Recycled: The Life of a Dataset in Machine Learning Research
by Bernard Koch et al

12-02-2021

Neural Weight Step Video Compression
by Mikolaj Czerkawski et al

12-02-2021

Efficient Neural Radiance Fields with Learned Depth-Guided Sampling
by Haotong Lin et al

12-01-2021

Object-aware Video-language Pre-training for Retrieval
by Alex Jinpeng Wang et al

12-01-2021

Robustness in Deep Learning for Computer Vision: Mind the gap?
by Nathan Drenkow et al

12-01-2021

Object-Centric Unsupervised Image Captioning
by Zihang Meng et al

11-30-2021

CRIS: CLIP-Driven Referring Image Segmentation
by Zhaoqing Wang et al

12-01-2021

Vision Pair Learning: An Efficient Training Framework for Image Classification
by Bei Tong et al

11-30-2021

VoRTX: Volumetric 3D Reconstruction With Transformers for Voxelwise View Selection and Fusion
by Noah Stier et al

12-01-2021

GANORCON: Are Generative Models Useful for Few-shot Segmentation?
by Oindrila Saha et al

12-02-2021

Learning Neural Light Fields with Ray-Space Embedding Networks
by Benjamin Attal et al

12-02-2021

Neural Point Light Fields
by Julian Ost et al

11-30-2021

Unsupervised Domain Adaptation: A Reality Check
by Kevin Musgrave et al

12-02-2021

Differentiable Spatial Planning using Transformers
by Devendra Singh Chaplot et al

12-02-2021

Dimensions of Motion: Learning to Predict a Subspace of Optical Flow from a Single Image
by Richard Strong Bowen et al

12-03-2021

NeRF-SR: High-Quality Neural Radiance Fields using Super-Sampling
by Chen Wang et al

12-03-2021

CoNeRF: Controllable Neural Radiance Fields
by Kacper Kania et al

12-01-2021

MonoScene: Monocular 3D Semantic Scene Completion
by Anh-Quan Cao et al

12-03-2021

Coupling Vision and Proprioception for Navigation of Legged Robots
by Zipeng Fu et al

12-01-2021

Consensus Graph Representation Learning for Better Grounded Image Captioning
by Wenqiao Zhang et al

12-01-2021

HyperInverter: Improving StyleGAN Inversion via Hypernetwork
by Tan M. Dinh et al

11-30-2021

NeRFReN: Neural Radiance Fields with Reflections
by Yuan-Chen Guo et al

12-01-2021

Reference-guided Pseudo-Label Generation for Medical Semantic Segmentation
by Constantin Seibold et al

12-01-2021

FaceTuneGAN: Face Autoencoder for Convolutional Expression Transfer Using Neural Generative Adversarial Networks
by Nicolas Olivier et al

12-01-2021

Confidence Propagation Cluster: Unleash Full Potential of Object Detectors
by Yichun Shen* et al

12-01-2021

The Shape Part Slot Machine: Contact-based Reasoning for Generating 3D Shapes from Parts
by Kai Wang et al

11-30-2021

Exponentially Tilted Gaussian Prior for Variational Autoencoder
by Griffin Floto et al

12-03-2021

Class-agnostic Reconstruction of Dynamic Objects from Videos
by Zhongzheng Ren et al

11-30-2021

Shunted Self-Attention via Multi-Scale Token Aggregation
by Sucheng Ren et al

12-02-2021

Recognizing Scenes from Novel Viewpoints
by Shengyi Qian et al

12-02-2021

Quantifying the uncertainty of neural networks using Monte Carlo dropout for deep learning based quantitative MRI
by Mehmet Yigit Avci et al

12-02-2021

Controllable Video Captioning with an Exemplar Sentence
by Yitian Yuan et al

11-30-2021

Shallow Network Based on Depthwise Over-Parameterized Convolution for Hyperspectral Image Classification
by Hongmin Gao et al

12-02-2021

Syntax Customized Video Captioning by Imitating Exemplar Sentences
by Yitian Yuan et al

12-02-2021

Inferring Prototypes for Multi-Label Few-Shot Image Classification with Word Vector Guided Attention
by Kun Yan et al

11-30-2021

SpaceEdit: Learning a Unified Editing Space for Open-Domain Image Editing
by Jing Shi et al

12-03-2021

Global Context with Discrete Diffusion in Vector Quantised Modelling for Image Generation
by Minghui Hu et al

12-01-2021

Relational Graph Learning for Grounded Video Description Generation
by Wenqiao Zhang et al

12-03-2021

Hierarchical Optimal Transport for Unsupervised Domain Adaptation
by Mourad El Hamri et al

11-30-2021

FENeRF: Face Editing in Neural Radiance Fields
by Jingxiang Sun et al

12-01-2021

The Norm Must Go On: Dynamic Unsupervised Domain Adaptation by Normalization
by M. Jehanzeb Mirza et al

12-03-2021

Efficient Two-Stage Detection of Human-Object Interactions with a Novel Unary-Pairwise Transformer
by Frederic Z. Zhang et al

12-01-2021

CYBORG: Blending Human Saliency Into the Loss Improves Deep Learning
by Aidan Boyd et al

11-30-2021

Is the use of Deep Learning and Artificial Intelligence an appropriate means to locate debris in the ocean without harming aquatic wildlife?
by Zoe Moorton et al

12-01-2021

Hierarchical Neural Implicit Pose Network for Animation and Motion Retargeting
by Sourav Biswas et al

12-02-2021

D3Net: A Speaker-Listener Architecture for Semi-supervised Dense Captioning and Visual Grounding in RGB-D Scans
by Dave Zhenyu Chen et al

12-02-2021

Altering Facial Expression Based on Textual Emotion
by Mohammad Imrul Jubair et al

12-01-2021

Learning Transformer Features for Image Quality Assessment
by Chao Zeng et al

12-01-2021

PreViTS: Contrastive Pretraining with Video Tracking Supervision
by Brian Chen et al

12-01-2021

Incomplete Multi-view Clustering via Cross-view Relation Transfer
by Yiming Wang et al

12-01-2021

Forward Operator Estimation in Generative Models with Kernel Transfer Operators
by Zhichun Huang et al

12-01-2021

FCAF3D: Fully Convolutional Anchor-Free 3D Object Detection
by Danila Rukhovich et al

12-01-2021

CondenseNeXt: An Ultra-Efficient Deep Neural Network for Embedded Systems
by Priyank Kalgaonkar et al

12-01-2021

Weakly-Supervised Video Object Grounding via Causal Intervention
by Wei Wang et al

12-01-2021

Federated Learning with Adaptive Batchnorm for Personalized Healthcare
by Yiqiang Chen et al

12-01-2021

Total-Body Low-Dose CT Image Denoising using Prior Knowledge Transfer Technique with Contrastive Regularization Mechanism
by Minghan Fu et al

11-30-2021

Revisiting Temporal Alignment for Video Restoration
by Kun Zhou et al

11-30-2021

LossPlot: A Better Way to Visualize Loss Landscapes
by Robert Bain et al

12-03-2021

Frame Averaging for Equivariant Shape Space Learning
by Matan Atzmon et al

12-01-2021

Rethink, Revisit, Revise: A Spiral Reinforced Self-Revised Network for Zero-Shot Learning
by Zhe Liu et al

12-02-2021

LMR-CBT: Learning Modality-fused Representations with CB-Transformer for Multimodal Emotion Recognition from Unaligned Multimodal Sequences
by Ziwang Fu et al

12-01-2021

MDFM: Multi-Decision Fusing Model for Few-Shot Learning
by Shuai Shao et al

11-30-2021

PoseKernelLifter: Metric Lifting of 3D Human Pose using Sound
by Zhijian Yang et al

12-02-2021

Deep residential representations: Using unsupervised learning to unlock elevation data for geo-demographic prediction
by Matthew Stevenson et al

11-30-2021

Open-Domain, Content-based, Multi-modal Fact-checking of Out-of-Context Images via Online Resources
by Sahar Abdelnabi et al

12-01-2021

Neural Emotion Director: Speech-preserving semantic control of facial expressions in in-the-wild videos
by Foivos Paraperas Papantoniou et al

12-01-2021

CDLNet: Noise-Adaptive Convolutional Dictionary Learning Network for Blind Denoising and Demosaicing
by Nikola Janjušević et al

12-01-2021

Automatic tumour segmentation in H&E-stained whole-slide images of the pancreas
by Pierpaolo Vendittelli et al

11-30-2021

NeeDrop: Self-supervised Shape Representation from Sparse Point Clouds using Needle Dropping
by Alexandre Boulch et al

11-30-2021

Leveraging The Topological Consistencies of Learning in Deep Neural Networks
by Stuart Synakowski et al

11-30-2021

Attentive Prototypes for Source-free Unsupervised Domain Adaptive 3D Object Detection
by Deepti Hegde et al

12-02-2021

Uni-Perceiver: Pre-training Unified Architecture for Generic Perception for Zero-shot and Few-shot Tasks
by Xizhou Zhu et al

12-01-2021

The Majority Can Help The Minority: Context-rich Minority Oversampling for Long-tailed Classification
by Seulki Park et al

12-01-2021

Deep Measurement Updates for Bayes Filters
by Johannes Pankert et al

12-03-2021

Data-Free Neural Architecture Search via Recursive Label Calibration
by Zechun Liu et al

12-01-2021

Learning to automate cryo-electron microscopy data collection with Ptolemy
by Paul T. Kim et al

12-01-2021

A Unified Benchmark for the Unknown Detection Capability of Deep Neural Networks
by Jihyo Kim et al

11-30-2021

Semi-Local Convolutions for LiDAR Scan Processing
by Larissa T. Triess et al

12-02-2021

Fast Neural Representations for Direct Volume Rendering
by Sebastian Weiss et al

11-30-2021

MapReader: A Computer Vision Pipeline for the Semantic Exploration of Maps at Scale
by Kasra Hosseini et al

12-03-2021

A Structured Dictionary Perspective on Implicit Neural Representations
by Gizem Yüce et al

11-30-2021

Light Field Implicit Representation for Flexible Resolution Reconstruction
by Paramanand Chandramouli et al

12-01-2021

Semi-Supervised Surface Anomaly Detection of Composite Wind Turbine Blades From Drone Imagery
by Jack. W. Barker et al

12-01-2021

DFTS2: Simulating Deep Feature Transmission Over Packet Loss Channels
by Ashiv Dhondea et al

11-30-2021

Scalable Primitives for Generalized Sensor Fusion in Autonomous Vehicles
by Sammy Sidhu et al

11-30-2021

The Devil is in the Margin: Margin-based Label Smoothing for Network Calibration
by Bingyuan Liu et al

11-30-2021

PokeBNN: A Binary Pursuit of Lightweight Accuracy
by Yichi Zhang et al

11-30-2021

A Highly Effective Low-Rank Compression of Deep Neural Networks with Modified Beam-Search and Modified Stable Rank
by Moonjung Eo et al

12-01-2021

Using Deep Image Prior to Assist Variational Selective Segmentation Deep Learning Algorithms
by Liam Burrows et al

12-02-2021

A Fast Knowledge Distillation Framework for Visual Recognition
by Zhiqiang Shen et al

11-30-2021

MC-SSL0.0: Towards Multi-Concept Self-Supervised Learning
by Sara Atito et al

11-30-2021

Improved sparse PCA method for face and image recognition
by Loc Hoang Tran et al

12-02-2021

CSAW-M: An Ordinal Classification Dataset for Benchmarking Mammographic Masking of Cancer
by Moein Sorkhei et al

12-01-2021

On-Device Spatial Attention based Sequence Learning Approach for Scene Text Script Identification
by Rutika Moharir et al

12-02-2021

Co-domain Symmetry for Complex-Valued Deep Learning
by Utkarsh Singhal et al

12-03-2021

ROCA: Robust CAD Model Retrieval and Alignment from a Single Image
by Can Gümeli et al

12-01-2021

Saliency Enhancement using Superpixel Similarity
by Leonardo de Melo Joao et al

11-30-2021

Ranking Distance Calibration for Cross-Domain Few-Shot Learning
by Pan Li et al

12-02-2021

N-ImageNet: Towards Robust, Fine-Grained Object Recognition with Event Cameras
by Junho Kim et al

12-03-2021

Adversarial Attacks against a Satellite-borne Multispectral Cloud Detector
by Andrew Du et al

12-02-2021

Is RobustBench/AutoAttack a suitable Benchmark for Adversarial Robustness?
by Peter Lorenz et al

12-02-2021

SCNet: A Generalized Attention-based Model for Crack Fault Segmentation
by Hrishikesh Sharma et al

12-03-2021

SSDL: Self-Supervised Dictionary Learning
by Shuai Shao et al

12-01-2021

Highly accelerated MR parametric mapping by undersampling the k-space and reducing the contrast number simultaneously with deep learning
by Yanjie Zhu et al

12-02-2021

Sample Prior Guided Robust Model Learning to Suppress Noisy Labels
by Wenkai Chen et al

11-30-2021

Voint Cloud: Multi-View Point Cloud Representation for 3D Understanding
by Abdullah Hamdi et al

12-02-2021

Localized Feature Aggregation Module for Semantic Segmentation
by Ryouichi Furukawa et al

12-01-2021

CLAWS: Contrastive Learning with hard Attention and Weak Supervision
by Jansel Herrera-Gerena et al

11-30-2021

Training BatchNorm Only in Neural Architecture Search and Beyond
by Yichen Zhu et al

12-02-2021

Structure-Aware Multi-Hop Graph Convolution for Graph Neural Networks
by Yang Li et al

12-03-2021

Adaptive Poincar\e Point to Set Distance for Few-Shot Classification
by Rongkai Ma et al

12-01-2021

Point Cloud Segmentation Using Sparse Temporal Local Attention
by Joshua Knights et al

12-03-2021

Geometric Feature Learning for 3D Meshes
by Huan Lei et al

11-30-2021

ConDA: Unsupervised Domain Adaptation for LiDAR Segmentation via Regularized Domain Concatenation
by Lingdong Kong et al

12-03-2021

Mind Your Clever Neighbours: Unsupervised Person Re-identification via Adaptive Clustering Relationship Modeling
by Lianjie Jia et al

12-02-2021

Training Efficiency and Robustness in Deep Learning
by Fartash Faghri

12-01-2021

Trimap-guided Feature Mining and Fusion Network for Natural Image Matting
by Weihao Jiang et al

11-30-2021

EdiBERT, a generative model for image editing
by Thibaut Issenhuth et al

12-03-2021

A Survey: Deep Learning for Hyperspectral Image Classification with Few Labeled Samples
by Sen Jia et al

12-02-2021

Probabilistic Approach for Road-Users Detection
by G. Melotti et al

11-30-2021

ZZ-Net: A Universal Rotation Equivariant Architecture for 2D Point Clouds
by Georg Bökman et al

12-02-2021

Active Learning for Domain Adaptation: An Energy-based Approach
by Binhui Xie et al

12-01-2021

3D Reconstruction Using a Linear Laser Scanner and a Camera
by Rui Wang

12-01-2021

ℓ∞ℓ∞-Robustness and Beyond: Unleashing Efficient Adversarial Training
by Hadi M. Dolatabadi et al

11-30-2021

Assessment of Data Consistency through Cascades of Independently Recurrent Inference Machines for fast and robust accelerated MRI reconstruction
by D. Karkalousos et al

12-03-2021

Bridging the Gap: Point Clouds for Merging Neurons in Connectomics
by Jules Berman et al

12-01-2021

Adv-4-Adv: Thwarting Changing Adversarial Perturbations via Adversarial Domain Adaptation
by Tianyue Zheng et al

12-02-2021

Object-aware Monocular Depth Prediction with Instance Convolutions
by Enis Simsar et al

11-30-2021

Benchmarking Deep Deblurring Algorithms: A Large-Scale Multi-Cause Dataset and A New Baseline Model
by Kaihao Zhang et al

12-01-2021

A benchmark with decomposed distribution shifts for 360 monocular depth estimation
by Georgios Albanis et al

11-30-2021

Automatic Synthesis of Diverse Weak Supervision Sources for Behavior Analysis
by Albert Tseng et al

12-02-2021

TCTN: A 3D-Temporal Convolutional Transformer Network for Spatiotemporal Predictive Learning
by Ziao Yang et al

12-02-2021

Batch Normalization Tells You Which Filter is Important
by Junghun Oh et al

12-01-2021

Multi-View Stereo with Transformer
by Jie Zhu et al

11-30-2021

Fully Automatic Deep Learning Framework for Pancreatic Ductal Adenocarcinoma Detection on Computed Tomography
by Natália Alves et al

11-30-2021

Querying Labelled Data with Scenario Programs for Sim-to-Real Validation
by Edward Kim et al

11-30-2021

3DVNet: Multi-View Depth Prediction and Volumetric Refinement
by Alexander Rich et al

12-01-2021

Human-Object Interaction Detection via Weak Supervision
by Mert Kilickaya et al

12-02-2021

Deep Depth from Focus with Differential Focus Volume
by Fengting Yang et al

12-01-2021

Optimizing for In-memory Deep Learning with Emerging Memory Technology
by Zhehui Wang et al

12-01-2021

Label-Free Model Evaluation with Semi-Structured Dataset Representations
by Xiaoxiao Sun et al

12-01-2021

Transformer-based Network for RGB-D Saliency Detection
by Yue Wang et al

12-01-2021

Background Activation Suppression for Weakly Supervised Object Localization
by Pingyu Wu et al

11-30-2021

Regularized directional representations for medical image registration
by Vincent Jaouen et al

12-01-2021

Information Theoretic Representation Distillation
by Roy Miles et al

12-01-2021

On Salience-Sensitive Sign Classification in Autonomous Vehicle Path Planning: Experimental Explorations with a Novel Dataset
by Ross Greer et al

12-01-2021

Dual Spoof Disentanglement Generation for Face Anti-spoofing with Depth Uncertainty Learning
by Hangtong Wu et al

12-02-2021

Engineering AI Tools for Systematic and Scalable Quality Assessment in Magnetic Resonance Imaging
by Yukai Zou et al

11-30-2021

GLocal: Global Graph Reasoning and Local Structure Transfer for Person Image Generation
by Liyuan Ma et al

12-01-2021

Both Style and Fog Matter: Cumulative Domain Adaptation for Semantic Foggy Scene Understanding
by Xianzheng Ma et al

12-02-2021

CloudWalker: 3D Point Cloud Learning by Random Walks for Shape Analysis
by Adi Mesika et al

11-30-2021

Predicting Poverty Level from Satellite Imagery using Deep Neural Networks
by Varun Chitturi et al

12-03-2021

Image-to-image Translation as a Unique Source of Knowledge
by Alejandro D. Mousist

12-01-2021

Revisiting the Transferability of Supervised Pretraining: an MLP Perspective
by Yizhou Wang et al

12-03-2021

Unsupervised Low-Light Image Enhancement via Histogram Equalization Prior
by Feng Zhang et al

12-03-2021

MT-TransUNet: Mediating Multi-Task Tokens in Transformers for Skin Lesion Segmentation and Classification
by Jingye Chen et al

12-03-2021

Boosting Unsupervised Domain Adaptation with Soft Pseudo-label and Curriculum Learning
by Shengjia Zhang et al

12-02-2021

The Surprising Effectiveness of Representation Learning for Visual Imitation
by Jyothish Pari et al

12-01-2021

Dyadic Human Motion Prediction
by Isinsu Katircioglu et al

11-30-2021

Seeking Salient Facial Regions for Cross-Database Micro-Expression Recognition
by Xingxun Jiang et al

11-30-2021

ARTSeg: Employing Attention for Thermal images Semantic Segmentation
by Farzeen Munir et al

12-03-2021

MSP : Refine Boundary Segmentation via Multiscale Superpixel
by Jie Zhu et al

12-01-2021

Subtask-dominated Transfer Learning for Long-tail Person Search
by Chuang Liu et al

12-03-2021

AirDet: Few-Shot Detection without Fine-tuning for Autonomous Exploration
by Bowen Li et al

12-02-2021

Self-Supervised Material and Texture Representation Learning for Remote Sensing Tasks
by Peri Akiva et al

12-01-2021

FDA-GAN: Flow-based Dual Attention GAN for Human Pose Transfer
by Liyuan Ma et al

12-01-2021

DeepSportLab: a Unified Framework for Ball Detection, Player Instance Segmentation and Pose Estimation in Team Sports Scenes
by Seyed Abolfazl Ghasemzadeh et al

11-30-2021

Spatio-Temporal Multi-Flow Network for Video Frame Interpolation
by Duolikun Danier et al

12-02-2021

Multi-modal application: Image Memes Generation
by Zhiyuan Liu et al

12-02-2021

Just Drive: Colour Bias Mitigation for Semantic Segmentation in the Context of Urban Driving
by Jack Stelling et al

11-30-2021

Affect-DML: Context-Aware One-Shot Recognition of Human Affect using Deep Metric Learning
by Kunyu Peng et al

11-30-2021

The MIS Check-Dam Dataset for Object Detection and Instance Segmentation Tasks
by Chintan Tundia et al

11-30-2021

Anonymization for Skeleton Action Recognition
by Myeonghyeon Kim et al

11-30-2021

AdaAfford: Learning to Adapt Manipulation Affordance for 3D Articulated Objects via Few-shot Interactions
by Yian Wang et al

11-30-2021

CT-block: a novel local and global features extractor for point cloud
by Shangwei Guo et al

12-03-2021

Incremental Learning in Semantic Segmentation from Image Labels
by Fabio Cermelli et al

12-03-2021

MFNet: Multi-filter Directive Network for Weakly Supervised Salient Object Detection
by Yongri Piao et al

12-03-2021

Action Units That Constitute Trainable Micro-expressions (and A Large-scale Synthetic Dataset)
by Yuchi Liu et al

12-03-2021

Gesture Recognition with a Skeleton-Based Keyframe Selection Module
by Yunsoo Kim et al

12-03-2021

Music-to-Dance Generation with Optimal Transport
by Shuang Wu et al

11-30-2021

Improving Differentiable Architecture Search with a Generative Model
by Ruisi Zhang et al

11-30-2021

Two-stage Temporal Modelling Framework for Video-based Depression Recognition using Graph Representation
by Jiaqi Xu et al

12-02-2021

Hybrid Instance-aware Temporal Fusion for Online Video Instance Segmentation
by Xiang Li et al

12-01-2021

Camera Motion Agnostic 3D Human Pose Estimation
by Seong Hyun Kim et al

12-01-2021

Automatic travel pattern extraction from visa page stamps using CNN models
by Eimantas Ledinauskas et al

11-30-2021

Boosting Discriminative Visual Representation Learning with Scenario-Agnostic Mixup
by Siyuan Li et al

12-01-2021

Learning Oriented Remote Sensing Object Detection via Naive Geometric Computing
by Yanjie Wang et al

12-02-2021

TransZero: Attribute-guided Transformer for Zero-Shot Learning
by Shiming Chen et al

12-03-2021

TRNR: Task-Driven Image Rain and Noise Removal with a Few Images Based on Patch Analysis
by Wu Ran et al

12-02-2021

Make A Long Image Short: Adaptive Token Length for Vision Transformers
by Yichen Zhu et al

12-03-2021

Detect Faces Efficiently: A Survey and Evaluations
by Yuantao Feng et al

12-01-2021

Attribute Artifacts Removal for Geometry-based Point Cloud Compression
by Xihua Sheng et al

12-01-2021

Maximum Consensus by Weighted Influences of Monotone Boolean Functions
by Erchuan Zhang et al

12-03-2021

Detection of Large Vessel Occlusions using Deep Learning by Deforming Vessel Tree Segmentations
by Florian Thamm et al

12-01-2021

Push Stricter to Decide Better: A Class-Conditional Feature Adaptive Framework for Improving Adversarial Robustness
by Jia-Li Yin et al

12-01-2021

Graph Convolutional Module for Temporal Action Localization in Videos
by Runhao Zeng et al

12-01-2021

Visual-Semantic Transformer for Scene Text Recognition
by Xin Tang et al

11-30-2021

A Face Recognition Systems Worst Morph Nightmare, Theoretically
by Una M. Kelly et al

11-30-2021

Pattern-Aware Data Augmentation for LiDAR 3D Object Detection
by Jordan S. K. Hu et al

12-02-2021

3D-Aware Semantic-Guided Generative Model for Human Synthesis
by Jichao Zhang et al

12-02-2021

Attention based Occlusion Removal for Hybrid Telepresence Systems
by Surabhi Gupta et al

11-30-2021

An implementation of the Guess who? game using CLIP
by Arnau Martí Sarri et al

12-01-2021

Interpretable Deep Learning-Based Forensic Iris Segmentation and Recognition
by Andrey Kuehlkamp et al

11-30-2021

Beyond Flatland: Pre-training with a Strong 3D Inductive Bias
by Shubhaankar Gupta et al

11-30-2021

PlantStereo: A Stereo Matching Benchmark for Plant Surface Dense Reconstruction
by Qingyu Wang et al

12-02-2021

TISE: A Toolbox for Text-to-Image Synthesis Evaluation
by Tan M. Dinh et al

11-30-2021

Point Cloud Instance Segmentation with Semi-supervised Bounding-Box Mining
by Yongbin Liao et al

12-01-2021

Generalized Closed-form Formulae for Feature-based Subpixel Alignment in Patch-based Matching
by Laurent Valentin Jospin et al

12-02-2021

Video-Text Pre-training with Learned Regions
by Rui Yan et al

12-03-2021

Semantic Map Injected GAN Training for Image-to-Image Translation
by Balaram Singh Kshatriya et al

11-30-2021

ESL: Event-based Structured Light
by Manasi Muglikar et al

11-30-2021

Contrastive Learning for Local and Global Learning MRI Reconstruction
by Qiaosi Yi et al

11-30-2021

HRNET: AI on Edge for mask detection and social distancing
by Kinshuk Sengupta et al

11-30-2021

TALISMAN: Targeted Active Learning for Object Detection with Rare Classes and Slices using Submodular Mutual Information
by Suraj Kothawade et al

11-30-2021

Detecting Extratropical Cyclones of the Northern Hemisphere with Single Shot Detector
by Minjing Shi et al

12-02-2021

TransFGU: A Top-down Approach to Fine-Grained Unsupervised Semantic Segmentation
by Zhaoyuan Yin et al

12-02-2021

Machine Learning-Based Classification Algorithms for the Prediction of Coronary Heart Diseases
by Kelvin Kwakye et al

12-01-2021

Generating Diverse 3D Reconstructions from a Single Occluded Face Image
by Rahul Dey et al

12-02-2021

Learning Spatial-Temporal Graphs for Active Speaker Detection
by Sourya Roy et al

11-30-2021

TridentAdapt: Learning Domain-invariance via Source-Target Confrontation and Self-induced Cross-domain Augmentation
by Fengyi Shen et al

11-30-2021

RADU: Ray-Aligned Depth Update Convolutions for ToF Data Denoising
by Michael Schelling et al

11-30-2021

FMD-cGAN: Fast Motion Deblurring using Conditional Generative Adversarial Networks
by Jatin Kumar et al

12-02-2021

Self-supervised Video Transformer
by Kanchana Ranasinghe et al

12-02-2021

Probabilistic Tracking with Deep Factors
by Fan Jiang et al

12-02-2021

OW-DETR: Open-world Detection Transformer
by Akshita Gupta et al

12-01-2021

Event Neural Networks
by Matthew Dutson et al

12-03-2021

Panoptic-based Object Style-Align for Image-to-Image Translation
by Liyun Zhang et al

11-30-2021

PolyWorld: Polygonal Building Extraction with Graph Neural Networks in Satellite Images
by Stefano Zorzi et al

12-03-2021

Total Scale: Face-to-Body Detail Reconstruction from Sparse RGBD Sensors
by Zheng Dong et al

12-01-2021

FaSS-MVS -- Fast Multi-View Stereo with Surface-Aware Semi-Global Matching from UAV-borne Monocular Imagery
by Boitumelo Ruf et al

11-30-2021

360MonoDepth: High-Resolution 360{\deg} Monocular Depth Estimation
by Manuel Rey-Area et al

12-02-2021

TransMEF: A Transformer-Based Multi-Exposure Image Fusion Framework using Self-Supervised Multi-Task Learning
by Linhao Qu et al

12-03-2021

Lightweight Attentional Feature Fusion for Video Retrieval by Text
by Fan Hu et al

11-30-2021

Automated Damage Inspection of Power Transmission Towers from UAV Images
by Aleixo Cambeiro Barreiro et al

12-02-2021

3rd Place Solution for NeurIPS 2021 Shifts Challenge: Vehicle Motion Prediction
by Ching-Yu Tseng et al

12-01-2021

Unsupervised Statistical Learning for Die Analysis in Ancient Numismatics
by Andreas Heinecke et al

12-03-2021

Towards Super-Resolution CEST MRI for Visualization of Small Structures
by Lukas Folle et al

12-03-2021

Novel Class Discovery in Semantic Segmentation
by Yuyang Zhao et al

12-03-2021

The Box Size Confidence Bias Harms Your Object Detector
by Johannes Gilg et al

12-01-2021

Multiple Fusion Adaptation: A Strong Framework for Unsupervised Semantic Segmentation Adaptation
by Kai Zhang et al

11-30-2021

A Unified Pruning Framework for Vision Transformers
by Hao Yu et al

11-30-2021

Probabilistic Estimation of 3D Human Shape and Pose with a Semantic Local Parametric Model
by Akash Sengupta et al

11-30-2021

Boosting EfficientNets Ensemble Performance via Pseudo-Labels and Synthetic Images by pix2pixHD for Infection and Ischaemia Classification in Diabetic Foot Ulcers
by Louise Bloch et al

12-02-2021

Open-set 3D Object Detection
by Jun Cen et al

12-02-2021

Hamiltonian prior to Disentangle Content and Motion in Image Sequences
by Asif Khan et al

12-02-2021

SwinTrack: A Simple and Strong Baseline for Transformer Tracking
by Liting Lin et al

11-30-2021

Robust Partial-to-Partial Point Cloud Registration in a Full Range
by Liang Pan et al

11-30-2021

Human Imperceptible Attacks and Applications to Improve Fairness
by Xinru Hua et al

12-02-2021

Putting 3D Spatially Sparse Networks on a Diet
by Junha Lee et al

12-02-2021

Unconstrained Face Sketch Synthesis via Perception-Adaptive Network and A New Benchmark
by Lin Nie et al

11-30-2021

MEFNet: Multi-scale Event Fusion Network for Motion Deblurring
by Lei Sun et al

11-30-2021

Large-Scale Video Analytics through Object-Level Consolidation
by Daniel Rivas et al

12-02-2021

Iterative Frame-Level Representation Learning And Classification For Semi-Supervised Temporal Action Segmentation
by Dipika Singhania et al

12-02-2021

FIBA: Frequency-Injection based Backdoor Attack in Medical Image Analysis
by Yu Feng et al

12-02-2021

Video Frame Interpolation without Temporal Priors
by Youjian Zhang et al

11-30-2021

CLIP Meets Video Captioners: Attribute-Aware Representation Learning Promotes Accurate Captioning
by Bang Yang et al

12-02-2021

MTFNet: Mutual-Transformer Fusion Network for RGB-D Salient Object Detection
by Xixi Wang et al

12-02-2021

Overcoming the Domain Gap in Neural Action Representations
by Semih Günel et al

12-02-2021

MegBA: A High-Performance and Distributed Library for Large-Scale Bundle Adjustment
by Jie Ren et al

12-02-2021

NeSF: Neural Shading Field for Image Harmonization
by Zhongyun Hu et al

12-01-2021

Unleashing the Potential of Unsupervised Pre-Training with Intra-Identity Regularization for Person Re-Identification
by Zizheng Yang et al

12-01-2021

Temporally Resolution Decrement: Utilizing the Shape Consistency for Higher Computational Efficiency
by Tianshu Xie et al

11-30-2021

Low-light Image Enhancement via Breaking Down the Darkness
by Qiming Hu et al

11-30-2021

ColibriDoc: An Eye-in-Hand Autonomous Trocar Docking System
by Shervin Dehghani et al

11-30-2021

Reconstruction Student with Attention for Student-Teacher Pyramid Matching
by Shinji Yamada et al

12-02-2021

Deep Learning-Based Carotid Artery Vessel Wall Segmentation in Black-Blood MRI Using Anatomical Priors
by Dieuwertje Alblas et al

11-30-2021

Zero-Shot Semantic Segmentation via Spatial and Multi-Scale Aware Visual Class Embedding
by Sungguk Cha et al

11-30-2021

SamplingAug: On the Importance of Patch Sampling Augmentation for Single Image Super-Resolution
by Shizun Wang et al

12-02-2021

Semantic-Sparse Colorization Network for Deep Exemplar-based Colorization
by Yunpeng Bai et al

11-30-2021

Semi-Supervised 3D Hand Shape and Pose Estimation with Label Propagation
by Samira Kaviani et al

12-02-2021

Stronger Baseline for Person Re-Identification
by Fengliang Qi et al

12-02-2021

The Second Place Solution for ICCV2021 VIPriors Instance Segmentation Challenge
by Bo Yan et al

12-02-2021

Fast automatic deforestation detectors and their extensions for other spatial objects
by Jesper Muren et al

12-02-2021

InsCLR: Improving Instance Retrieval with Self-Supervision
by Zelu Deng et al

12-02-2021

Local Similarity Pattern and Cost Self-Reassembling for Deep Stereo Matching Networks
by Biyang Liu et al

12-01-2021

Optimization of phase-only holograms calculated with scaled diffraction calculation through deep neural networks
by Yoshiyuki Ishii et al

12-03-2021

SGM3D: Stereo Guided Monocular 3D Object Detection
by Zheyuan Zhou et al

11-30-2021

Multi-modal Text Recognition Networks: Interactive Enhancements between Visual and Semantic Features
by Byeonghu Na et al

12-02-2021

TBN-ViT: Temporal Bilateral Network with Vision Transformer for Video Scene Parsing
by Bo Yan et al

12-02-2021

Leveraging Human Selective Attention for Medical Image Analysis with Limited Training Data
by Yifei Huang et al

12-02-2021

GANSeg: Learning to Segment by Unsupervised Hierarchical Image Generation
by Xingzhe He et al

12-02-2021

Stacked Temporal Attention: Improving First-person Action Recognition by Emphasizing Discriminative Clips
by Lijin Yang et al

11-30-2021

Using a GAN to Generate Adversarial Examples to Facial Image Recognition
by Andrew Merrigan et al

11-30-2021

HEAT: Holistic Edge Attention Transformer for Structured Reconstruction
by Jiacheng Chen et al

11-30-2021

Robust 3D Garment Digitization from Monocular 2D Images for 3D Virtual Try-On Systems
by Sahib Majithia et al

11-30-2021

MMPTRACK: Large-scale Densely Annotated Multi-camera Multiple People Tracking Benchmark
by Xiaotian Han et al

11-30-2021

AirObject: A Temporally Evolving Graph Embedding for Object Identification
by Nikhil Varma Keetha et al

11-30-2021

A Dataset-Dispersion Perspective on Reconstruction Versus Recognition in Single-View 3D Reconstruction Networks
by Yefan Zhou et al

12-03-2021

Fully automatic integration of dental CBCT images and full-arch intraoral impressions with stitching error correction via individual tooth segmentation and identification
by Tae Jun Jang et al

12-03-2021

A Systematic IoU-Related Method: Beyond Simplified Regression for Better Localization
by Hanyang Peng et al

12-02-2021

Bio-inspired Polarization Event Camera
by Germain Haessig et al

11-30-2021

Generative Convolution Layer for Image Generation
by Seung Park et al

12-01-2021

Multi-task fusion for improving mammography screening data classification
by Maria Wimmer et al

12-01-2021

Multi-Content Complementation Network for Salient Object Detection in Optical Remote Sensing Images
by Gongyang Li et al

 
Craig Smith