04-01-2021
|
EfficientNetV2: Smaller Models and Faster Training
by
Mingxing Tan
et al
|
|
|
|
03-31-2021
|
StyleCLIP: Text-Driven Manipulation of StyleGAN Imagery
by
Or Patashnik
et al
|
|
|
|
03-31-2021
|
Going deeper with Image Transformers
by
Hugo Touvron
et al
|
|
|
|
04-01-2021
|
In&Out : Diverse Image Outpainting via GAN
Inversion
by
Yen-Chi Cheng
et al
|
|
|
|
03-30-2021
|
Rethinking Spatial Dimensions of Vision Transformers
by
Byeongho Heo
et al
|
|
|
|
04-01-2021
|
Towards General Purpose Vision Systems
by
Tanmay Gupta
et al
|
|
|
|
04-01-2021
|
LoFTR: Detector-Free Local Feature Matching with
Transformers
by
Jiaming Sun
et al
|
|
|
|
03-30-2021
|
3D Human Pose and Shape Regression with Pyramidal Mesh
Alignment Feedback Loop
by
Hongwen Zhang
et al
|
|
|
|
04-01-2021
|
Unconstrained Scene Generation with Locally Conditioned
Radiance Fields
by
Terrance DeVries
et al
|
|
|
|
04-01-2021
|
Frozen in Time: A Joint Video and Image Encoder for
End-to-End Retrieval
by
Max Bain
et al
|
|
|
|
04-01-2021
|
Putting NeRF on a Diet: Semantically Consistent
Few-Shot View Synthesis
by
Ajay Jain
et al
|
|
|
|
04-01-2021
|
NeuralRecon: Real-Time Coherent 3D Reconstruction from
Monocular Video
by
Jiaming Sun
et al
|
|
|
|
03-31-2021
|
Learning Generalizable Robotic Reward Functions from
In-The-Wild Human Videos
by
Annie S. Chen
et al
|
|
|
|
03-30-2021
|
Dual Contrastive Loss and Attention for GANs
by
Ning Yu
et al
|
|
|
|
04-01-2021
|
Reconstructing 3D Human Pose by Watching Humans in the
Mirror
by
Qi Fang
et al
|
|
|
|
03-30-2021
|
Seasonal Contrast: Unsupervised Pre-Training from
Uncurated Remote Sensing Data
by
Oscar Mañas
et al
|
|
|
|
03-30-2021
|
Benchmarking Representation Learning for Natural World
Image Collections
by
Grant Van Horn
et al
|
|
|
|
03-30-2021
|
Broaden Your Views for Self-Supervised Video Learning
by
Adrià Recasens
et al
|
|
|
|
04-01-2021
|
PhySG: Inverse Rendering with Spherical Gaussians for
Physics-based Material Editing and Relighting
by
Kai Zhang
et al
|
|
|
|
03-31-2021
|
Semi-supervised Synthesis of High-Resolution Editable
Textures for 3D Humans
by
Bindita Chaudhuri
et al
|
|
|
|
03-30-2021
|
Unsupervised Learning of 3D Object Categories from
Videos in the Wild
by
Philipp Henzler
et al
|
|
|
|
03-31-2021
|
Rethinking Style Transfer: From Pixels to Parameterized
Brushstrokes
by
Dmytro Kotovenko
et al
|
|
|
|
03-30-2021
|
Learning monocular 3D reconstruction of articulated
categories from motion
by
Filippos Kokkinos
et al
|
|
|
|
04-01-2021
|
LatentCLR: A Contrastive Learning Approach for
Unsupervised Discovery of Interpretable Directions
by
Oğuz Kaan Yüksel
et al
|
|
|
|
03-30-2021
|
Foveated Neural Radiance Fields for Real-Time and
Egocentric Virtual Reality
by
Nianchen Deng
et al
|
|
|
|
04-01-2021
|
NPMs: Neural Parametric Models for 3D Deformable Shapes
by
Pablo Palafox
et al
|
|
|
|
03-31-2021
|
LIFT-SLAM: a deep-learning feature-based monocular
visual SLAM method
by
Hudson M. S. Bruno
et al
|
|
|
|
03-31-2021
|
CAMPARI: Camera-Aware Decomposed Generative Neural
Radiance Fields
by
Michael Niemeyer
et al
|
|
|
|
03-31-2021
|
Human POSEitioning System (HPS): 3D Human Pose
Estimation and Self-localization in Large Scenes from
Body-Mounted Sensors
by
Vladimir Guzov
et al
|
|
|
|
03-31-2021
|
Analysis on Image Set Visual Question Answering
by
Abhinav Khattar
et al
|
|
|
|
03-30-2021
|
Weakly-Supervised Image Semantic Segmentation Using
Graph Convolutional Networks
by
Shun-Yi Pan
et al
|
|
|
|
03-31-2021
|
RetrievalFuse: Neural 3D Scene Reconstruction with a
Database
by
Yawar Siddiqui
et al
|
|
|
|
04-02-2021
|
Decomposing 3D Scenes into Objects via Unsupervised
Volume Segmentation
by
Karl Stelzner
et al
|
|
|
|
03-31-2021
|
Training robust deep learning models for medical
imaging tasks with spectral decoupling
by
Joona Pohjonen
et al
|
|
|
|
04-02-2021
|
Multiple Heads are Better than One: Few-shot Font
Generation with Multiple Localized Experts
by
Song Park
et al
|
|
|
|
03-31-2021
|
Rainbow Memory: Continual Learning with a Memory of
Diverse Samples
by
Jihwan Bang
et al
|
|
|
|
03-30-2021
|
Thinking Fast and Slow: Efficient Text-to-Visual
Retrieval with Transformers
by
Antoine Miech
et al
|
|
|
|
04-01-2021
|
Is Label Smoothing Truly Incompatible with Knowledge
Distillation: An Empirical Study
by
Zhiqiang Shen
et al
|
|
|
|
03-31-2021
|
Rethinking Self-supervised Correspondence Learning: A
Video Frame-level Similarity Perspective
by
Jiarui Xu
et al
|
|
|
|
04-01-2021
|
Divergence Optimization for Noisy Universal Domain
Adaptation
by
Qing Yu
et al
|
|
|
|
03-31-2021
|
Learning Spatio-Temporal Transformer for Visual
Tracking
by
Bin Yan
et al
|
|
|
|
04-01-2021
|
Avalanche: an End-to-End Library for Continual Learning
by
Vincenzo Lomonaco
et al
|
|
|
|
04-01-2021
|
The Spatially-Correlative Loss for Various Image
Translation Tasks
by
Chuanxia Zheng
et al
|
|
|
|
04-01-2021
|
Neural Video Portrait Relighting in Real-time via
Consistency Modeling
by
Longwen Zhang
et al
|
|
|
|
04-01-2021
|
Sketch2Mesh: Reconstructing and Editing 3D Shapes from
Sketches
by
Benoit Guillard
et al
|
|
|
|
04-01-2021
|
TrajeVAE -- Controllable Human Motion Generation from
Trajectories
by
Kacper Kania
et al
|
|
|
|
03-31-2021
|
Video Exploration via Video-Specific Autoencoders
by
Kevin Wang
et al
|
|
|
|
03-31-2021
|
VITON-HD: High-Resolution Virtual Try-On via
Misalignment-Aware Normalization
by
Seunghwan Choi
et al
|
|
|
|
04-01-2021
|
LED2-Net: Monocular 360 Layout Estimation via
Differentiable Depth Rendering
by
Fu-En Wang
et al
|
|
|
|
04-01-2021
|
Group-Free 3D Object Detection via Transformers
by
Ze Liu
et al
|
|
|
|
03-31-2021
|
ReMix: Towards Image-to-Image Translation with Limited
Data
by
Jie Cao
et al
|
|
|
|
03-30-2021
|
Fast and Accurate Normal Estimation for Point Cloud via
Patch Stitching
by
Jun Zhou
et al
|
|
|
|
03-31-2021
|
An Investigation of Critical Issues in Bias Mitigation
Techniques
by
Robik Shrestha
et al
|
|
|
|
03-30-2021
|
SIMstack: A Generative Shape and Instance Model for
Unordered Object Stacks
by
Zoe Landgraf
et al
|
|
|
|
03-30-2021
|
Diagnosing Vision-and-Language Navigation: What Really
Matters
by
Wanrong Zhu
et al
|
|
|
|
04-01-2021
|
UC2: Universal Cross-lingual Cross-modal
Vision-and-Language Pre-training
by
Mingyang Zhou
et al
|
|
|
|
04-01-2021
|
Improving Calibration for Long-Tailed Recognition
by
Zhisheng Zhong
et al
|
|
|
|
03-31-2021
|
DCVNet: Dilated Cost Volume Networks for Fast Optical
Flow
by
Huaizu Jiang
et al
|
|
|
|
04-01-2021
|
Text to Image Generation with Semantic-Spatial Aware
GAN
by
Wentong Liao
et al
|
|
|
|
04-01-2021
|
Composable Augmentation Encoding for Video
Representation Learning
by
Chen Sun
et al
|
|
|
|
04-01-2021
|
Efficient and Differentiable Shadow Computation for
Inverse Problems
by
Linjie Lyu
et al
|
|
|
|
03-31-2021
|
Neural Surface Maps
by
Luca Morreale
et al
|
|
|
|
03-30-2021
|
Attention, please! A survey of Neural Attention Models
in Deep Learning
by
Alana de Santana Correia
et al
|
|
|
|
03-30-2021
|
Towards More Flexible and Accurate Object Tracking with
Natural Language: Algorithms and Benchmark
by
Xiao Wang
et al
|
|
|
|
03-30-2021
|
HapTable: An Interactive Tabletop Providing Online
Haptic Feedback for Touch Gestures
by
Senem Ezgi Emgin
et al
|
|
|
|
04-01-2021
|
A Realistic Evaluation of Semi-Supervised Learning for
Fine-Grained Classification
by
Jong-Chyi Su
et al
|
|
|
|
04-01-2021
|
Famous Companies Use More Letters in Logo:A Large-Scale
Analysis of Text Area in Logo
by
Shintaro Nishi
et al
|
|
|
|
03-30-2021
|
Contrastive Learning of Single-Cell Phenotypic
Representations for Treatment Classification
by
Alexis Perakis
et al
|
|
|
|
04-01-2021
|
Deep Two-View Structure-from-Motion Revisited
by
Jianyuan Wang
et al
|
|
|
|
03-30-2021
|
Kaleido-BERT: Vision-Language Pre-training on Fashion
Domain
by
Mingchen Zhuge
et al
|
|
|
|
03-31-2021
|
Scale-aware Automatic Augmentation for Object Detection
by
Yukang Chen
et al
|
|
|
|
03-31-2021
|
DA-DETR: Domain Adaptive Detection Transformer by
Hybrid Attention
by
Jingyi Zhang
et al
|
|
|
|
03-30-2021
|
A study of latent monotonic attention variants
by
Albert Zeyer
et al
|
|
|
|
04-01-2021
|
Students are the Best Teacher: Exit-Ensemble
Distillation with Multi-Exits
by
Hojung Lee
et al
|
|
|
|
03-30-2021
|
3D AffordanceNet: A Benchmark for Visual Object
Affordance Understanding
by
Shengheng Deng
et al
|
|
|
|
03-30-2021
|
Grounding Physical Concepts of Objects and Events
Through Dynamic Visual Reasoning
by
Zhenfang Chen
et al
|
|
|
|
03-31-2021
|
Rapid quantification of COVID-19 pneumonia burden from
computed tomography with convolutional LSTM networks
by
Kajetan Grodecki
et al
|
|
|
|
03-31-2021
|
Facial expression and attributes recognition based on
multi-task learning of lightweight neural networks
by
Andrey V. Savchenko
|
|
|
|
04-01-2021
|
Deep Multi-Resolution Dictionary Learning for
Histopathology Image Analysis
by
Nima Hatami
et al
|
|
|
|
04-01-2021
|
SimPoE: Simulated Character Control for 3D Human Pose
Estimation
by
Ye Yuan
et al
|
|
|
|
04-01-2021
|
Multiview Pseudo-Labeling for Semi-supervised Learning
from Video
by
Bo Xiong
et al
|
|
|
|
03-30-2021
|
What Causes Optical Flow Networks to be Vulnerable to
Physical Adversarial Attacks
by
Simon Schrodi
et al
|
|
|
|
04-01-2021
|
Bipartite Graph Network with Adaptive Message Passing
for Unbiased Scene Graph Generation
by
Rongjie Li
et al
|
|
|
|
04-01-2021
|
Unsupervised Sound Localization via Iterative
Contrastive Learning
by
Yan-Bo Lin
et al
|
|
|
|
04-01-2021
|
Learning to Track Instances without Video Annotations
by
Yang Fu
et al
|
|
|
|
03-31-2021
|
FANet: A Feedback Attention Network for Improved
Biomedical Image Segmentation
by
Nikhil Kumar Tomar
et al
|
|
|
|
03-31-2021
|
Two-phase weakly supervised object detection with
pseudo ground truth mining
by
Jun Wang
|
|
|
|
04-01-2021
|
Wide-Depth-Range 6D Object Pose Estimation in Space
by
Yinlin Hu
et al
|
|
|
|
04-01-2021
|
STMTrack: Template-free Visual Tracking with Space-time
Memory Networks
by
Zhihong Fu
et al
|
|
|
|
04-01-2021
|
Modular Adaptation for Cross-Domain Few-Shot Learning
by
Xiao Lin
et al
|
|
|
|
03-30-2021
|
Repopulating Street Scenes
by
Yifan Wang
et al
|
|
|
|
04-01-2021
|
Commonsense Spatial Reasoning for Visually Intelligent
Agents
by
Agnese Chiatti
et al
|
|
|
|
04-01-2021
|
Touch-based Curiosity for Sparse-Reward Tasks
by
Sai Rajeswar
et al
|
|
|
|
04-01-2021
|
Learning Deep Latent Subspaces for Image Denoising
by
Yunhao Yang
et al
|
|
|
|
04-01-2021
|
Fostering Generalization in Single-view 3D
Reconstruction by Learning a Hierarchy of Local and
Global Shape Priors
by
Jan Bechtold
et al
|
|
|
|
03-31-2021
|
A comparative evaluation of learned feature descriptors
on hybrid monocular visual SLAM methods
by
Hudson M. S. Bruno
et al
|
|
|
|
03-31-2021
|
Hierarchical Road Topology Learning for Urban Map-less
Driving
by
Li Zhang
et al
|
|
|
|
04-01-2021
|
TFill: Image Completion via a Transformer-Based
Architecture
by
Chuanxia Zheng
et al
|
|
|
|
03-31-2021
|
FAPIS: A Few-shot Anchor-free Part-based Instance
Segmenter
by
Khoi Nguyen
et al
|
|
|
|
03-31-2021
|
Learning by Aligning Videos in Time
by
Sanjay Haresh
et al
|
|
|
|
04-01-2021
|
Jigsaw Clustering for Unsupervised Visual
Representation Learning
by
Pengguang Chen
et al
|
|
|
|
04-02-2021
|
LeViT: a Vision Transformer in ConvNets Clothing for
Faster Inference
by
Ben Graham
et al
|
|
|
|
03-30-2021
|
Model-Contrastive Federated Learning
by
Qinbin Li
et al
|
|
|
|
04-01-2021
|
MeanShift++: Extremely Fast Mode-Seeking With
Applications to Segmentation and Object Tracking
by
Jennifer Jang
et al
|
|
|
|
03-31-2021
|
NetAdaptV2: Efficient Neural Architecture Search with
Fast Super-Network Training and Architecture
Optimization
by
Tien-Ju Yang
et al
|
|
|
|
04-01-2021
|
A Front-End for Dense Monocular SLAM using a Learned
Outlier Mask Prior
by
Yihao Zhang
et al
|
|
|
|
03-31-2021
|
Full Surround Monodepth from Multiple Cameras
by
Vitor Guizilini
et al
|
|
|
|
04-01-2021
|
RePOSE: Real-Time Iterative Rendering and Refinement
for 6D Object Pose Estimation
by
Shun Iwase
et al
|
|
|
|
03-30-2021
|
Class-Aware Robust Adversarial Training for Object
Detection
by
Pin-Chun Chen
et al
|
|
|
|
04-01-2021
|
CUPID: Adaptive Curation of Pre-training Data for
Video-and-Language Representation Learning
by
Luowei Zhou
et al
|
|
|
|
03-31-2021
|
Joint Learning of Neural Transfer and Architecture
Adaptation for Image Recognition
by
Guangrun Wang
et al
|
|
|
|
03-31-2021
|
Unpaired Single-Image Depth Synthesis with
cycle-consistent Wasserstein GANs
by
Christoph Angermann
et al
|
|
|
|
04-01-2021
|
Exploiting Relationship for Complex-scene Image
Generation
by
Tianyu Hua
et al
|
|
|
|
04-01-2021
|
The surprising impact of mask-head architecture on
novel class segmentation
by
Vighnesh Birodkar
et al
|
|
|
|
04-01-2021
|
Domain Invariant Adversarial Learning
by
Matan Levi
et al
|
|
|
|
03-30-2021
|
Deep Gaussian Processes for Few-Shot Segmentation
by
Joakim Johnander
et al
|
|
|
|
03-31-2021
|
MR Slice Profile Estimation by Learning to Match
Internal Patch Distributions
by
Shuo Han
et al
|
|
|
|
03-30-2021
|
Physics-based Differentiable Depth Sensor Simulation
by
Benjamin Planche
et al
|
|
|
|
04-01-2021
|
Target Transformed Regression for Accurate Tracking
by
Yutao Cui
et al
|
|
|
|
03-31-2021
|
Scalable Visual Attribute Extraction through Hidden
Layers of a Residual ConvNet
by
Andres Baloian
et al
|
|
|
|