04-22-2021
|
VATT: Transformers for Multimodal Self-Supervised
Learning from Raw Video, Audio and Text
by
Hassan Akbari
et al
|
|
|
|
04-20-2021
|
VideoGPT: Video Generation using VQ-VAE and
Transformers
by
Wilson Yan
et al
|
|
|
|
04-22-2021
|
Multiscale Vision Transformers
by
Haoqi Fan
et al
|
|
|
|
04-22-2021
|
On Buggy Resizing Libraries and Surprising Subtleties
in FID Calculation
by
Gaurav Parmar
et al
|
|
|
|
04-21-2021
|
Contingencies from Observations: Tractable Contingency
Planning with Learned Behavior Models
by
Nicholas Rhinehart
et al
|
|
|
|
04-20-2021
|
GENESIS-V2: Inferring Unordered Object Representations
without Iterative Refinement
by
Martin Engelcke
et al
|
|
|
|
04-22-2021
|
Pri3D: Can 3D Priors Help 2D Representation Learning?
by
Ji Hou
et al
|
|
|
|
04-22-2021
|
KeypointDeformer: Unsupervised 3D Keypoint Discovery
for Shape Control
by
Tomas Jakab
et al
|
|
|
|
04-23-2021
|
VidTr: Video Transformer Without Convolutions
by
Xinyu Li
et al
|
|
|
|
04-22-2021
|
Token Labeling: Training a 85.4% Top-1 Accuracy Vision
Transformer with 56M Parameters on ImageNet
by
Zihang Jiang
et al
|
|
|
|
04-22-2021
|
So-ViT: Mind Visual Tokens for Vision Transformer
by
Jiangtao Xie
et al
|
|
|
|
04-22-2021
|
ImageNet-21K Pretraining for the Masses
by
Tal Ridnik
et al
|
|
|
|
04-22-2021
|
Cross-Domain and Disentangled Face Manipulation with 3D
Guidance
by
Can Wang
et al
|
|
|
|
04-21-2021
|
Voxel Structure-based Mesh Reconstruction from a 3D
Point Cloud
by
Chenlei Lv
et al
|
|
|
|
04-20-2021
|
SRWarp: Generalized Image Super-Resolution under
Arbitrary Transformation
by
Sanghyun Son
et al
|
|
|
|
04-22-2021
|
Hierarchical Motion Understanding via Motion Programs
by
Sumith Kulal
et al
|
|
|
|
04-20-2021
|
UNISURF: Unifying Neural Implicit Surfaces and Radiance
Fields for Multi-View Reconstruction
by
Michael Oechsle
et al
|
|
|
|
04-21-2021
|
MVFuseNet: Improving End-to-End Object Detection and
Motion Forecasting through Multi-View Fusion of LiDAR
Data
by
Ankit Laddha
et al
|
|
|
|
04-20-2021
|
Class-Incremental Learning with Generative Classifiers
by
Gido M. van de Ven
et al
|
|
|
|
04-21-2021
|
PP-YOLOv2: A Practical Object Detector
by
Xin Huang
et al
|
|
|
|
04-22-2021
|
Pose-Controllable Talking Face Generation by Implicitly
Modularized Audio-Visual Representation
by
Hang Zhou
et al
|
|
|
|
04-21-2021
|
SSLM: Self-Supervised Learning for Medical Diagnosis
from MR Video
by
Siladittya Manna
et al
|
|
|
|
04-20-2021
|
Perceptual Loss for Robust Unsupervised Homography
Estimation
by
Daniel Koguciuk
et al
|
|
|
|
04-21-2021
|
Dual Head Adversarial Training
by
Yujing Jiang
et al
|
|
|
|
04-20-2021
|
M2TR: Multi-modal Multi-scale Transformers for Deepfake
Detection
by
Junke Wang
et al
|
|
|
|
04-21-2021
|
PocketNet: A Smaller Neural Network for 3D Medical
Image Segmentation
by
Adrian Celaya
et al
|
|
|
|
04-22-2021
|
Automated Tackle Injury Risk Assessment in
Contact-Based Sports – A Rugby Union Example
by
Zubair Martin
et al
|
|
|
|
04-22-2021
|
Learning Transferable 3D Adversarial Cloaks for Deep
Trained Detectors
by
Arman Maesumi
et al
|
|
|
|
04-20-2021
|
SelfReg: Self-supervised Contrastive Regularization for
Domain Generalization
by
Daehee Kim
et al
|
|
|
|
04-23-2021
|
Sketch-based Normal Map Generation with Geometric
Sampling
by
Yi He
et al
|
|
|
|
04-22-2021
|
An End-to-End Computer Vision Methodology for
Quantitative Metallography
by
Matan Rusanovsky
et al
|
|
|
|
04-23-2021
|
Skip-Convolutions for Efficient Video Processing
by
Amirhossein Habibian
et al
|
|
|
|
04-22-2021
|
Motion Representations for Articulated Animation
by
Aliaksandr Siarohin
et al
|
|
|
|
04-22-2021
|
Focusing on Shadows for Predicting Heightmaps from
Single Remotely Sensed RGB Images with Deep Learning
by
Savvas Karatsiolis
et al
|
|
|
|
04-22-2021
|
METGAN: Generative Tumour Inpainting and Modality
Synthesis in Light Sheet Microscopy
by
Izabela Horvath
et al
|
|
|
|
04-21-2021
|
Colonoscopy Polyp Detection and Classification: Dataset
Creation and Comparative Evaluations
by
Kaidong Li
et al
|
|
|
|
04-21-2021
|
A Fully Spiking Hybrid Neural Network for
Energy-Efficient Object Detection
by
Biswadeep Chakraborty
et al
|
|
|
|
04-22-2021
|
Neuro-inspired edge feature fusion using Choquet
integrals
by
Cedric Marco-Detchart
et al
|
|
|
|
04-21-2021
|
Improving the Accuracy of Early Exits in Multi-Exit
Architectures via Curriculum Learning
by
Arian Bakhtiarnia
et al
|
|
|
|
04-22-2021
|
Frequency Domain Loss Function for Deep Exposure
Correction of Dark Images
by
Ojasvi Yadav
et al
|
|
|
|
04-22-2021
|
Distilling Audio-Visual Knowledge by Compositional
Contrastive Learning
by
Yanbei Chen
et al
|
|
|
|
04-22-2021
|
Towards Adversarial Patch Analysis and Certified
Defense against Crowd Counting
by
Qiming Wu
et al
|
|
|
|
04-22-2021
|
Opening up Open-World Tracking
by
Yang Liu
et al
|
|
|
|
04-21-2021
|
MetricOpt: Learning to Optimize Black-Box Evaluation
Metrics
by
Chen Huang
et al
|
|
|
|
04-20-2021
|
Lighting, Reflectance and Geometry Estimation from
360° Panoramic Stereo
by
Junxuan Li
et al
|
|
|
|
04-22-2021
|
A Data-Adaptive Loss Function for Incomplete Data and
Incremental Learning in Semantic Image Segmentation
by
Minh H. Vu
et al
|
|
|
|
04-21-2021
|
Multi-Class Micro-CT Image Segmentation Using Sparse
Regularized Deep Networks
by
Amirsaeed Yazdani
et al
|
|
|
|
04-21-2021
|
Meta-learning for skin cancer detection using Deep
Learning Techniques
by
Sara I. Garcia
|
|
|
|
04-21-2021
|
Fixed-Point and Objective Convergence of Plug-and-Play
Algorithms
by
Pravin Nair
et al
|
|
|
|
04-21-2021
|
Localization of Ice-Rink for Broadcast Hockey Videos
by
Mehrnaz Fani
et al
|
|
|
|
04-22-2021
|
FCOS3D: Fully Convolutional One-Stage Monocular 3D
Object Detection
by
Tai Wang
et al
|
|
|
|
04-22-2021
|
Multi-task Semi-supervised Learning for Pulmonary Lobe
Segmentation
by
Jingnan Jia
et al
|
|
|
|
04-22-2021
|
Continental-scale land cover mapping at 10 m resolution
over Europe (ELC10)
by
Zander S. Venter
et al
|
|
|
|
04-22-2021
|
Fully Convolutional Line Parsing
by
Xili Dai
et al
|
|
|
|
04-21-2021
|
Accurate and fast matrix factorization for low-rank
learning
by
Reza Godaz
et al
|
|
|
|
04-20-2021
|
Large Scale Interactive Motion Forecasting for
Autonomous Driving: The Waymo Open Motion Dataset
by
Scott Ettinger
et al
|
|
|
|
04-20-2021
|
Geometric Deep Learning on Anatomical Meshes for the
Prediction of Alzheimer's Disease
by
Ignacio Sarasua
et al
|
|
|
|
04-22-2021
|
Domain Adaptation for Semantic Segmentation via
Patch-Wise Contrastive Learning
by
Weizhe Liu
et al
|
|
|
|
04-22-2021
|
Relational Subsets Knowledge Distillation for
Long-tailed Retinal Diseases Recognition
by
Lie Ju
et al
|
|
|
|
04-22-2021
|
Hierarchical growing grid networks for skeleton based
action recognition
by
Zahra Gharaee
|
|
|
|
04-22-2021
|
Self-Supervised Learning from Semantically Imprecise
Data
by
Clemens-Alexander Brust
et al
|
|
|
|
04-22-2021
|
Efficient LiDAR Odometry for Autonomous Driving
by
Xin Zheng
et al
|
|
|
|
04-21-2021
|
DANNet: A One-Stage Domain Adaptation Network for
Unsupervised Nighttime Semantic Segmentation
by
Xinyi Wu
et al
|
|
|
|
04-21-2021
|
Lighting the Darkness in the Deep Learning Era
by
Chongyi Li
et al
|
|
|
|
04-21-2021
|
Self-optimizing loop sifting and majorization for 3D
reconstruction
by
Guoxiang Zhang
et al
|
|
|
|
04-22-2021
|
Network Space Search for Pareto-Efficient Spaces
by
Min-Fong Hong
et al
|
|
|
|
04-22-2021
|
Compressive lensless endoscopy with partial speckle
scanning
by
Stéphanie Guérit
et al
|
|
|
|
04-22-2021
|
NanoNet: Real-Time Polyp Segmentation in Video Capsule
Endoscopy and Colonoscopy
by
Debesh Jha
et al
|
|
|
|
04-22-2021
|
VM-MODNet: Vehicle Motion aware Moving Object Detection
for Autonomous Driving
by
Hazem Rashed
et al
|
|
|
|
04-22-2021
|
Semi-Supervised Segmentation of Concrete Aggregate
Using Consensus Regularisation and Prior Guidance
by
Max Coenen
et al
|
|
|
|
04-22-2021
|
Deep Video Matting via Spatio-Temporal Alignment and
Aggregation
by
Yanan Sun
et al
|
|
|
|
04-22-2021
|
Aerial Scene Understanding in The Wild: Multi-Scene
Recognition via Prototype-based Memory Networks
by
Yuansheng Hua
et al
|
|
|
|
04-22-2021
|
Heterogeneous Grid Convolution for Adaptive, Efficient,
and Controllable Computation
by
Ryuhei Hamaguchi
et al
|
|
|
|
04-21-2021
|
BEVDetNet: Bird's Eye View LiDAR Point Cloud based
Real-time 3D Object Detection for Autonomous Driving
by
Sambit Mohapatra
et al
|
|
|
|
04-21-2021
|
Fourier Contour Embedding for Arbitrary-Shaped Text
Detection
by
Yiqin Zhu
et al
|
|
|
|
04-22-2021
|
Continuous Learning and Adaptation with Membrane
Potential and Activation Threshold Homeostasis
by
Alexander Hadjiivanov
|
|
|
|
04-21-2021
|
A Strong Baseline for Vehicle Re-Identification
by
Su V. Huynh
et al
|
|
|
|
04-22-2021
|
ManipulaTHOR: A Framework for Visual Object
Manipulation
by
Kiana Ehsani
et al
|
|
|
|
04-22-2021
|
Robust 360-8PA: Redesigning The Normalized 8-point
Algorithm for 360-FoV Images
by
Bolivar Solarte
et al
|
|
|
|
04-20-2021
|
Boosting Masked Face Recognition with Multi-Task
ArcFace
by
David Montero
et al
|
|
|
|
04-22-2021
|
Sketch-QNet: A Quadruplet ConvNet for Color
Sketch-based Image Retrieval
by
Anibal Fuentes
et al
|
|
|
|
04-20-2021
|
Variational Relational Point Completion Network
by
Liang Pan
et al
|
|
|
|
04-21-2021
|
NTIRE 2021 Challenge on Quality Enhancement of
Compressed Video: Dataset and Study
by
Ren Yang
et al
|
|
|
|
04-21-2021
|
Revisiting Document Representations for Large-Scale
Zero-Shot Learning
by
Jihyung Kil
et al
|
|
|
|
04-21-2021
|
Jacobian Regularization for Mitigating Universal
Adversarial Perturbations
by
Kenneth T. Co
et al
|
|
|
|
04-22-2021
|
Maneuver-based Anchor Trajectory Hypotheses at
Roundabouts
by
Mohamed Hasan
et al
|
|
|
|
04-22-2021
|
H2O: Two Hands Manipulating Objects for First Person
Interaction Recognition
by
Taein Kwon
et al
|
|
|
|
04-21-2021
|
Multi-task Learning with Attention for End-to-end
Autonomous Driving
by
Keishi Ishihara
et al
|
|
|
|
04-21-2021
|
NTIRE 2021 Challenge on Quality Enhancement of
Compressed Video: Methods and Results
by
Ren Yang
et al
|
|
|
|
04-23-2021
|
Skeletor: Skeletal Transformers for Robust Body-Pose
Estimation
by
Tao Jiang
et al
|
|
|
|
04-20-2021
|
Improving state-of-the-art in Detecting Student
Engagement with Resnet and TCN Hybrid Network
by
Ali Abedi
et al
|
|
|
|
04-20-2021
|
Deep Transform and Metric Learning Networks
by
Wen Tang
et al
|
|
|
|
04-21-2021
|
Exploring 2D Data Augmentation for 3D Monocular Object
Detection
by
Sugirtha T
et al
|
|
|
|
04-20-2021
|
Voice2Mesh: Cross-Modal 3D Face Model Generation from
Voices
by
Cho-Ying Wu
et al
|
|
|
|
04-21-2021
|
Measuring economic activity from space: a case study
using flying airplanes and COVID-19
by
Mauricio Pamplona Segundo
et al
|
|
|
|
04-21-2021
|
Multi-Attention-Based Soft Partition Network for
Vehicle Re-Identification
by
Sangrok Lee
et al
|
|
|
|
04-20-2021
|
Distill on the Go: Online knowledge distillation in
self-supervised learning
by
Prashant Bhat
et al
|
|
|
|
04-22-2021
|
Connecting Hamilton-Jacobi partial differential
equations with maximum a posteriori and posterior mean
estimators for some non-convex priors
by
Jérôme Darbon
et al
|
|
|
|
04-20-2021
|
HMS: Hierarchical Modality Selection for Efficient
Video Recognition
by
Zejia Weng
et al
|
|
|
|
04-20-2021
|
GraghVQA: Language-Guided Graph Neural Networks for
Graph-based Visual Question Answering
by
Weixin Liang
et al
|
|
|
|
04-20-2021
|
Detection of Audio-Video Synchronization Errors Via
Event Detection
by
Joshua P. Ebenezer
et al
|
|
|
|
04-20-2021
|
Semantic similarity metrics for learned image
registration
by
Steffen Czolbe
et al
|
|
|
|
04-21-2021
|
Hierarchical Convolutional Neural Network with Feature
Preservation and Autotuned Thresholding for Crack
Detection
by
Qiuchen Zhu
et al
|
|
|
|
04-21-2021
|
Comprehensive Multi-Modal Interactions for Referring
Image Segmentation
by
Kanishk Jain
et al
|
|
|
|
04-21-2021
|
Orderly Dual-Teacher Knowledge Distillation for
Lightweight Human Pose Estimation
by
Zhong-Qiu Zhao
et al
|
|
|
|
04-21-2021
|
Improvement of Normal Estimation for PointClouds via
Simplifying Surface Fitting
by
Jun Zhou
et al
|
|
|
|
04-20-2021
|
CrossATNet - A Novel Cross-Attention Based Framework
for Sketch-Based Image Retrieval
by
Ushasi Chaudhuri
et al
|
|
|
|
04-20-2021
|
What is Wrong with One-Class Anomaly Detection?
by
JuneKyu Park
et al
|
|
|
|
04-20-2021
|
Boundary-Aware 3D Object Detection from Point Clouds
by
Rui Qian
et al
|
|
|
|
04-20-2021
|
Table Tennis Stroke Recognition Using Two-Dimensional
Human Pose Estimation
by
Kaustubh Milind Kulkarni
et al
|
|
|
|
04-20-2021
|
Measuring the Ripeness of Fruit with Hyperspectral
Imaging and Deep Learning
by
Leon Amadeus Varga
et al
|
|
|
|
04-21-2021
|
Programmable 3D snapshot microscopy with Fourier
convolutional networks
by
Diptodip Deb
et al
|
|
|
|
04-20-2021
|
A novel three-stage training strategy for long-tailed
classification
by
Gongzhe Li
et al
|
|
|
|