12-09-2021
|
Plenoxels: Radiance Fields without Neural Networks
by
Alex Yu
et al
|
|
|
|
12-08-2021
|
InvGAN: Invertible GANs
by
Partha Ghosh
et al
|
|
|
|
12-09-2021
|
GAN-Supervised Dense Visual Alignment
by
William Peebles
et al
|
|
|
|
12-09-2021
|
Extending the WILDS Benchmark for Unsupervised
Adaptation
by
Shiori Sagawa
et al
|
|
|
|
12-07-2021
|
Grounded Language-Image Pre-training
by
Liunian Harold Li
et al
|
|
|
|
12-09-2021
|
Multimodal Conditional Image Synthesis with
Product-of-Experts GANs
by
Xun Huang
et al
|
|
|
|
12-07-2021
|
CMA-CLIP: Cross-Modality Attention CLIP for Image-Text
Classification
by
Huidong Liu
et al
|
|
|
|
12-08-2021
|
MLP Architectures for Vision-and-Language Modeling: An
Empirical Study
by
Yixin Nie
et al
|
|
|
|
12-08-2021
|
FLAVA: A Foundational Language And Vision Alignment
Model
by
Amanpreet Singh
et al
|
|
|
|
12-07-2021
|
Ref-NeRF: Structured View-Dependent Appearance for
Neural Radiance Fields
by
Dor Verbin
et al
|
|
|
|
12-09-2021
|
HairCLIP: Design Your Hair by Text and Reference Image
by
Tianyi Wei
et al
|
|
|
|
12-08-2021
|
Everything at Once -- Multi-modal Fusion Transformer
for Video Retrieval
by
Nina Shvetsova
et al
|
|
|
|
12-09-2021
|
CLIP-NeRF: Text-and-Image Driven Manipulation of Neural
Radiance Fields
by
Can Wang
et al
|
|
|
|
12-08-2021
|
What's Behind the Couch? Directed Ray Distance Functions
(DRDF) for 3D Scene Reconstruction
by
Nilesh Kulkarni
et al
|
|
|
|
12-09-2021
|
Neural Radiance Fields for Outdoor Scene Relighting
by
Viktor Rudnev
et al
|
|
|
|
12-09-2021
|
PTR: A Benchmark for Part-based Conceptual, Relational,
and Physical Reasoning
by
Yining Hong
et al
|
|
|
|
12-09-2021
|
Fast Point Transformer
by
Chunghyun Park
et al
|
|
|
|
12-08-2021
|
Tracking People by Predicting 3D Appearance, Location
& Pose
by
Jathushan Rajasegaran
et al
|
|
|
|
12-08-2021
|
Prompting Visual-Language Models for Efficient Video
Understanding
by
Chen Ju
et al
|
|
|
|
12-08-2021
|
Contrastive Instruction-Trajectory Learning for
Vision-Language Navigation
by
Xiwen Liang
et al
|
|
|
|
12-09-2021
|
Self-Supervised Image-to-Text and Text-to-Image
Synthesis
by
Anindya Sundar Das
et al
|
|
|
|
12-09-2021
|
A Bilingual, OpenWorld Video Text Dataset and
End-to-end Video Text Spotter with Transformer
by
Weijia Wu
et al
|
|
|
|
12-08-2021
|
Do Pedestrians Pay Attention? Eye Contact Detection in
the Wild
by
Younes Belkada
et al
|
|
|
|
12-09-2021
|
Neural Descriptor Fields: SE(3)-Equivariant Object
Representations for Manipulation
by
Anthony Simeonov
et al
|
|
|
|
12-08-2021
|
BACON: Band-limited Coordinate Networks for Multiscale
Scene Representation
by
David B. Lindell
et al
|
|
|
|
12-09-2021
|
Latent Space Explanation by Intervention
by
Itai Gat
et al
|
|
|
|
12-08-2021
|
Symmetry Perception by Deep Networks: Inadequacy of
Feed-Forward Architectures and Improvements with
Recurrent Connections
by
Shobhita Sundaram
et al
|
|
|
|
12-07-2021
|
Activation to Saliency: Forming High-Quality Labels for
Unsupervised Salient Object Detection
by
Huajun Zhou
et al
|
|
|
|
12-08-2021
|
Adverse Weather Image Translation with Asymmetric and
Uncertainty-aware GAN
by
Jeong-gi Kwak
et al
|
|
|
|
12-09-2021
|
Semi-Supervised Medical Image Segmentation via Cross
Teaching between CNN and Transformer
by
Xiangde Luo
et al
|
|
|
|
12-10-2021
|
CityNeRF: Building NeRF at City Scale
by
Yuanbo Xiangli
et al
|
|
|
|
12-08-2021
|
Exploring Temporal Granularity in Self-Supervised Video
Representation Learning
by
Rui Qian
et al
|
|
|
|
12-09-2021
|
PixMix: Dreamlike Pictures Comprehensively Improve
Safety Measures
by
Dan Hendrycks
et al
|
|
|
|
12-07-2021
|
A Survey on Intrinsic Images: Delving Deep Into Lambert
and Beyond
by
Elena Garces
et al
|
|
|
|
12-08-2021
|
DualFormer: Local-Global Stratified Transformer for
Efficient Video Recognition
by
Yuxuan Liang
et al
|
|
|
|
12-08-2021
|
A Hierarchical Spatio-Temporal Graph Convolutional
Neural Network for Anomaly Detection in Videos
by
Xianlin Zeng
et al
|
|
|
|
12-08-2021
|
Shortest Paths in Graphs with Matrix-Valued Edges:
Concepts, Algorithm and Application to 3D Multi-Shape
Analysis
by
Viktoria Ehm
et al
|
|
|
|
12-08-2021
|
Revisiting Contrastive Learning through the Lens of
Neighborhood Component Analysis: an Integrated
Framework
by
Ching-Yun Ko
et al
|
|
|
|
12-09-2021
|
Evaluating saliency methods on artificial data with
different background types
by
Céline Budding
et al
|
|
|
|
12-08-2021
|
Feature Statistics Mixing Regularization for Generative
Adversarial Networks
by
Junho Kim
et al
|
|
|
|
12-07-2021
|
Evaluating Generic Auto-ML Tools for Computational
Pathology
by
Lars Ole Schwen
et al
|
|
|
|
12-09-2021
|
Does Redundancy in AI Perception Systems Help to Test
for Super-Human Automated Driving Performance?
by
Hanno Gottschalk
et al
|
|
|
|
12-08-2021
|
Come-Closer-Diffuse-Faster: Accelerating Conditional
Diffusion Models for Inverse Problems through
Stochastic Contraction
by
Hyungjin Chung
et al
|
|
|
|
12-08-2021
|
SNEAK: Synonymous Sentences-Aware Adversarial Attack on
Natural Language Video Localization
by
Wenbo Gou
et al
|
|
|
|
12-07-2021
|
Parallel Discrete Convolutions on Adaptive Particle
Representations of Images
by
Joel Jonsson
et al
|
|
|
|
12-09-2021
|
Generating Useful Accident-Prone Driving Scenarios via
a Learned Traffic Prior
by
Davis Rempe
et al
|
|
|
|
12-09-2021
|
BLT: Bidirectional Layout Transformer for Controllable
Layout Generation
by
Xiang Kong
et al
|
|
|
|
12-09-2021
|
MAGMA -- Multimodal Augmentation of Generative Models
through Adapter-based Finetuning
by
Constantin Eichenberg
et al
|
|
|
|
12-07-2021
|
Bootstrapping ViTs: Towards Liberating Vision
Transformers from Pre-training
by
Haofei Zhang
et al
|
|
|
|
12-08-2021
|
Self-Supervised Models are Continual Learners
by
Enrico Fini
et al
|
|
|
|
12-10-2021
|
UNIST: Unpaired Neural Implicit Shape Translation
Network
by
Qimin Chen
et al
|
|
|
|
12-09-2021
|
Critical configurations for two projective views, a new
approach
by
Martin Bråtelund
|
|
|
|
12-08-2021
|
Geometry-Guided Progressive NeRF for Generalizable and
Efficient Neural Human Rendering
by
Mingfei Chen
et al
|
|
|
|
12-07-2021
|
Unsupervised Representation Learning via Neural
Activation Coding
by
Yookoon Park
et al
|
|
|
|
12-08-2021
|
CoSSL: Co-Learning of Representation and Classifier for
Imbalanced Semi-Supervised Learning
by
Yue Fan
et al
|
|
|
|
12-07-2021
|
Nuclei Segmentation in Histopathology Images using Deep
Learning with Local and Global Views
by
Mahdi Arab Loodaricheh
et al
|
|
|
|
12-08-2021
|
Transformer-Based Approach for Joint Handwriting and
Named Entity Recognition in Historical documents
by
Ahmed Cheikh Rouhoua
et al
|
|
|
|
12-09-2021
|
DVHN: A Deep Hashing Framework for Large-scale Vehicle
Re-identification
by
Yongbiao Chen
et al
|
|
|
|
12-08-2021
|
Trajectory-Constrained Deep Latent Visual Attention for
Improved Local Planning in Presence of Heterogeneous
Terrain
by
Stefan Wapnick
et al
|
|
|
|
12-09-2021
|
Locally Shifted Attention With Early Global Integration
by
Shelly Sheynin
et al
|
|
|
|
12-10-2021
|
HeadNeRF: A Real-time NeRF-based Parametric Head Model
by
Yang Hong
et al
|
|
|
|
12-08-2021
|
Audio-Visual Synchronisation in the wild
by
Honglie Chen
et al
|
|
|
|
12-09-2021
|
Auto-X3D: Ultra-Efficient Video Understanding via
Finer-Grained Neural Architecture Search
by
Yifan Jiang
et al
|
|
|
|
12-09-2021
|
Superpixel-Based Building Damage Detection from
Post-earthquake Very High Resolution Imagery Using Deep
Neural Networks
by
Jun Wang
et al
|
|
|
|
12-09-2021
|
FaceFormer: Speech-Driven 3D Facial Animation with
Transformers
by
Yingruo Fan
et al
|
|
|
|
12-08-2021
|
Reverse image filtering using total derivative
approximation and accelerated gradient descent
by
Fernando J. Galetto
et al
|
|
|
|
12-09-2021
|
CLIP2StyleGAN: Unsupervised Extraction of StyleGAN Edit
Directions
by
Rameen Abdal
et al
|
|
|
|
12-07-2021
|
Low-rank Tensor Decomposition for Compression of
Convolutional Neural Networks Using Funnel
Regularization
by
Bo-Shiuan Chu
et al
|
|
|
|
12-08-2021
|
Topology-aware Convolutional Neural Network for
Efficient Skeleton-based Action Recognition
by
Kailin Xu
et al
|
|
|
|
12-10-2021
|
More Control for Free! Image Synthesis with Semantic
Diffusion Guidance
by
Xihui Liu
et al
|
|
|
|
12-09-2021
|
RamBoAttack: A Robust Query Efficient Deep Neural
Network Decision Exploit
by
Viet Quoc Vo
et al
|
|
|
|
12-08-2021
|
Neural Points: Point Cloud Representation with Neural
Fields
by
Wanquan Feng
et al
|
|
|
|
12-09-2021
|
Mimicking the Oracle: An Initial Phase Decorrelation
Approach for Class Incremental Learning
by
Yujun Shi
et al
|
|
|
|
12-08-2021
|
Contrastive Learning with Large Memory Bank and
Negative Embedding Subtraction for Accurate Copy
Detection
by
Shuhei Yokoo
|
|
|
|
12-07-2021
|
Gaussian map predictions for 3D surface feature
localisation and counting
by
Justin Le Louëdec
et al
|
|
|
|
12-09-2021
|
One-dimensional Deep Low-rank and Sparse Network for
Accelerated MRI
by
Zi Wang
et al
|
|
|
|
12-09-2021
|
Mutual Adversarial Training: Learning together is
better than going alone
by
Jiang Liu
et al
|
|
|
|
12-09-2021
|
Searching Parameterized AP Loss for Object Detection
by
Chenxin Tao
et al
|
|
|
|
12-07-2021
|
Time-Equivariant Contrastive Video Representation
Learning
by
Simon Jenni
et al
|
|
|
|
12-08-2021
|
Binary Change Guided Hyperspectral Multiclass Change
Detection
by
Meiqi Hu
et al
|
|
|
|
12-08-2021
|
Learn2Reg: comprehensive multi-task medical image
registration challenge, dataset and evaluation in the
era of deep learning
by
Alessa Hering
et al
|
|
|
|
12-09-2021
|
Model Doctor: A Simple Gradient Aggregation Strategy
for Diagnosing and Treating CNN Classifiers
by
Zunlei Feng
et al
|
|
|
|
12-07-2021
|
Fully Attentional Network for Semantic Segmentation
by
Qi Song
et al
|
|
|
|
12-07-2021
|
A Contrastive Distillation Approach for Incremental
Semantic Segmentation in Aerial Images
by
Edoardo Arnaudo
et al
|
|
|
|
12-09-2021
|
Injecting Semantic Concepts into End-to-End Image
Captioning
by
Zhiyuan Fang
et al
|
|
|
|
12-10-2021
|
Couplformer: Rethinking Vision Transformer with Coupling
Attention Map
by
Hai Lan
et al
|
|
|
|
12-08-2021
|
VISOLO: Grid-Based Space-Time Aggregation for Efficient
Online Video Instance Segmentation
by
Su Ho Han
et al
|
|
|
|
12-08-2021
|
Boosting Contrastive Learning with Relation Knowledge
Distillation
by
Kai Zheng
et al
|
|
|
|
12-08-2021
|
Burn After Reading: Online Adaptation for Cross-domain
Streaming Data
by
Luyu Yang
et al
|
|
|
|
12-09-2021
|
Adaptive Methods for Aggregated Domain Generalization
by
Xavier Thomas
et al
|
|
|
|
12-09-2021
|
3D-VField: Learning to Adversarially Deform Point
Clouds for Robust 3D Object Detection
by
Alexander Lehner
et al
|
|
|
|
12-09-2021
|
Exploring the Equivalence of Siamese Self-Supervised
Learning via A Unified Gradient Framework
by
Chenxin Tao
et al
|
|
|
|
12-08-2021
|
Assessing a Single Image in Reference-Guided Image
Synthesis
by
Jiayi Guo
et al
|
|
|
|
12-07-2021
|
DeepFace-EMD: Re-ranking Using Patch-wise Earth Mover's
Distance Improves Out-Of-Distribution Face
Identification
by
Hai Phan
et al
|
|
|
|
12-08-2021
|
Progressive Multi-stage Interactive Training in Mobile
Network for Fine-grained Recognition
by
Zhenxin Wu
et al
|
|
|
|
12-09-2021
|
Learning with Nested Scene Modeling and Cooperative
Architecture Search for Low-Light Vision
by
Risheng Liu
et al
|
|
|
|
12-08-2021
|
Garment4D: Garment Reconstruction from Point Cloud
Sequences
by
Fangzhou Hong
et al
|
|
|
|
12-08-2021
|
BA-Net: Bridge Attention for Deep Convolutional Neural
Networks
by
Yue Zhao
et al
|
|
|
|
12-09-2021
|
Explainability of the Implications of Supervised and
Unsupervised Face Image Quality Estimations Through
Activation Map Variation Analyses in Face Recognition
Models
by
Biying Fu
et al
|
|
|
|
12-08-2021
|
SoK: Anti-Facial Recognition Technology
by
Emily Wenger
et al
|
|
|
|
12-08-2021
|
Enhancing Food Intake Tracking in Long-Term Care with
Automated Food Imaging and Nutrient Intake Tracking
(AFINI-T) Technology
by
Kaylen J. Pfisterer
et al
|
|
|
|
12-08-2021
|
A Unified Architecture of Semantic Segmentation and
Hierarchical Generative Adversarial Networks for
Expression Manipulation
by
Rumeysa Bodur
et al
|
|
|
|
12-09-2021
|
Amicable Aid: Turning Adversarial Attack to Benefit
Classification
by
Juyeop Kim
et al
|
|
|
|
12-09-2021
|
Self-Supervised Keypoint Discovery in Behavioral Videos
by
Jennifer J. Sun
et al
|
|
|
|
12-07-2021
|
GPCO: An Unsupervised Green Point Cloud Odometry Method
by
Pranav Kadam
et al
|
|
|
|
12-07-2021
|
Generation of Non-Deterministic Synthetic Face Datasets
Guided by Identity Priors
by
Marcel Grimmer
et al
|
|
|
|
12-07-2021
|
Fully Context-Aware Image Inpainting with a Learned
Semantic Pyramid
by
Wendong Zhang
et al
|
|
|
|
12-08-2021
|
Implicit Neural Representations for Image Compression
by
Yannick Strümpler
et al
|
|
|
|
12-10-2021
|
Unified Multimodal Pre-training and Prompt-based Tuning
for Vision-Language Understanding and Generation
by
Tianyi Liu
et al
|
|
|
|
12-08-2021
|
GCA-Net: Utilizing Gated Context Attention for
Improving Image Forgery Localization and Detection
by
Sowmen Das
et al
|
|
|
|
12-09-2021
|
Robust Weakly Supervised Learning for COVID-19
Recognition Using Multi-Center CT Images
by
Qinghao Ye
et al
|
|
|
|
12-08-2021
|
Unimodal Face Classification with Multimodal Training
by
Wenbin Teng
et al
|
|
|
|
12-08-2021
|
Transformaly -- Two (Feature Spaces) Are Better Than
One
by
Matan Jacob Cohen
et al
|
|
|
|
12-10-2021
|
VUT: Versatile UI Transformer for Multi-Modal
Multi-Task User Interface Modeling
by
Yang Li
et al
|
|
|
|
12-07-2021
|
CG-NeRF: Conditional Generative Neural Radiance Fields
by
Kyungmin Jo
et al
|
|
|
|
12-08-2021
|
A novel multi-view deep learning approach for BI-RADS
and density assessment of mammograms
by
Huyen T. X. Nguyen
et al
|
|
|
|
12-09-2021
|
PE-former: Pose Estimation Transformer
by
Paschalis Panteleris
et al
|
|
|
|
12-10-2021
|
Predicting Physical World Destinations for Commands
Given to Self-Driving Cars
by
Dusan Grujicic
et al
|
|
|
|
12-10-2021
|
Critical configurations for three projective views
by
Martin Bråtelund
|
|
|
|
12-09-2021
|
BLPnet: A New DNN model for Automatic License Plate
Detection with Bengali OCR
by
Md Saif Hassan Onim
et al
|
|
|
|
12-08-2021
|
Adversarial Parametric Pose Prior
by
Andrey Davydov
et al
|
|
|
|
12-07-2021
|
Domain Generalization via Progressive Layer-wise and
Channel-wise Dropout
by
Jintao Guo
et al
|
|
|
|
12-07-2021
|
Variance-Aware Weight Initialization for Point
Convolutional Neural Networks
by
Pedro Hermosilla
et al
|
|
|
|
12-08-2021
|
SimulSLT: End-to-End Simultaneous Sign Language
Translation
by
Aoxiong Yin
et al
|
|
|
|
12-09-2021
|
Progressive Attention on Multi-Level Dense Difference
Maps for Generic Event Boundary Detection
by
Jiaqi Tang
et al
|
|
|
|
12-07-2021
|
Suppressing Static Visual Cues via Normalizing Flows
for Self-Supervised Video Representation Learning
by
Manlin Zhang
et al
|
|
|
|
12-09-2021
|
HBReID: Harder Batch for Re-identification
by
Wen Li
et al
|
|
|
|
12-07-2021
|
Auxiliary Learning for Self-Supervised Video
Representation via Similarity-based Knowledge
Distillation
by
Amirhossein Dadashzadeh
et al
|
|
|
|
12-07-2021
|
Vehicle trajectory prediction works, but not everywhere
by
Mohammadhossein Bahari
et al
|
|
|
|
12-08-2021
|
SoK: Vehicle Orientation Representations for Deep
Rotation Estimation
by
Huahong Tu
et al
|
|
|
|
12-09-2021
|
AdaStereo: An Efficient Domain-Adaptive Stereo Matching
Approach
by
Xiao Song
et al
|
|
|
|
12-08-2021
|
Learning Auxiliary Monocular Contexts Helps Monocular
3D Object Detection
by
Xianpeng Liu
et al
|
|
|
|
12-07-2021
|
A Robust Completed Local Binary Pattern (RCLBP) for
Surface Defect Detection
by
Nana Kankam Gyimah
et al
|
|
|
|
12-09-2021
|
Contextualized Spatio-Temporal Contrastive Learning
with Self-Supervision
by
Liangzhe Yuan
et al
|
|
|
|
12-07-2021
|
Wild ToFu: Improving Range and Quality of Indirect
Time-of-Flight Depth with RGB Fusion in Challenging
Environments
by
HyunJun Jung
et al
|
|
|
|
12-08-2021
|
On visual self-supervision and its effect on model
robustness
by
Michal Kucer
et al
|
|
|
|
12-09-2021
|
CaSP: Class-agnostic Semi-Supervised Pretraining for
Detection and Segmentation
by
Lu Qi
et al
|
|
|
|
12-07-2021
|
Few-Shot Image Classification Along Sparse Graphs
by
Joseph F Comer
et al
|
|
|
|
12-08-2021
|
Unsupervised Complementary-aware Multi-process Fusion
for Visual Place Recognition
by
Stephen Hausler
et al
|
|
|
|
12-09-2021
|
Implicit Feature Refinement for Instance Segmentation
by
Lufan Ma
et al
|
|
|
|
12-08-2021
|
Recurrent Glimpse-based Decoder for Detection with
Transformer
by
Zhe Chen
et al
|
|
|
|
12-09-2021
|
Spatio-temporal Relation Modeling for Few-shot Action
Recognition
by
Anirudh Thatipelli
et al
|
|
|
|
12-07-2021
|
Flexible Networks for Learning Physical Dynamics of
Deformable Objects
by
Jinhyung Park
et al
|
|
|
|
12-09-2021
|
Exploring Event-driven Dynamic Context for Accident
Scene Segmentation
by
Jiaming Zhang
et al
|
|
|
|
12-09-2021
|
IterMVS: Iterative Probability Estimation for Efficient
Multi-View Stereo
by
Fangjinhua Wang
et al
|
|
|
|
12-07-2021
|
Unsupervised Learning of Compositional Scene
Representations from Multiple Unspecified Viewpoints
by
Jinyang Yuan
et al
|
|
|
|
12-09-2021
|
A Shared Representation for Photorealistic Driving
Simulators
by
Saeed Saadatnejad
et al
|
|
|
|
12-09-2021
|
ScaleNet: A Shallow Architecture for Scale Estimation
by
Axel Barroso-Laguna
et al
|
|
|
|
12-09-2021
|
Knowledge Distillation for Object Detection via Rank
Mimicking and Prediction-guided Feature Imitation
by
Gang Li
et al
|
|
|
|
12-08-2021
|
DMRVisNet: Deep Multi-head Regression Network for
Pixel-wise Visibility Estimation Under Foggy Weather
by
Jing You
et al
|
|
|
|
12-07-2021
|
A Generic Approach for Enhancing GANs by Regularized
Latent Optimization
by
Yufan Zhou
et al
|
|
|
|
12-07-2021
|
Image Enhancement via Bilateral Learning
by
Saeedeh Rezaee
et al
|
|
|
|
12-07-2021
|
Regularity Learning via Explicit Distribution Modeling
for Skeletal Video Anomaly Detection
by
Shoubin Yu
et al
|
|
|
|