03-01-2022
|
Generative Adversarial Networks
by
Gilad Cohen
et al
|
|
|
|
03-03-2022
|
Understanding Failure Modes of Self-Supervised Learning
by
Neha Mukund Kalibhat
et al
|
|
|
|
03-03-2022
|
Efficient Video Instance Segmentation via Tracklet
Query and Proposal
by
Jialian Wu
et al
|
|
|
|
03-03-2022
|
BatchFormer: Learning to Explore Sample Relationships
for Robust Representation Learning
by
Zhi Hou
et al
|
|
|
|
03-03-2022
|
NeRF-Supervision: Learning Dense Object Descriptors
from Neural Radiance Fields
by
Lin Yen-Chen
et al
|
|
|
|
03-02-2022
|
TableFormer: Table Structure Understanding with
Transformers
by
Ahmed Nassar
et al
|
|
|
|
03-03-2022
|
Recovering 3D Human Mesh from Monocular Images: A
Survey
by
Yating Tian
et al
|
|
|
|
03-01-2022
|
Variational Autoencoders Without the Variation
by
Gregory A. Daly
et al
|
|
|
|
03-01-2022
|
CLIP-GEN: Language-Free Training of a Text-to-Image
Generator with CLIP
by
Zihao Wang
et al
|
|
|
|
03-03-2022
|
Playable Environments: Video Manipulation in Space and
Time
by
Willi Menapace
et al
|
|
|
|
03-04-2022
|
Freeform Body Motion Generation from Speech
by
Jing Xu
et al
|
|
|
|
03-01-2022
|
D^2ETR: Decoder-Only DETR with Computationally
Efficient Cross-Scale Attention
by
Junyu Lin
et al
|
|
|
|
03-03-2022
|
Vision-Language Intelligence: Tasks, Representation
Learning, and Large Models
by
Feng Li
et al
|
|
|
|
03-04-2022
|
DiT: Self-supervised Pre-training for Document Image
Transformer
by
Junlong Li
et al
|
|
|
|
03-02-2022
|
HighMMT: Towards Modality and Task Generalization for
High-Modality Representation Learning
by
Paul Pu Liang
et al
|
|
|
|
03-03-2022
|
Mind the Gap: Understanding the Modality Gap in
Multi-modal Contrastive Representation Learning
by
Weixin Liang
et al
|
|
|
|
03-01-2022
|
Self-Supervised Vision Transformers Learn Visual
Concepts in Histopathology
by
Richard J. Chen
et al
|
|
|
|
03-01-2022
|
Affordance Learning from Play for Sample-Efficient
Policy Learning
by
Jessica Borja-Diaz
et al
|
|
|
|
03-01-2022
|
Recent, rapid advancement in visual question answering
architecture
by
Venkat Kodali
et al
|
|
|
|
03-03-2022
|
PINA: Learning a Personalized Implicit Neural Avatar
from a Single RGB-D Video Sequence
by
Zijian Dong
et al
|
|
|
|
03-01-2022
|
Unsupervised Vision-and-Language Pre-training via
Retrieval-based Multi-Granular Alignment
by
Mingyang Zhou
et al
|
|
|
|
03-03-2022
|
A Deep Neural Framework for Image Caption Generation
Using GRU-Based Attention Mechanism
by
Rashid Khan
et al
|
|
|
|
03-01-2022
|
InCloud: Incremental Learning for Point Cloud Place
Recognition
by
Joshua Knights
et al
|
|
|
|
03-01-2022
|
Styleverse: Towards Identity Stylization across
Heterogeneous Domains
by
Jia Li
et al
|
|
|
|
03-01-2022
|
Benchmarking Robustness of Deep Learning Classifiers
Using Two-Factor Perturbation
by
Wei Dai
et al
|
|
|
|
03-03-2022
|
Autoregressive Image Generation using Residual
Quantization
by
Doyup Lee
et al
|
|
|
|
03-01-2022
|
Towards Creativity Characterization of Generative
Models via Group-based Subset Scanning
by
Celia Cintas
et al
|
|
|
|
03-01-2022
|
CrossPoint: Self-Supervised Cross-Modal Contrastive
Learning for 3D Point Cloud Understanding
by
Mohamed Afham
et al
|
|
|
|
03-02-2022
|
Hyperspectral Pixel Unmixing with Latent Dirichlet
Variational Autoencoder
by
Kiran Mantripragada
et al
|
|
|
|
03-03-2022
|
Random Quantum Neural Networks (RQNN) for Noisy Image
Recognition
by
Debanjan Konar
et al
|
|
|
|
03-03-2022
|
ROCT-Net: A new ensemble deep convolutional model with
improved spatial resolution learning for detecting
common diseases from retinal OCT images
by
Mohammad Rahimzadeh
et al
|
|
|
|
03-02-2022
|
DisARM: Displacement Aware Relation Module for 3D
Detection
by
Yao Duan
et al
|
|
|
|
03-03-2022
|
Investigating the limited performance of a
deep-learning-based SPECT denoising approach: An
observer-study-based characterization
by
Zitong Yu
et al
|
|
|
|
03-03-2022
|
Interactive Image Synthesis with Panoptic Layout
Generation
by
Bo Wang
et al
|
|
|
|
03-02-2022
|
MetaDT: Meta Decision Tree for Interpretable Few-Shot
Learning
by
Baoquan Zhang
et al
|
|
|
|
03-02-2022
|
PetsGAN: Rethinking Priors for Single Image Generation
by
Zicheng Zhang
et al
|
|
|
|
03-01-2022
|
Multi-Task Multi-Scale Learning For Outcome Prediction
in 3D PET Images
by
Amine Amyar
et al
|
|
|
|
03-03-2022
|
Capturing Shape Information with Multi-Scale
Topological Loss Terms for 3D Reconstruction
by
Dominik J. E. Waibel
et al
|
|
|
|
03-03-2022
|
Selective Residual M-Net for Real Image Denoising
by
Chi-Mao Fan
et al
|
|
|
|
03-01-2022
|
How certain are your uncertainties?
by
Luke Whitbread
et al
|
|
|
|
03-03-2022
|
NeuroFluid: Fluid Dynamics Grounding with
Particle-Driven Neural Radiance Fields
by
Shanyan Guan
et al
|
|
|
|
03-01-2022
|
Colon Nuclei Instance Segmentation using a
Probabilistic Two-Stage Detector
by
Pedro Costa
et al
|
|
|
|
03-01-2022
|
Compliance Challenges in Forensic Image Analysis Under
the Artificial Intelligence Act
by
Benedikt Lorch
et al
|
|
|
|
03-02-2022
|
Differentiable IFS Fractals
by
Cory Braker Scott
|
|
|
|
03-02-2022
|
Enhancing Adversarial Robustness for Deep Metric
Learning
by
Mo Zhou
et al
|
|
|
|
03-03-2022
|
Recent Advances in Vision Transformer: A Survey and
Outlook of Recent Work
by
Khawar Islam
|
|
|
|
03-03-2022
|
Detecting High-Quality GAN-Generated Face Images using
Neural Networks
by
Ehsan Nowroozi
et al
|
|
|
|
03-03-2022
|
DIME: Fine-grained Interpretations of Multimodal Models
via Disentangled Local Explanations
by
Yiwei Lyu
et al
|
|
|
|
03-01-2022
|
Towards IID representation learning and its application
on biomedical data
by
Jiqing Wu
et al
|
|
|
|
03-03-2022
|
Label-Only Model Inversion Attacks via Boundary
Repulsion
by
Mostafa Kahla
et al
|
|
|
|
03-03-2022
|
On Learning Contrastive Representations for Learning
with Noisy Labels
by
Li Yi
et al
|
|
|
|
03-02-2022
|
Protecting Celebrities with Identity Consistency
Transformer
by
Xiaoyi Dong
et al
|
|
|
|
03-01-2022
|
Generalizable Person Re-Identification via
Self-Supervised Batch Norm Test-Time Adaption
by
Ke Han
et al
|
|
|
|
03-03-2022
|
Ensembles of Vision Transformers as a New Paradigm for
Automated Classification in Ecology
by
S. Kyathanahally
et al
|
|
|
|
03-03-2022
|
TCTrack: Temporal Contexts for Aerial Tracking
by
Ziang Cao
et al
|
|
|
|
03-01-2022
|
Can No-reference features help in Full-reference image
quality estimation?
by
Saikat Dutta
et al
|
|
|
|
03-01-2022
|
Separable-HoverNet and Instance-YOLO for Colon Nuclei
Identification and Counting
by
Chunhui Lin
et al
|
|
|
|
03-03-2022
|
Instance Segmentation for Autonomous Log Grasping in
Forestry Operations
by
Jean-Michel Fortin
et al
|
|
|
|
03-03-2022
|
Cross-Modality Earth Movers Distance for Visible
Thermal Person Re-Identification
by
Yongguo Ling
et al
|
|
|
|
03-03-2022
|
DenseUNets with feedback non-local attention for the
segmentation of specular microscopy images of the
corneal endothelium with Fuchs dystrophy
by
Juan P. Vigueras-Guillén
et al
|
|
|
|
03-03-2022
|
Rethinking the role of normalization and residual
blocks for spiking neural networks
by
Shin-ichi Ikegawa
et al
|
|
|
|
03-03-2022
|
Self-supervised Transparent Liquid Segmentation for
Robotic Pouring
by
Gautham Narayan Narasimhan
et al
|
|
|
|
03-01-2022
|
A unified 3D framework for Organs at Risk Localization
and Segmentation for Radiation Therapy Planning
by
Fernando Navarro
et al
|
|
|
|
03-02-2022
|
ADVISE: ADaptive Feature Relevance and VISual
Explanations for Convolutional Neural Networks
by
Mohammad Mahdi Dehshibi
et al
|
|
|
|
03-03-2022
|
LatentFormer: Multi-Agent Transformer-Based Interaction
Modeling and Trajectory Prediction
by
Elmira Amirloo
et al
|
|
|
|
03-02-2022
|
VAE-iForest: Auto-encoding Reconstruction and
Isolation-based Anomalies Detecting Fallen Objects on
Road Surface
by
Takato Yasuno
et al
|
|
|
|
03-03-2022
|
LGT-Net: Indoor Panoramic Room Layout Estimation with
Geometry-Aware Transformer Network
by
Zhigang Jiang
et al
|
|
|
|
03-03-2022
|
Adaptive Path Planning for UAVs for Multi-Resolution
Semantic Segmentation
by
Felix Stache
et al
|
|
|
|
03-03-2022
|
Modality-Adaptive Mixup and Invariant Decomposition for
RGB-Infrared Person Re-Identification
by
Zhipeng Huang
et al
|
|
|
|
03-01-2022
|
SwitchHit: A Probabilistic, Complementarity-Based
Switching System for Improved Visual Place Recognition
in Changing Environments
by
Maria Waheed
et al
|
|
|
|
03-03-2022
|
Adaptive Local-Global Relational Network for Facial
Action Units Recognition and Facial Paralysis
Estimation
by
Xuri Ge
et al
|
|
|
|
03-01-2022
|
Towards deep learning-powered IVF: A large public
benchmark for morphokinetic parameter prediction
by
Tristan Gomez
et al
|
|
|
|
03-02-2022
|
H4D: Human 4D Modeling by Learning Neural Compositional
Representation
by
Boyan Jiang
et al
|
|
|
|
03-02-2022
|
LILE: Look In-Depth before Looking Elsewhere -- A Dual
Attention Network using Transformers for Cross-Modal
Information Retrieval in Histopathology Archives
by
Danial Maleki
et al
|
|
|
|
03-03-2022
|
Translational Lung Imaging Analysis Through
Disentangled Representations
by
Pedro M. Gordaliza
et al
|
|
|
|
03-01-2022
|
Image analysis for automatic measurement of crustose
lichens
by
Pedro Guedes
et al
|
|
|
|
03-03-2022
|
NUQ: A Noise Metric for Diffusion MRI via Uncertainty
Discrepancy Quantification
by
Shreyas Fadnavis
et al
|
|
|
|
03-03-2022
|
CenterSnap: Single-Shot Multi-Object 3D Shape
Reconstruction and Categorical 6D Pose and Size
Estimation
by
Muhammad Zubair Irshad
et al
|
|
|
|
03-01-2022
|
When A Conventional Filter Meets Deep Learning: Basis
Composition Learning on Image Filters
by
Fu Lee Wang
et al
|
|
|
|
03-01-2022
|
Omni-frequency Channel-selection Representations for
Unsupervised Anomaly Detection
by
Yufei Liang
et al
|
|
|
|
03-03-2022
|
Revisiting Click-based Interactive Video Object
Segmentation
by
Stephane Vujasinovic
et al
|
|
|
|
03-02-2022
|
OVE6D: Object Viewpoint Encoding for Depth-based 6D
Object Pose Estimation
by
Dingding Cai
et al
|
|
|
|
03-01-2022
|
Stable, accurate and efficient deep neural networks for
inverse problems with analysis-sparse models
by
Maksym Neyra-Nesterenko
et al
|
|
|
|
03-03-2022
|
An Efficient Subpopulation-based Membership Inference
Attack
by
Shahbaz Rezaei
et al
|
|
|
|
03-04-2022
|
Carbon Footprint of Selecting and Training Deep
Learning Models for Medical Image Analysis
by
Raghavendra Selvan
et al
|
|
|
|
03-03-2022
|
Ad2Attack: Adaptive Adversarial Attack on Real-Time UAV
Tracking
by
Changhong Fu
et al
|
|
|
|
03-04-2022
|
Uncertainty Estimation for Heatmap-based Landmark
Localization
by
Lawrence Schobs
et al
|
|
|
|
03-03-2022
|
A study on the distribution of social biases in
self-supervised learning visual models
by
Kirill Sirotkin
et al
|
|
|
|
03-01-2022
|
Towards a unified view of unsupervised non-local
methods for image denoising: the NL-Ridge approach
by
Sébastien Herbreteau
et al
|
|
|
|
03-02-2022
|
A Simple and Universal Rotation Equivariant Point-cloud
Network
by
Ben Finkelshtein
et al
|
|
|
|
03-01-2022
|
MixSTE: Seq2seq Mixed Spatio-Temporal Encoder for 3D
Human Pose Estimation in Video
by
Jinlu Zhang
et al
|
|
|
|
03-01-2022
|
Bridge the Gap between Supervised and Unsupervised
Learning for Fine-Grained Classification
by
Jiabao Wang
et al
|
|
|
|
03-02-2022
|
Video Question Answering: Datasets, Algorithms and
Challenges
by
Yaoyao Zhong
et al
|
|
|
|
03-03-2022
|
Why adversarial training can hurt robust accuracy
by
Jacob Clarysse
et al
|
|
|
|
03-02-2022
|
Shape constrained CNN for segmentation guided
prediction of myocardial shape and pose parameters in
cardiac MRI
by
Sofie Tilborghs
et al
|
|
|
|
03-03-2022
|
CAFE: Learning to Condense Dataset by Aligning Features
by
Kai Wang
et al
|
|
|
|
03-03-2022
|
Exploring Patch-wise Semantic Relation for Contrastive
Learning in Image-to-Image Translation Tasks
by
Chanyong Jung
et al
|
|
|
|
03-01-2022
|
X-Trans2Cap: Cross-Modal Knowledge Transfer using
Transformer for 3D Dense Captioning
by
Zhihao Yuan
et al
|
|
|
|
03-03-2022
|
Curriculum-style Local-to-global Adaptation for
Cross-domain Remote Sensing Image Segmentation
by
Bo Zhang
et al
|
|
|
|
03-02-2022
|
Improving Lidar-Based Semantic Segmentation of Top-View
Grid Maps by Learning Features in Complementary
Representations
by
Frank Bieder
et al
|
|
|
|
03-02-2022
|
Hybrid Tracker with Pixel and Instance for Video
Panoptic Segmentation
by
Weicai Ye
et al
|
|
|
|
03-03-2022
|
Weakly Supervised Object Localization as Domain
Adaption
by
Lei Zhu
et al
|
|
|
|
03-03-2022
|
Region-of-Interest Based Neural Video Compression
by
Yura Perugachi-Diaz
et al
|
|
|
|
03-02-2022
|
Asynchronous Optimisation for Event-based Visual
Odometry
by
Daqi Liu
et al
|
|
|
|
03-01-2022
|
Boundary Corrected Multi-scale Fusion Network for
Real-time Semantic Segmentation
by
Tianjiao Jiang
et al
|
|
|
|
03-02-2022
|
A Generalized Approach for Cancellable Template and Its
Realization for Minutia Cylinder-Code
by
Xingbo Dong
et al
|
|
|
|
03-01-2022
|
Robust Seatbelt Detection and Usage Recognition for
Driver Monitoring Systems
by
Feng Hu
|
|
|
|
03-02-2022
|
Detecting Adversarial Perturbations in Multi-Task
Perception
by
Marvin Klingner
et al
|
|
|
|