05-24-2022
|
OnePose: One-Shot Object Pose Estimation without CAD
Models
by
Jiaming Sun
et al
|
|
|
|
05-25-2022
|
Neural 3D Reconstruction in the Wild
by
Jiaming Sun
et al
|
|
|
|
05-25-2022
|
Pretraining is All You Need for Image-to-Image
Translation
by
Tengfei Wang
et al
|
|
|
|
05-25-2022
|
An Evolutionary Approach to Dynamic Introduction of
Tasks in Large-scale Multitask Learning Systems
by
Andrea Gesmundo
et al
|
|
|
|
05-24-2022
|
StylizedNeRF: Consistent 3D Scene Stylization as
Stylized NeRF via 2D-3D Mutual Learning
by
Yi-Hua Huang
et al
|
|
|
|
05-25-2022
|
Fine-grained Image Captioning with CLIP Reward
by
Jaemin Cho
et al
|
|
|
|
05-26-2022
|
Green Hierarchical Vision Transformer for Masked Image
Modeling
by
Lang Huang
et al
|
|
|
|
05-25-2022
|
Inception Transformer
by
Chenyang Si
et al
|
|
|
|
05-26-2022
|
AdaptFormer: Adapting Vision Transformers for Scalable
Visual Recognition
by
Shoufa Chen
et al
|
|
|
|
05-25-2022
|
Multimodal Knowledge Alignment with Reinforcement
Learning
by
Youngjae Yu
et al
|
|
|
|
05-24-2022
|
Trajectory Optimization for Physics-Based
Reconstruction of 3d Human Pose from Monocular Video
by
Erik Gärtner
et al
|
|
|
|
05-26-2022
|
One-Shot Face Reenactment on Megapixels
by
Wonjun Kang
et al
|
|
|
|
05-27-2022
|
Sharpness-Aware Training for Free
by
Jiawei Du
et al
|
|
|
|
05-24-2022
|
Rethinking Evaluation Practices in Visual Question
Answering: A Case Study on Out-of-Distribution
Generalization
by
Aishwarya Agrawal
et al
|
|
|
|
05-27-2022
|
GIT: A Generative Image-to-text Transformer for Vision
and Language
by
Jianfeng Wang
et al
|
|
|
|
05-24-2022
|
mPLUG: Effective and Efficient Vision-Language Learning
by Cross-modal Skip-connections
by
Chenliang Li
et al
|
|
|
|
05-26-2022
|
SHREC 2022: pothole and crack detection in the road
pavement using images and RGB-D data
by
Elia Moscoso Thompson
et al
|
|
|
|
05-26-2022
|
Matryoshka Representations for Adaptive Deployment
by
Aditya Kusupati
et al
|
|
|
|
05-25-2022
|
Misleading Deep-Fake Detection with GAN Fingerprints
by
Vera Wesselkamp
et al
|
|
|
|
05-25-2022
|
Mutual Information Divergence: A Unified Metric for
Multimodal Generative Models
by
Jin-Hwa Kim
et al
|
|
|
|
05-27-2022
|
3DILG: Irregular Latent Grids for 3D Generative
Modeling
by
Biao Zhang
et al
|
|
|
|
05-26-2022
|
BEVFusion: Multi-Task Multi-Sensor Fusion with Unified
Birds-Eye View Representation
by
Zhijian Liu
et al
|
|
|
|
05-27-2022
|
Bongard-HOI: Benchmarking Few-Shot Visual Reasoning for
Human-Object Interactions
by
Huaizu Jiang
et al
|
|
|
|
05-26-2022
|
PREF: Phasorial Embedding Fields for Compact Neural
Representations
by
Binbin Huang
et al
|
|
|
|
05-24-2022
|
ASSET: Autoregressive Semantic Scene Editing with
Transformers at High Resolutions
by
Difan Liu
et al
|
|
|
|
05-26-2022
|
Measuring Perceptual Color Differences of Smartphone
Photography
by
Zhihua Wang
et al
|
|
|
|
05-24-2022
|
Cross-Domain Style Mixing for Face Cartoonization
by
Seungkwon Kim
et al
|
|
|
|
05-25-2022
|
How explainable are adversarially-robust CNNs?
by
Mehdi Nourelahi
et al
|
|
|
|
05-24-2022
|
Naive Few-Shot Learning: Sequence Consistency
Evaluation
by
Tomer Barak
et al
|
|
|
|
05-26-2022
|
Tree Reconstruction using Topology Optimisation
by
Thomas Lowe
et al
|
|
|
|
05-27-2022
|
X-ViT: High Performance Linear Vision Transformer
without Softmax
by
Jeonggeun Song
et al
|
|
|
|
05-25-2022
|
The Dialog Must Go On: Improving Visual Dialog via
Generative Self-Training
by
Gi-Cheon Kang
et al
|
|
|
|
05-26-2022
|
TransBoost: Improving the Best ImageNet Performance
using Deep Transduction
by
Omer Belhasin
et al
|
|
|
|
05-26-2022
|
Denial-of-Service Attacks on Learned Image Compression
by
Kang Liu
et al
|
|
|
|
05-26-2022
|
Revealing the Dark Secrets of Masked Image Modeling
by
Zhenda Xie
et al
|
|
|
|
05-26-2022
|
BppAttack: Stealthy and Efficient Trojan Attacks
against Deep Neural Networks via Image Quantization and
Contrastive Adversarial Learning
by
Zhenting Wang
et al
|
|
|
|
05-26-2022
|
DGSVis: Visual Analysis of Hierarchical Snapshots in
Dynamic Graph
by
Baofeng Chang
|
|
|
|
05-27-2022
|
Neural Basis Models for Interpretability
by
Filip Radenovic
et al
|
|
|
|
05-24-2022
|
Jointly Optimizing Color Rendition and In-Camera
Backgrounds in an RGB Virtual Production Stage
by
Chloe LeGendre
et al
|
|
|
|
05-24-2022
|
A Wireless-Vision Dataset for Privacy Preserving Human
Activity Recognition
by
Yanling Hao
et al
|
|
|
|
05-25-2022
|
Online Deep Equilibrium Learning for Regularization by
Denoising
by
Jiaming Liu
et al
|
|
|
|
05-26-2022
|
DeepTechnome: Mitigating Unknown Bias in Deep Learning
Based Assessment of CT Images
by
Simon Langer
et al
|
|
|
|
05-24-2022
|
Diffuse Map Guiding Unsupervised Generative Adversarial
Network for SVBRDF Estimation
by
Zhiyao Luo
et al
|
|
|
|
05-26-2022
|
SemAffiNet: Semantic-Affine Transformation for Point
Cloud Segmentation
by
Ziyi Wang
et al
|
|
|
|
05-27-2022
|
Contrastive Learning Rivals Masked Image Modeling in
Fine-tuning via Feature Distillation
by
Yixuan Wei
et al
|
|
|
|
05-27-2022
|
Video2StyleGAN: Disentangling Local and Global
Variations in a Video
by
Rameen Abdal
et al
|
|
|
|
05-26-2022
|
On the Eigenvalues of Global Covariance Pooling for
Fine-grained Visual Recognition
by
Yue Song
et al
|
|
|
|
05-25-2022
|
Crossmodal-3600: A Massively Multilingual Multimodal
Evaluation Dataset
by
Ashish V. Thapliyal
et al
|
|
|
|
05-26-2022
|
Continual evaluation for lifelong learning: Identifying
the stability gap
by
Matthias De Lange
et al
|
|
|
|
05-25-2022
|
DisinfoMeme: A Multimodal Dataset for Detecting Meme
Intentionally Spreading Out Disinformation
by
Jingnong Qu
et al
|
|
|
|
05-24-2022
|
Classification of Phonological Parameters in Sign
Languages
by
Boris Mocialov
et al
|
|
|
|
05-25-2022
|
Open-Domain Sign Language Translation Learned from
Online Video
by
Bowen Shi
et al
|
|
|
|
05-26-2022
|
AI for Porosity and Permeability Prediction from
Geologic Core X-Ray Micro-Tomography
by
Zangir Iklassov
et al
|
|
|
|
05-26-2022
|
SARS-CoV-2 Result Interpretation based on Image
Analysis of Lateral Flow Devices
by
Neeraj Vashistha
|
|
|
|
05-26-2022
|
FCN-Pose: A Pruned and Quantized CNN for Robot Pose
Estimation for Constrained Devices
by
Marrone Silvério Melo Dantas
et al
|
|
|
|
05-25-2022
|
An Empirical Study on Distribution Shift Robustness
From the Perspective of Pre-Training and Data
Augmentation
by
Ziquan Liu
et al
|
|
|
|
05-24-2022
|
Aerial Vision-and-Dialog Navigation
by
Yue Fan
et al
|
|
|
|
05-24-2022
|
CDFKD-MFS: Collaborative Data-free Knowledge
Distillation via Multi-level Feature Sharing
by
Zhiwei Hao
et al
|
|
|
|
05-26-2022
|
Objects Matter: Learning Object Relation Graph for
Robust Camera Relocalization
by
Chengyu Qiao
et al
|
|
|
|
05-24-2022
|
Improving Human Image Synthesis with Residual Fast
Fourier Transformation and Wasserstein Distance
by
Jianhan Wu
et al
|
|
|
|
05-27-2022
|
Simple Unsupervised Object-Centric Learning for Complex
and Naturalistic Videos
by
Gautam Singh
et al
|
|
|
|
05-26-2022
|
Efficient U-Transformer with Boundary-Aware Loss for
Action Segmentation
by
Dazhao Du
et al
|
|
|
|
05-26-2022
|
A Model or 603 Exemplars: Towards Memory-Efficient
Class-Incremental Learning
by
Da-Wei Zhou
et al
|
|
|
|
05-26-2022
|
Fast Vision Transformers with HiLo Attention
by
Zizheng Pan
et al
|
|
|
|
05-25-2022
|
Learning to segment with limited annotations:
Self-supervised pretraining with regression and
contrastive loss in MRI
by
Lavanya Umapathy
et al
|
|
|
|
05-27-2022
|
Scalable Interpretability via Polynomials
by
Abhimanyu Dubey
et al
|
|
|
|
05-25-2022
|
Spotlights: Probing Shapes from Spherical Viewpoints
by
Jiaxin Wei
et al
|
|
|
|
05-26-2022
|
Acute Lymphoblastic Leukemia Detection Using
Hypercomplex-Valued Convolutional Neural Networks
by
Guilherme Vieira
et al
|
|
|
|
05-26-2022
|
2D versus 3D Convolutional Spiking Neural Networks
Trained with Unsupervised STDP for Human Action
Recognition
by
Mireille El-Assal
et al
|
|
|
|
05-26-2022
|
Task-Customized Self-Supervised Pre-training with
Scalable Dynamic Routing
by
Zhili Liu
et al
|
|
|
|
05-26-2022
|
MemeTector: Enforcing deep focus for meme detection
by
Christos Koutlis
et al
|
|
|
|
05-26-2022
|
A Physical-World Adversarial Attack Against 3D Face
Recognition
by
Yanjie Li
et al
|
|
|
|
05-24-2022
|
Differentiable Dynamics for Articulated 3d Human Motion
Reconstruction
by
Erik Gärtner
et al
|
|
|
|
05-26-2022
|
Learning What and Where -- Unsupervised Disentangling
Location and Identity Tracking
by
Manuel Traub
et al
|
|
|
|
05-24-2022
|
HiVLP: Hierarchical Vision-Language Pre-Training for
Fast Image-Text Retrieval
by
Feilong Chen
et al
|
|
|
|
05-24-2022
|
Gacs-Korner Common Information Variational Autoencoder
by
Michael Kleinman
et al
|
|
|
|
05-24-2022
|
A CNN with Noise Inclined Module and Denoise Framework
for Hyperspectral Image Classification
by
Zhiqiang Gong
et al
|
|
|
|
05-26-2022
|
Surround-view Fisheye Camera Perception for Automated
Driving: Overview, Survey and Challenges
by
Varun Ravi Kumar
et al
|
|
|
|
05-25-2022
|
TreEnhance: An Automatic Tree-Search Based Method for
Low-Light Image Enhancement
by
Marco Cotogni
et al
|
|
|
|
05-24-2022
|
Optimizing Performance of Federated Person
Re-identification: Benchmarking and Analysis
by
Weiming Zhuang
et al
|
|
|
|
05-24-2022
|
3D helical CT reconstruction with memory efficient
invertible Learned Primal-Dual method
by
Buda Bajić
et al
|
|
|
|
05-26-2022
|
Penalizing Proposals using Classifiers for
Semi-Supervised Object Detection
by
Somnath Hazra
et al
|
|
|
|
05-26-2022
|
Transferable Adversarial Attack based on Integrated
Gradients
by
Yi Huang
et al
|
|
|
|
05-27-2022
|
Deep Learning Fetal Ultrasound Video Model Match Human
Observers in Biometric Measurements
by
Szymon Płotka
et al
|
|
|
|
05-24-2022
|
Hierarchical Vectorization for Portrait Images
by
Qian Fu
et al
|
|
|
|
05-27-2022
|
Fine-tuning deep learning models for stereo matching
using results from semi-global matching
by
Hessah Albanwan
et al
|
|
|
|
05-24-2022
|
Learning to Assemble Geometric Shapes
by
Jinhwi Lee
et al
|
|
|
|
05-24-2022
|
sat2pc: Estimating Point Cloud of Building Roofs from
2D Satellite Images
by
Yoones Rezaei
et al
|
|
|
|
05-24-2022
|
Skin Cancer Diagnostics with an All-Inclusive
Smartphone Application
by
Upender Kalwa
et al
|
|
|
|
05-26-2022
|
Semantic Segmentation for Thermal Images: A Comparative
Survey
by
Zülfiye Kütük
et al
|
|
|
|
05-26-2022
|
Cross-Architecture Self-supervised Video Representation
Learning
by
Sheng Guo
et al
|
|
|
|
05-25-2022
|
Exploring Map-based Features for Efficient
Attention-based Vehicle Motion Prediction
by
Carlos Gómez-Huélamo
et al
|
|
|
|
05-25-2022
|
VizInspect Pro -- Automated Optical Inspection (AOI)
solution
by
Faraz Waseem
et al
|
|
|
|
05-25-2022
|
People counting system for retail analytics using edge
AI
by
Karthik Reddy Kanjula
et al
|
|
|
|
05-24-2022
|
AFNet-M: Adaptive Fusion Network with Masks for 2D+3D
Facial Expression Recognition
by
Mingzhe Sui
et al
|
|
|
|
05-25-2022
|
Designing an Efficient End-to-end Machine Learning
Pipeline for Real-time Empty-shelf Detection
by
Dipendra Jha
et al
|
|
|
|
05-25-2022
|
Deep Gradient Learning for Efficient Camouflaged Object
Detection
by
Ge-Peng Ji
et al
|
|
|
|
05-24-2022
|
G-Rep: Gaussian Representation for Arbitrary-Oriented
Object Detection
by
Liping Hou
et al
|
|
|
|
05-24-2022
|
Robust 3D Object Detection in Cold Weather Conditions
by
Aldi Piroli
et al
|
|
|
|
05-25-2022
|
RADNet: Ensemble Model for Robust Glaucoma
Classification in Color Fundus Images
by
Dmitrii Medvedev
et al
|
|
|
|
05-26-2022
|
Social Interpretable Tree for Pedestrian Trajectory
Prediction
by
Liushuai Shi
et al
|
|
|
|
05-25-2022
|
Deniable Steganography
by
Yong Xu
et al
|
|
|
|
05-24-2022
|
Collaborative 3D Object Detection for Automatic Vehicle
Systems via Learnable Communications
by
Junyong Wang
et al
|
|
|
|
05-24-2022
|
Effect of Gender, Pose and Camera Distance on Human
Body Dimensions Estimation
by
Yansel Gónzalez Tejeda
et al
|
|
|
|
05-26-2022
|
Fight Poison with Poison: Detecting Backdoor Poison
Samples via Decoupling Benign Correlations
by
Xiangyu Qi
et al
|
|
|
|
05-25-2022
|
MoCoViT: Mobile Convolutional Vision Transformer
by
Hailong Ma
et al
|
|
|
|
05-25-2022
|
Breaking the Chain of Gradient Leakage in Vision
Transformers
by
Yahui Liu
et al
|
|
|
|
05-24-2022
|
Symbolic Expression Transformer: A Computer Vision
Approach for Symbolic Regression
by
Jiachen Li
et al
|
|
|
|
05-27-2022
|
Comparison of Deep Learning Segmentation and
Multigrader-annotated Mandibular Canals of Multicenter
CBCT scans
by
Jorma Järnstedt
et al
|
|
|
|
05-25-2022
|
A Comparative Study of Gastric Histopathology Sub-size
Image Classification: from Linear Regression to Visual
Transformer
by
Weiming Hu
et al
|
|
|
|
05-26-2022
|
VectorAdam for Rotation Equivariant Geometry
Optimization
by
Selena Ling
et al
|
|
|
|
05-24-2022
|
Sim-To-Real Transfer of Visual Grounding for
Human-Aided Ambiguity Resolution
by
Georgios Tziafas
et al
|
|
|
|
05-24-2022
|
TraCon: A novel dataset for real-time traffic cones
detection using deep learning
by
Iason Katsamenis
et al
|
|
|
|
05-26-2022
|
VIDI: A Video Dataset of Incidents
by
Duygu Sesver
et al
|
|
|
|
05-25-2022
|
Image Colorization using U-Net with Skip Connections
and Fusion Layer on Landscape Images
by
Muhammad Hisyam Zayd
et al
|
|
|
|
05-26-2022
|
ANISE: Assembly-based Neural Implicit Surface
rEconstruction
by
Dmitry Petrov
et al
|
|
|
|
05-24-2022
|
GLOBUS: GLObal Building heights for Urban Studies
by
Harsh G. Kamath
et al
|
|
|
|
05-25-2022
|
You Need to Read Again: Multi-granularity Perception
Network for Moment Retrieval in Videos
by
Xin Sun
et al
|
|
|
|
05-25-2022
|
Semantic-Aware Representation Blending for Multi-Label
Image Recognition with Partial Labels
by
Tao Pu
et al
|
|
|
|
05-26-2022
|
Analytical Interpretation of Latent Codes in InfoGAN
with SAR Images
by
Zhenpeng Feng
et al
|
|
|
|
05-24-2022
|
Interaction of a priori Anatomic Knowledge with
Self-Supervised Contrastive Learning in Cardiac
Magnetic Resonance Imaging
by
Makiya Nakashima
et al
|
|
|
|
05-25-2022
|
Structured Uncertainty in the Observation Space of
Variational Autoencoders
by
James Langley
et al
|
|
|
|
05-25-2022
|
To image, or not to image: Class-specific diffractive
cameras with all-optical erasure of undesired objects
by
Bijie Bai
et al
|
|
|
|
05-24-2022
|
An interpretation of the final fully connected layer
by
Siddhartha
|
|
|
|
05-25-2022
|
Contrastive Learning with Boosted Memorization
by
Zhihan Zhou
et al
|
|
|
|
05-26-2022
|
SwinVRNN: A Data-Driven Ensemble Forecasting Model via
Learned Distribution Perturbation
by
Yuan Hu
et al
|
|
|
|
05-26-2022
|
Analyzing the Latent Space of GAN through Local
Dimension Estimation
by
Jaewoong Choi
et al
|
|
|
|
05-26-2022
|
Light Field Raindrop Removal via 4D Re-sampling
by
Dong Jing
et al
|
|
|
|
05-25-2022
|
MUG: Multi-human Graph Network for 3D Mesh
Reconstruction from 2D Pose
by
Chenyan Wu
et al
|
|
|
|