03-10-2022
|
Model soups: averaging weights of multiple fine-tuned
models improves accuracy without increasing inference
time
by
Mitchell Wortsman
et al
|
|
|
|
03-09-2022
|
On the surprising tradeoff between ImageNet accuracy
and perceptual similarity
by
Manoj Kumar
et al
|
|
|
|
03-08-2022
|
EdgeFormer: Improving Light-weight ConvNets by Learning
from Vision Transformers
by
Haokui Zhang
et al
|
|
|
|
03-10-2022
|
Cluttered Food Grasping with Adaptive Fingers and
Synthetic-Data Trained Object Detection
by
Avinash Ummadisingu
et al
|
|
|
|
03-11-2022
|
The Role of ImageNet Classes in Fr\echet Inception
Distance
by
Tuomas Kynkäänniemi
et al
|
|
|
|
03-10-2022
|
LoopITR: Combining Dual and Cross Encoder Architectures
for Image-Text Retrieval
by
Jie Lei
et al
|
|
|
|
03-09-2022
|
Leveling Down in Computer Vision: Pareto Inefficiencies
in Fair Deep Classifiers
by
Dominik Zietlow
et al
|
|
|
|
03-10-2022
|
Conditional Prompt Learning for Vision-Language Models
by
Kaiyang Zhou
et al
|
|
|
|
03-08-2022
|
Dynamic Dual-Output Diffusion Models
by
Yaniv Benny
et al
|
|
|
|
03-10-2022
|
BEAT: A Large-Scale Semantic and Emotional Multi-Modal
Dataset for Conversational Gestures Synthesis
by
Haiyang Liu
et al
|
|
|
|
03-10-2022
|
StyleBabel: Artistic Style Tagging and Captioning
by
Dan Ruta
et al
|
|
|
|
03-08-2022
|
RC-MVSNet: Unsupervised Multi-View Stereo with Neural
Rendering
by
Di Chang
et al
|
|
|
|
03-09-2022
|
NLX-GPT: A Model for Natural Language Explanations in
Vision and Vision-Language Tasks
by
Fawaz Sammani
et al
|
|
|
|
03-10-2022
|
MORE: Multi-Order RElation Mining for Dense Captioning
in 3D Scenes
by
Yang Jiao
et al
|
|
|
|
03-08-2022
|
ART-Point: Improving Rotation Robustness of Point Cloud
Classifiers via Adversarial Rotation
by
Robin Wang
et al
|
|
|
|
03-10-2022
|
A Closer Look at Debiased Temporal Sentence Grounding
in Videos: Dataset, Metric, and Approach
by
Xiaohan Lan
et al
|
|
|
|
03-10-2022
|
Iterative Corresponding Geometry: Fusing Region and
Depth for Highly Efficient 3D Tracking of Textureless
Objects
by
Manuel Stoiber
et al
|
|
|
|
03-08-2022
|
StyleHEAT: One-Shot High-Resolution Editable Talking
Face Generation via Pretrained StyleGAN
by
Fei Yin
et al
|
|
|
|
03-11-2022
|
ActiveMLP: An MLP-like Architecture with Active Token
Mixer
by
Guoqiang Wei
et al
|
|
|
|
03-10-2022
|
Knowledge-enriched Attention Network with Group-wise
Semantic for Visual Storytelling
by
Tengpeng Li
et al
|
|
|
|
03-11-2022
|
FLAG: Flow-based 3D Avatar Generation from Sparse
Observations
by
Sadegh Aliakbarian
et al
|
|
|
|
03-11-2022
|
Masked Visual Pre-training for Motor Control
by
Tete Xiao
et al
|
|
|
|
03-08-2022
|
Multi-Modal Mixup for Robust Fine-tuning
by
Junhyuk So
et al
|
|
|
|
03-08-2022
|
Tuning-free multi-coil compressed sensing MRI with
Parallel Variable Density Approximate Message Passing
(P-VDAMP)
by
Charles Millard
et al
|
|
|
|
03-08-2022
|
Visual-Language Navigation Pretraining via Prompt-based
Environmental Self-exploration
by
Xiwen Liang
et al
|
|
|
|
03-09-2022
|
Pose Guided Multi-person Image Generation From Text
by
Soon Yau Cheong
et al
|
|
|
|
03-09-2022
|
A Unified Transformer Framework for Group-based
Segmentation: Co-Segmentation, Co-Saliency Detection
and Video Salient Object Detection
by
Yukun Su
et al
|
|
|
|
03-09-2022
|
FlexIT: Towards Flexible Semantic Image Translation
by
Guillaume Couairon
et al
|
|
|
|
03-08-2022
|
Semantic Distillation Guided Salient Object Detection
by
Bo Xu
et al
|
|
|
|
03-08-2022
|
Where Does the Performance Improvement Come From? - A
Reproducibility Concern about Image-Text Retrieval
by
Jun Rao
et al
|
|
|
|
03-09-2022
|
Mapping global dynamics of benchmark creation and
saturation in artificial intelligence
by
Adriano Barbosa-Silva
et al
|
|
|
|
03-10-2022
|
Hyperspectral Imaging for cherry tomato
by
Yun Xiang
et al
|
|
|
|
03-08-2022
|
On Generalizing Beyond Domains in Cross-Domain
Continual Learning
by
Christian Simon
et al
|
|
|
|
03-08-2022
|
Analyzing General-Purpose Deep-Learning Detection and
Segmentation Models with Images from a Lidar as a
Camera Sensor
by
Yu Xianjia
et al
|
|
|
|
03-09-2022
|
Model-Agnostic Multitask Fine-tuning for Few-shot
Vision-Language Transfer Learning
by
Zhenhailong Wang
et al
|
|
|
|
03-08-2022
|
Motron: Multimodal Probabilistic Human Motion
Forecasting
by
Tim Salzmann
et al
|
|
|
|
03-09-2022
|
Coarse-to-Fine Sparse Transformer for Hyperspectral
Image Reconstruction
by
Jing Lin
et al
|
|
|
|
03-11-2022
|
Democratizing Contrastive Language-Image Pre-training:
A CLIP Benchmark of Data, Model, and Supervision
by
Yufeng Cui
et al
|
|
|
|
03-08-2022
|
Source-free Domain Adaptation for Multi-site and
Lifespan Brain Skull Stripping
by
Yunxiang Li
et al
|
|
|
|
03-08-2022
|
Efficient and Accurate Hyperspectral Pansharpening
Using 3D VolumeNet and 2.5D Texture Transfer
by
Yinao Li
et al
|
|
|
|
03-10-2022
|
Synopses of Movie Narratives: a Video-Language Dataset
for Story Understanding
by
Yidan Sun
et al
|
|
|
|
03-08-2022
|
A New 27 Class Sign Language Dataset Collected from 173
Individuals
by
Arda Mavi
et al
|
|
|
|
03-09-2022
|
Low-light Image and Video Enhancement via Selective
Manipulation of Chromaticity
by
Sumit Shekhar
et al
|
|
|
|
03-09-2022
|
Cross-modal Map Learning for Vision and Language
Navigation
by
Georgios Georgakis
et al
|
|
|
|
03-08-2022
|
Breast cancer detection using artificial intelligence
techniques: A systematic literature review
by
Ali Bou Nassif
et al
|
|
|
|
03-10-2022
|
Zero-Shot Action Recognition with Transformer-based
Video Semantic Embedding
by
Keval Doshi
et al
|
|
|
|
03-10-2022
|
Online Deep Metric Learning via Mutual Distillation
by
Gao-Dong Liu
et al
|
|
|
|
03-09-2022
|
HDL: Hybrid Deep Learning for the Synthesis of
Myocardial Velocity Maps in Digital Twins for Cardiac
Analysis
by
Xiaodan Xing
et al
|
|
|
|
03-10-2022
|
Autofocusing+: Noise-Resilient Motion Correction in
Magnetic Resonance Imaging
by
Ekaterina Kuzmina
et al
|
|
|
|
03-10-2022
|
An Audio-Visual Attention Based Multimodal Network for
Fake Talking Face Videos Detection
by
Ganglai Wang
et al
|
|
|
|
03-09-2022
|
The Transitive Information Theory and its Application
to Deep Generative Models
by
Trung Ngo
et al
|
|
|
|
03-10-2022
|
Toward Efficient Hyperspectral Image Processing inside
Camera Pixels
by
Gourav Datta
et al
|
|
|
|
03-10-2022
|
Back to Reality: Weakly-supervised 3D Object Detection
with Shape-guided Label Enhancement
by
Xiuwei Xu
et al
|
|
|
|
03-10-2022
|
ReF -- Rotation Equivariant Features for Local Feature
Matching
by
Abhishek Peri
et al
|
|
|
|
03-08-2022
|
Understanding person identification via gait
by
Simon Hanisch
et al
|
|
|
|
03-10-2022
|
Representation Compensation Networks for Continual
Semantic Segmentation
by
Chang-Bin Zhang
et al
|
|
|
|
03-10-2022
|
AGCN: Augmented Graph Convolutional Network for
Lifelong Multi-label Image Recognition
by
Kaile Du
et al
|
|
|
|
03-09-2022
|
What Matters For Meta-Learning Vision Regression Tasks?
by
Ning Gao
et al
|
|
|
|
03-10-2022
|
Towards Less Constrained Macro-Neural Architecture
Search
by
Vasco Lopes
et al
|
|
|
|
03-09-2022
|
Triangular Character Animation Sampling with Motion,
Emotion, and Relation
by
Yizhou Zhao
et al
|
|
|
|
03-10-2022
|
Learning-based Localizability Estimation for Robust
LiDAR Localization
by
Julian Nubert
et al
|
|
|
|
03-11-2022
|
Multi-modal Graph Learning for Disease Prediction
by
Shuai Zheng
et al
|
|
|
|
03-11-2022
|
Graph Neural Networks for Relational Inductive Bias in
Vision-based Deep Reinforcement Learning of Robot
Control
by
Marco Oliva
et al
|
|
|
|
03-09-2022
|
Adaptive Trajectory Prediction via Transferable GNN
by
Yi Xu
et al
|
|
|
|
03-08-2022
|
End-to-end Multiple Instance Learning with Gradient
Accumulation
by
Axel Andersson
et al
|
|
|
|
03-08-2022
|
VoViT: Low Latency Graph-based Audio-Visual Voice
Separation Transformer
by
Juan F. Montesinos
et al
|
|
|
|
03-09-2022
|
CEU-Net: Ensemble Semantic Segmentation of
Hyperspectral Images Using Clustering
by
Nicholas Soucy
et al
|
|
|
|
03-11-2022
|
Flexible Amortized Variational Inference in qBOLD MRI
by
Ivor J. A. Simpson
et al
|
|
|
|
03-08-2022
|
Trustable Co-label Learning from Multiple Noisy
Annotators
by
Shikun Li
et al
|
|
|
|
03-09-2022
|
Ray Tracing-Guided Design of Plenoptic Cameras
by
Tim Michels
et al
|
|
|
|
03-08-2022
|
Selective-Supervised Contrastive Learning with Noisy
Labels
by
Shikun Li
et al
|
|
|
|
03-11-2022
|
WLASL-LEX: a Dataset for Recognising Phonological
Properties in American Sign Language
by
Federico Tavella
et al
|
|
|
|
03-08-2022
|
Sharing Generative Models Instead of Private Data: A
Simulation Study on Mammography Patch Classification
by
Zuzanna Szafranowska
et al
|
|
|
|
03-10-2022
|
Membership Privacy Protection for Image Translation
Models via Adversarial Knowledge Distillation
by
Saeed Ranjbar Alvar
et al
|
|
|
|
03-08-2022
|
Easy Ensemble: Simple Deep Ensemble Learning for
Sensor-Based Human Activity Recognition
by
Tatsuhito Hasegawa
et al
|
|
|
|
03-09-2022
|
A Tree-Structured Multi-Task Model Recommender
by
Lijun Zhang
et al
|
|
|
|
03-09-2022
|
Frequency-driven Imperceptible Adversarial Attack on
Semantic Similarity
by
Cheng Luo
et al
|
|
|
|
03-11-2022
|
ROOD-MRI: Benchmarking the robustness of deep learning
segmentation models to out-of-distribution and
corrupted data in MRI
by
Lyndon Boone
et al
|
|
|
|
03-09-2022
|
A Neuro-vector-symbolic Architecture for Solving Ravens
Progressive Matrices
by
Michael Hersche
et al
|
|
|
|
03-10-2022
|
Domain Generalization via Shuffled Style Assembly for
Face Anti-Spoofing
by
Zhuo Wang
et al
|
|
|
|
03-10-2022
|
Suspected Object Matters: Rethinking Models Prediction
for One-stage Visual Grounding
by
Yang Jiao
et al
|
|
|
|
03-08-2022
|
The Flag Median and FlagIRLS
by
Nathan Mankovich
et al
|
|
|
|
03-09-2022
|
Anti-Oversmoothing in Deep Vision Transformers via the
Fourier Domain Analysis: From Theory to Practice
by
Peihao Wang
et al
|
|
|
|
03-10-2022
|
A Survey of Surface Defect Detection of Industrial
Products Based on A Small Number of Labeled Data
by
Qifan Jin
et al
|
|
|
|
03-10-2022
|
Prediction-Guided Distillation for Dense Object
Detection
by
Chenhongyi Yang
et al
|
|
|
|
03-10-2022
|
EyeLoveGAN: Exploiting domain-shifts to boost network
learning with cycleGANs
by
Josefine Vilsbøll Sundgaard
et al
|
|
|
|
03-08-2022
|
YouTube-GDD: A challenging gun detection dataset with
rich contextual information
by
Yongxiang Gu
et al
|
|
|
|
03-10-2022
|
Information-Theoretic Odometry Learning
by
Sen Zhang
et al
|
|
|
|
03-08-2022
|
DeltaCNN: End-to-End CNN Inference of Sparse Frame
Differences in Videos
by
Mathias Parger
et al
|
|
|
|
03-10-2022
|
Domain Generalisation for Object Detection
by
Karthik Seemakurthy
et al
|
|
|
|
03-09-2022
|
OpenTAL: Towards Open Set Temporal Action Localization
by
Wentao Bao
et al
|
|
|
|
03-10-2022
|
TrueType Transformer: Character and Font Style
Recognition in Outline Format
by
Yusuke Nagata
et al
|
|
|
|
03-08-2022
|
ClearPose: Large-scale Transparent Object Dataset and
Benchmark
by
Xiaotong Chen
et al
|
|
|
|
03-11-2022
|
Detection of multiple retinal diseases in
ultra-widefield fundus images using deep learning:
data-driven identification of relevant regions
by
Justin Engelmann
et al
|
|
|
|
03-09-2022
|
Practical Evaluation of Adversarial Robustness via
Adaptive Auto Attack
by
Ye Liu
et al
|
|
|
|
03-10-2022
|
Learning Distinctive Margin toward Active Domain
Adaptation
by
Ming Xie
et al
|
|
|
|
03-09-2022
|
Improving Neural ODEs via Knowledge Distillation
by
Haoyu Chu
et al
|
|
|
|
03-09-2022
|
Align-Deform-Subtract: An Interventional Framework for
Explaining Object Differences
by
Cian Eastwood
et al
|
|
|
|
03-08-2022
|
A Gating Model for Bias Calibration in Generalized
Zero-shot Learning
by
Gukyeong Kwon
et al
|
|
|
|
03-09-2022
|
Simulation of Plenoptic Cameras
by
Tim Michels
et al
|
|
|
|
03-09-2022
|
Inadequately Pre-trained Models are Better Feature
Extractors
by
Andong Deng
et al
|
|
|
|
03-09-2022
|
Manifold Modeling in Quotient Space: Learning An
Invariant Mapping with Decodability of Image Patches
by
Tatsuya Yokota
et al
|
|
|
|
03-10-2022
|
An Empirical Investigation of 3D Anomaly Detection and
Segmentation
by
Eliahu Horwitz
et al
|
|
|
|
03-11-2022
|
Deep AutoAugment
by
Yu Zheng
et al
|
|
|
|
03-08-2022
|
Data augmentation with mixtures of max-entropy
transformations for filling-level classification
by
Apostolos Modas
et al
|
|
|
|
03-08-2022
|
Dynamic Group Transformer: A General Vision Transformer
Backbone with Dynamic Group Attention
by
Kai Liu
et al
|
|
|
|
03-10-2022
|
GrainSpace: A Large-scale Dataset for Fine-grained and
Domain-adaptive Recognition of Cereal Grains
by
Lei Fan
et al
|
|
|
|
03-08-2022
|
Robust Multi-Task Learning and Online Refinement for
Spacecraft Pose Estimation across Domain Gap
by
Tae Ha Park
et al
|
|
|
|
03-08-2022
|
Learning to Erase the Bayer-Filter to See in the Dark
by
Xingbo Dong
et al
|
|
|
|
03-08-2022
|
MICDIR: Multi-scale Inverse-consistent Deformable Image
Registration using UNetMSS with Self-Constructing Graph
Latent
by
Soumick Chatterjee
et al
|
|
|
|
03-11-2022
|
AI-enabled Automatic Multimodal Fusion of Cone-Beam CT
and Intraoral Scans for Intelligent 3D Tooth-Bone
Reconstruction and Clinical Applications
by
Jin Hao
et al
|
|
|
|
03-09-2022
|
Intention-aware Feature Propagation Network for
Interactive Segmentation
by
Chuyu Zhang
et al
|
|
|
|
03-09-2022
|
Multiscale Convolutional Transformer with Center Mask
Pretraining for Hyperspectral Image Classificationtion
by
Yifan Wang
et al
|
|
|
|
03-10-2022
|
MVP: Multimodality-guided Visual Pre-training
by
Longhui Wei
et al
|
|
|
|
03-08-2022
|
Evolutionary Neural Cascade Search across Supernetworks
by
Alexander Chebykin
et al
|
|
|
|
03-10-2022
|
NeRFocus: Neural Radiance Field for 3D Synthetic
Defocus
by
Yinhuai Wang
et al
|
|
|
|
03-08-2022
|
Mutual Contrastive Learning to Disentangle Whole Slide
Image Representations for Glioma Grading
by
Lipei Zhang
et al
|
|
|
|