09-09-2021
|
IICNet: A Generic Framework for Reversible Image
Conversion
by
Ka Leong Cheng
et al
|
|
|
|
09-07-2021
|
PP-OCRv2: Bag of Tricks for Ultra Lightweight OCR
System
by
Yuning Du
et al
|
|
|
|
09-07-2021
|
Sensor-Augmented Egocentric-Video Captioning with
Dynamic Modal Attention
by
Katsuyuki Nakamura
et al
|
|
|
|
09-08-2021
|
OSSR-PID: One-Shot Symbol Recognition in P&ID
Sheets using Path Sampling and GCN
by
Shubham Paliwal
et al
|
|
|
|
09-09-2021
|
TxT: Crossmodal End-to-End Learning with Transformers
by
Jan-Martin O. Steitz
et al
|
|
|
|
09-09-2021
|
Talk-to-Edit: Fine-Grained Facial Editing via Dialog
by
Yuming Jiang
et al
|
|
|
|
09-09-2021
|
UCTransNet: Rethinking the Skip Connections in U-Net
from a Channel-wise Perspective with Transformer
by
Haonan Wang
et al
|
|
|
|
09-07-2021
|
Perceptual Learned Video Compression with Recurrent
Conditional GAN
by
Ren Yang
et al
|
|
|
|
09-07-2021
|
Brand Label Albedo Extraction of eCommerce Products
using Generative Adversarial Network
by
Suman Sapkota
et al
|
|
|
|
09-08-2021
|
Toward Real-World Super-Resolution via Adaptive
Downsampling Models
by
Sanghyun Son
et al
|
|
|
|
09-08-2021
|
Unfolding Taylor's Approximations for Image Restoration
by
Man Zhou
et al
|
|
|
|
09-07-2021
|
Multi-Branch Deep Radial Basis Function Networks for
Facial Emotion Recognition
by
Fernanda Hernández-Luquin
et al
|
|
|
|
09-07-2021
|
ICCAD Special Session Paper: Quantum-Classical Hybrid
Machine Learning for Image Classification
by
Mahabubul Alam
et al
|
|
|
|
09-07-2021
|
nnFormer: Interleaved Transformer for Volumetric
Segmentation
by
Hong-Yu Zhou
et al
|
|
|
|
09-08-2021
|
Which and Where to Focus: A Simple yet Accurate
Framework for Arbitrary-Shaped Nearby Text Detection in
Scene Images
by
Youhui Guo
et al
|
|
|
|
09-09-2021
|
Per Garment Capture and Synthesis for Real-time Virtual
Try-on
by
Toby Chong
et al
|
|
|
|
09-07-2021
|
Learning Fast Sample Re-weighting Without Reward Data
by
Zizhao Zhang
et al
|
|
|
|
09-09-2021
|
Tiny CNN for feature point description for document
analysis: approach and dataset
by
A. Sheshkus
et al
|
|
|
|
09-09-2021
|
Multilingual Audio-Visual Smartphone Dataset And
Evaluation
by
Hareesh Mandalapu
et al
|
|
|
|
09-07-2021
|
Self-supervised Tumor Segmentation through Layer
Decomposition
by
Xiaoman Zhang
et al
|
|
|
|
09-08-2021
|
Egocentric View Hand Action Recognition by Leveraging
Hand Surface and Hand Grasp Type
by
Sangpil Kim
et al
|
|
|
|
09-08-2021
|
FIDNet: LiDAR Point Cloud Semantic Segmentation with
Fully Interpolation Decoding
by
Yiming Zhao
et al
|
|
|
|
09-08-2021
|
Temporal RoI Align for Video Object Recognition
by
Tao Gong
et al
|
|
|
|
09-08-2021
|
FaceCook: Face Generation Based on Linear Scaling
Factors
by
Tianren Wang
et al
|
|
|
|
09-07-2021
|
Rethinking Common Assumptions to Mitigate Racial Bias
in Face Recognition Datasets
by
Matthew Gwilliam
et al
|
|
|
|
09-10-2021
|
Residual 3D Scene Flow Learning with Context-Aware
Feature Extraction
by
Guangming Wang
et al
|
|
|
|
09-07-2021
|
Unpaired Adversarial Learning for Single Image
Deraining with Rain-Space Contrastive Constraints
by
Xiang Chen
et al
|
|
|
|
09-07-2021
|
FuseFormer: Fusing Fine-Grained Information in
Transformers for Video Inpainting
by
Rui Liu
et al
|
|
|
|
09-07-2021
|
Smart Traffic Monitoring System using Computer Vision
and Edge Computing
by
Guanxiong Liu
et al
|
|
|
|
09-09-2021
|
PhysGNN: A Physics-Driven Graph Neural Network Based
Model for Predicting Soft Tissue Deformation in
Image-Guided Neurosurgery
by
Yasmin Salehi
et al
|
|
|
|
09-07-2021
|
Fishr: Invariant Gradient Variances for
Out-of-distribution Generalization
by
Alexandre Rame
et al
|
|
|
|
09-09-2021
|
ErfAct: Non-monotonic smooth trainable Activation
Functions
by
Koushik Biswas
et al
|
|
|
|
09-07-2021
|
Evaluation of an Audio-Video Multimodal Deepfake
Dataset using Unimodal and Multimodal Detectors
by
Hasam Khalid
et al
|
|
|
|
09-07-2021
|
Grassmannian Graph-attentional Landmark Selection for
Domain Adaptation
by
Bin Sun
et al
|
|
|
|
09-07-2021
|
Rendezvous: Attention Mechanisms for the Recognition of
Surgical Action Triplets in Endoscopic Videos
by
Chinedu Innocent Nwoye
et al
|
|
|
|
09-08-2021
|
Time Alignment using Lip Images for Frame-based
Electrolaryngeal Voice Conversion
by
Yi-Syuan Liou
et al
|
|
|
|
09-08-2021
|
Adaptive Few-Shot Learning PoC Ultrasound COVID-19
Diagnostic System
by
Michael Karnes
et al
|
|
|
|
09-09-2021
|
EVOQUER: Enhancing Temporal Grounding with
Video-Pivoted BackQuery Generation
by
Yanjun Gao
et al
|
|
|
|
09-07-2021
|
Melatect: A Machine Learning Model Approach For
Identifying Malignant Melanoma in Skin Growths
by
Vidushi Meel
et al
|
|
|
|
09-08-2021
|
Pose-guided Inter- and Intra-part Relational
Transformer for Occluded Person Re-Identification
by
Zhongxing Ma
et al
|
|
|
|
09-08-2021
|
Shuffled Patch-Wise Supervision for Presentation Attack
Detection
by
Alperen Kantarcı
et al
|
|
|
|
09-09-2021
|
NEAT: Neural Attention Fields for End-to-End Autonomous
Driving
by
Kashyap Chitta
et al
|
|
|
|
09-10-2021
|
Automatic Displacement and Vibration Measurement in
Laboratory Experiments with A Deep Learning Method
by
Yongsheng Bai
et al
|
|
|
|
09-08-2021
|
Scaled ReLU Matters for Training Vision Transformers
by
Pichao Wang
et al
|
|
|
|
09-08-2021
|
fastMRI+: Clinical Pathology Annotations for Knee and
Brain Fully Sampled Multi-Coil MRI Data
by
Ruiyang Zhao
et al
|
|
|
|
09-09-2021
|
Fair Conformal Predictors for Applications in Medical
Imaging
by
Charles Lu
et al
|
|
|
|
09-10-2021
|
PIP: Physical Interaction Prediction via Mental Imagery
with Span Selection
by
Jiafei Duan
et al
|
|
|
|
09-08-2021
|
Improving Building Segmentation for Off-Nadir Satellite
Imagery
by
Hanxiang Hao
et al
|
|
|
|
09-10-2021
|
EfficientCLIP: Efficient Cross-Modal Pre-training by
Ensemble Confident Learning and Language Modeling
by
Jue Wang
et al
|
|
|
|
09-10-2021
|
Face-NMS: A Core-set Selection Approach for Efficient
Face Recognition
by
Yunze Chen
et al
|
|
|
|
09-07-2021
|
CovarianceNet: Conditional Generative Model for Correct
Covariance Prediction in Human Motion Prediction
by
Aleksey Postnikov
et al
|
|
|
|
09-09-2021
|
HSMD: An object motion detection algorithm using a
Hybrid Spiking Neural Network Architecture
by
Pedro Machado
et al
|
|
|
|
09-07-2021
|
Learning to Combine the Modalities of Language and
Video for Temporal Moment Localization
by
Jungkyoo Shin
et al
|
|
|
|
09-10-2021
|
TADA: Taxonomy Adaptive Domain Adaptation
by
Rui Gong
et al
|
|
|
|
09-10-2021
|
View Blind-spot as Inpainting: Self-Supervised
Denoising with Mask Guided Residual Convolution
by
Yuhongze Zhou
et al
|
|
|
|
09-10-2021
|
Mesh convolutional neural networks for wall shear
stress estimation in 3D artery models
by
Julian Suk
et al
|
|
|
|
09-07-2021
|
Resolving gas bubbles ascending in liquid metal from
low-SNR neutron radiography images
by
Mihails Birjukovs
et al
|
|
|
|
09-09-2021
|
Automatic Portrait Video Matting via Context Motion
Network
by
Qiqi Hou
et al
|
|
|
|
09-10-2021
|
Line as a Visual Sentence: Context-aware Line
Descriptor for Visual Localization
by
Sungho Yoon
et al
|
|
|
|
09-08-2021
|
Panoptic nuScenes: A Large-Scale Benchmark for LiDAR
Panoptic Segmentation and Tracking
by
Whye Kit Fong
et al
|
|
|
|
09-09-2021
|
Dynamic Modeling of Hand-Object Interactions via
Tactile Sensing
by
Qiang Zhang
et al
|
|
|
|
09-09-2021
|
Preservational Learning Improves Self-supervised
Medical Image Models by Reconstructing Diverse Contexts
by
Hong-Yu Zhou
et al
|
|
|
|
09-09-2021
|
Object recognition for robotics from tactile time
series data utilising different neural network
architectures
by
Wolfgang Bottcher
et al
|
|
|
|
09-09-2021
|
Taming Self-Supervised Learning for Presentation Attack
Detection: In-Image De-Folding and Out-of-Image
De-Mixing
by
Haozhe Liu
et al
|
|
|
|
09-08-2021
|
Axial multi-layer perceptron architecture for automatic
segmentation of choroid plexus in multiple sclerosis
by
Marius Schmidt-Mengin
et al
|
|
|
|
09-07-2021
|
Improving Phenotype Prediction using Long-Range
Spatio-Temporal Dynamics of Functional Connectivity
by
Simon Dahan
et al
|
|
|
|
09-08-2021
|
Identification of Social-Media Platform of Videos
through the Use of Shared Features
by
Luca Maiano
et al
|
|
|
|
09-10-2021
|
An Empirical Study of GPT-3 for Few-Shot
Knowledge-Based VQA
by
Zhengyuan Yang
et al
|
|
|
|
09-09-2021
|
ConvMLP: Hierarchical Convolutional MLPs for Vision
by
Jiachen Li
et al
|
|
|
|
09-08-2021
|
Digitize-PID: Automatic Digitization of Piping and
Instrumentation Diagrams
by
Shubham Paliwal
et al
|
|
|
|
09-09-2021
|
Towards Transferable Adversarial Attacks on Vision
Transformers
by
Zhipeng Wei
et al
|
|
|
|
09-10-2021
|
Detection of GAN-synthesized street videos
by
Omran Alamayreh
et al
|
|
|
|
09-09-2021
|
Single Image 3D Object Estimation with Primitive Graph
Networks
by
Qian He
et al
|
|
|
|
09-08-2021
|
Improving Deep Metric Learning by Divide and Conquer
by
Artsiom Sanakoyeu
et al
|
|
|
|
09-07-2021
|
Simple Video Generation using Neural ODEs
by
David Kanaa
et al
|
|
|
|
09-07-2021
|
Self-Supervised Representation Learning using Visual
Field Expansion on Digital Pathology
by
Joseph Boyd
et al
|
|
|
|
09-07-2021
|
Certifiable Outlier-Robust Geometric Perception: Exact
Semidefinite Relaxations and Scalable Global
Optimization
by
Heng Yang
et al
|
|
|
|
09-09-2021
|
IFBiD: Inference-Free Bias Detection
by
Ignacio Serna
et al
|
|
|
|
09-10-2021
|
Saliency Guided Experience Packing for Replay in
Continual Learning
by
Gobinda Saha
et al
|
|
|
|
09-09-2021
|
Neural-IMLS: Learning Implicit Moving Least-Squares for
Surface Reconstruction from Unoriented Point clouds
by
Zixiong Wang
et al
|
|
|
|
09-09-2021
|
Is Attention Better Than Matrix Decomposition?
by
Zhengyang Geng
et al
|
|
|
|
09-08-2021
|
Modified Supervised Contrastive Learning for Detecting
Anomalous Driving Behaviours
by
Shehroz S. Khan
et al
|
|
|
|
09-08-2021
|
Deriving Explanation of Deep Visual Saliency Models
by
Sai Phani Kumar Malladi
et al
|
|
|
|
09-10-2021
|
Emerging AI Security Threats for Autonomous Cars --
Case Studies
by
Shanthi Lekkala
et al
|
|
|
|
09-07-2021
|
DeepFakes: Detecting Forged and Synthetic Media Content
Using Machine Learning
by
Sm Zobaed
et al
|
|
|
|
09-07-2021
|
GCsT: Graph Convolutional Skeleton Transformer for
Action Recognition
by
Ruwen Bai
et al
|
|
|
|
09-07-2021
|
Journalistic Guidelines Aware News Image Captioning
by
Xuewen Yang
et al
|
|
|
|
09-07-2021
|
Capturing the objects of vision with neural networks
by
Benjamin Peters
et al
|
|
|
|
09-08-2021
|
Learning Local-Global Contextual Adaptation for Fully
End-to-End Bottom-Up Human Pose Estimation
by
Nan Xue
et al
|
|
|
|
09-10-2021
|
ReconfigISP: Reconfigurable Camera Image Processing
Pipeline
by
Ke Yu
et al
|
|
|
|
09-09-2021
|
Copy-Move Image Forgery Detection Based on Evolving
Circular Domains Coverage
by
Shilin Lu
et al
|
|
|
|
09-08-2021
|
Panoptic SegFormer
by
Zhiqi Li
et al
|
|
|
|
09-08-2021
|
Multi-Tensor Network Representation for High-Order
Tensor Completion
by
Chang Nie
et al
|
|
|
|
09-08-2021
|
Disentangling Alzheimer's disease neurodegeneration from
typical brain aging using machine learning
by
Gyujoon Hwang
et al
|
|
|
|
09-08-2021
|
LiDARTouch: Monocular metric depth estimation with a
few-beam LiDAR
by
Florent Bartoccioni
et al
|
|
|
|
09-10-2021
|
Temporally Coherent Person Matting Trained on
Fake-Motion Dataset
by
Ivan Molodetskikh
et al
|
|
|
|
09-08-2021
|
SSEGEP: Small SEGment Emphasized Performance evaluation
metric for medical image segmentation
by
Ammu R
et al
|
|
|
|
09-07-2021
|
RoadAtlas: Intelligent Platform for Automated Road
Defect Detection and Asset Management
by
Zhuoxiao Chen
et al
|
|
|
|
09-09-2021
|
ACFNet: Adaptively-Cooperative Fusion Network for RGB-D
Salient Object Detection
by
Jinchao Zhu
|
|
|
|
09-07-2021
|
Fair Comparison: Quantifying Variance in Results for
Fine-grained Visual Categorization
by
Matthew Gwilliam
et al
|
|
|
|
09-09-2021
|
Vision-and-Language or Vision-for-Language? On
Cross-Modal Influence in Multimodal Transformers
by
Stella Frank
et al
|
|
|
|
09-08-2021
|
Unsupervised clothing change adaptive person ReID
by
Ziyue Zhang
et al
|
|
|
|
09-09-2021
|
PIMNet: A Parallel, Iterative and Mimicking Network for
Scene Text Recognition
by
Zhi Qiao
et al
|
|
|
|
09-07-2021
|
Efficient ADMM-based Algorithms for Convolutional
Sparse Coding
by
Farshad G. Veshki
et al
|
|
|
|
09-07-2021
|
Learning to Discriminate Information for Online Action
Detection: Analysis and Application
by
Sumin Lee
et al
|
|
|
|
09-08-2021
|
RGB-D Salient Object Detection with Ubiquitous Target
Awareness
by
Yifan Zhao
et al
|
|
|
|
09-08-2021
|
Mask is All You Need: Rethinking Mask R-CNN for Dense
and Arbitrary-Shaped Scene Text Detection
by
Xugong Qin
et al
|
|
|
|
09-07-2021
|
Master Face Attacks on Face Recognition Systems
by
Huy H. Nguyen
et al
|
|
|
|
09-09-2021
|
CrowdDriven: A New Challenging Dataset for Outdoor
Visual Localization
by
Ara Jafarzadeh
et al
|
|
|
|
09-08-2021
|
SORNet: Spatial Object-Centric Representations for
Sequential Manipulation
by
Wentao Yuan
et al
|
|
|
|
09-09-2021
|
Reconstructing and grounding narrated instructional
videos in 3D
by
Dimitri Zhukov
et al
|
|
|
|
09-09-2021
|
Application of the Singular Spectrum Analysis on
electroluminescence images of thin-film photovoltaic
modules
by
Evgenii Sovetkin
et al
|
|
|
|
09-09-2021
|
ACP++: Action Co-occurrence Priors for Human-Object
Interaction Detection
by
Dong-Jin Kim
et al
|
|
|
|
09-09-2021
|
Towards Robust Cross-domain Image Understanding with
Unsupervised Noise Removal
by
Lei Zhu
et al
|
|
|
|
09-08-2021
|
Energy-Efficient Mobile Robot Control via Run-time
Monitoring of Environmental Complexity and Computing
Workload
by
Sherif A. S. Mohamed
et al
|
|
|
|
09-07-2021
|
YouRefIt: Embodied Reference Understanding with
Language and Gesture
by
Yixin Chen
et al
|
|
|
|
09-10-2021
|
Unsupervised Change Detection in Hyperspectral Images
using Feature Fusion Deep Convolutional Autoencoders
by
Debasrita Chakraborty
et al
|
|
|
|
09-08-2021
|
On Recognizing Occluded Faces in the Wild
by
Mustafa Ekrem Erakın
et al
|
|
|
|
09-10-2021
|
Temporal Pyramid Transformer with Multimodal
Interaction for Video Question Answering
by
Min Peng
et al
|
|
|
|
09-08-2021
|
Automated LoD-2 Model Reconstruction from
Very-High-Resolution Satellite-derived Digital Surface
Model and Orthophoto
by
Shengxi Gui
et al
|
|
|
|
09-09-2021
|
Leveraging Local Domains for Image-to-Image Translation
by
Anthony Dell'Eva
et al
|
|
|
|
09-07-2021
|
Fine-grained Hand Gesture Recognition in
Multi-viewpoint Hand Hygiene
by
Huy Q. Vo
et al
|
|
|
|
09-09-2021
|
Improving Video-Text Retrieval by Multi-Stream Corpus
Alignment and Dual Softmax Loss
by
Xing Cheng
et al
|
|
|
|
09-09-2021
|
Fine-grained Data Distribution Alignment for
Post-Training Quantization
by
Yunshan Zhong
et al
|
|
|
|
09-09-2021
|
Towards Fully Automated Segmentation of Rat Cardiac MRI
by Leveraging Deep Learning Frameworks
by
Daniel Fernandez-Llaneza
et al
|
|
|
|
09-07-2021
|
GTT-Net: Learned Generalized Trajectory Triangulation
by
Xiangyu Xu
et al
|
|
|
|
09-10-2021
|
Negative Sample Matters: A Renaissance of Metric
Learning for Temporal Grounding
by
Zhenzhi Wang
et al
|
|
|
|
09-10-2021
|
Spatio-Temporal Recurrent Networks for Event-Based
Optical Flow Estimation
by
Ziluo Ding
et al
|
|
|
|
09-09-2021
|
Efficiently Identifying Task Groupings for Multi-Task
Learning
by
Christopher Fifty
et al
|
|
|
|
09-10-2021
|
Panoptic Narrative Grounding
by
C. González
et al
|
|
|
|
09-07-2021
|
Support Vector Machine for Handwritten Character
Recognition
by
Jomy John
|
|
|
|
09-08-2021
|
Cross-Site Severity Assessment of COVID-19 from CT
Images via Domain Adaptation
by
Geng-Xin Xu
et al
|
|
|
|
09-09-2021
|
Learning Cross-Scale Visual Representations for
Real-Time Image Geo-Localization
by
Tianyi Zhang
et al
|
|
|
|
09-07-2021
|
Knowledge Distillation Using Hierarchical
Self-Supervision Augmented Distribution
by
Chuanguang Yang
et al
|
|
|
|
09-09-2021
|
Continuous Event-Line Constraint for Closed-Form
Velocity Initialization
by
Peng Xin
et al
|
|
|
|
09-09-2021
|
Deep Hough Voting for Robust Global Registration
by
Junha Lee
et al
|
|
|
|
09-09-2021
|
S3G-ARM: Highly Compressive Visual Self-localization
from Sequential Semantic Scene Graph Using Absolute and
Relative Measurements
by
Mitsuki Yoshida
et al
|
|
|
|
09-09-2021
|
Self Supervision to Distillation for Long-Tailed Visual
Recognition
by
Tianhao Li
et al
|
|
|
|
09-09-2021
|
M5Product: A Multi-modal Pretraining Benchmark for
E-commercial Product Downstream Tasks
by
Xiao Dong
et al
|
|
|
|
09-08-2021
|
Tactile Image-to-Image Disentanglement of Contact
Geometry from Motion-Induced Shear
by
Anupam K. Gupta
et al
|
|
|
|
09-08-2021
|
Level Set Binocular Stereo with Occlusions
by
Jialiang Wang
et al
|
|
|
|
09-08-2021
|
Recalibrating the KITTI Dataset Camera Setup for
Improved Odometry Accuracy
by
Igor Cvišić
et al
|
|
|
|
09-08-2021
|
Elastic Significant Bit Quantization and Acceleration
for Deep Neural Networks
by
Cheng Gong
et al
|
|
|
|
09-08-2021
|
Matching in the Dark: A Dataset for Matching Image
Pairs of Low-light Scenes
by
W. Song
et al
|
|
|
|
09-10-2021
|
LibFewShot: A Comprehensive Library for Few-shot
Learning
by
Wenbin Li
et al
|
|
|
|
09-07-2021
|
MRI Reconstruction Using Deep Energy-Based Model
by
Yu Guan
et al
|
|
|
|
09-07-2021
|
FDA: Feature Decomposition and Aggregation for Robust
Airway Segmentation
by
Minghui Zhang
et al
|
|
|
|
09-09-2021
|
Energy Attack: On Transferring Adversarial Examples
by
Ruoxi Shi
et al
|
|
|
|