2021.9.13 Vision papers

09-09-2021	IICNet: A Generic Framework for Reversible Image Conversion by Ka Leong Cheng et al
09-07-2021	PP-OCRv2: Bag of Tricks for Ultra Lightweight OCR System by Yuning Du et al
09-07-2021	Sensor-Augmented Egocentric-Video Captioning with Dynamic Modal Attention by Katsuyuki Nakamura et al
09-08-2021	OSSR-PID: One-Shot Symbol Recognition in P&ID Sheets using Path Sampling and GCN by Shubham Paliwal et al
09-09-2021	TxT: Crossmodal End-to-End Learning with Transformers by Jan-Martin O. Steitz et al
09-09-2021	Talk-to-Edit: Fine-Grained Facial Editing via Dialog by Yuming Jiang et al
09-09-2021	UCTransNet: Rethinking the Skip Connections in U-Net from a Channel-wise Perspective with Transformer by Haonan Wang et al
09-07-2021	Perceptual Learned Video Compression with Recurrent Conditional GAN by Ren Yang et al
09-07-2021	Brand Label Albedo Extraction of eCommerce Products using Generative Adversarial Network by Suman Sapkota et al
09-08-2021	Toward Real-World Super-Resolution via Adaptive Downsampling Models by Sanghyun Son et al
09-08-2021	Unfolding Taylors Approximations for Image Restoration by Man Zhou et al
09-07-2021	Multi-Branch Deep Radial Basis Function Networks for Facial Emotion Recognition by Fernanda Hernández-Luquin et al
09-07-2021	ICCAD Special Session Paper: Quantum-Classical Hybrid Machine Learning for Image Classification by Mahabubul Alam et al
09-07-2021	nnFormer: Interleaved Transformer for Volumetric Segmentation by Hong-Yu Zhou et al
09-08-2021	Which and Where to Focus: A Simple yet Accurate Framework for Arbitrary-Shaped Nearby Text Detection in Scene Images by Youhui Guo et al
09-09-2021	Per Garment Capture and Synthesis for Real-time Virtual Try-on by Toby Chong et al
09-07-2021	Learning Fast Sample Re-weighting Without Reward Data by Zizhao Zhang et al
09-09-2021	Tiny CNN for feature point description for document analysis: approach and dataset by A. Sheshkus et al
09-09-2021	Multilingual Audio-Visual Smartphone Dataset And Evaluation by Hareesh Mandalapu et al
09-07-2021	Self-supervised Tumor Segmentation through Layer Decomposition by Xiaoman Zhang et al
09-08-2021	Egocentric View Hand Action Recognition by Leveraging Hand Surface and Hand Grasp Type by Sangpil Kim et al
09-08-2021	FIDNet: LiDAR Point Cloud Semantic Segmentation with Fully Interpolation Decoding by Yiming Zhao et al
09-08-2021	Temporal RoI Align for Video Object Recognition by Tao Gong et al
09-08-2021	FaceCook: Face Generation Based on Linear Scaling Factors by Tianren Wang et al
09-07-2021	Rethinking Common Assumptions to Mitigate Racial Bias in Face Recognition Datasets by Matthew Gwilliam et al
09-10-2021	Residual 3D Scene Flow Learning with Context-Aware Feature Extraction by Guangming Wang et al
09-07-2021	Unpaired Adversarial Learning for Single Image Deraining with Rain-Space Contrastive Constraints by Xiang Chen et al
09-07-2021	FuseFormer: Fusing Fine-Grained Information in Transformers for Video Inpainting by Rui Liu et al
09-07-2021	Smart Traffic Monitoring System using Computer Vision and Edge Computing by Guanxiong Liu et al
09-09-2021	PhysGNN: A Physics-Driven Graph Neural Network Based Model for Predicting Soft Tissue Deformation in Image-Guided Neurosurgery by Yasmin Salehi et al
09-07-2021	Fishr: Invariant Gradient Variances for Out-of-distribution Generalization by Alexandre Rame et al
09-09-2021	ErfAct: Non-monotonic smooth trainable Activation Functions by Koushik Biswas et al
09-07-2021	Evaluation of an Audio-Video Multimodal Deepfake Dataset using Unimodal and Multimodal Detectors by Hasam Khalid et al
09-07-2021	Grassmannian Graph-attentional Landmark Selection for Domain Adaptation by Bin Sun et al
09-07-2021	Rendezvous: Attention Mechanisms for the Recognition of Surgical Action Triplets in Endoscopic Videos by Chinedu Innocent Nwoye et al
09-08-2021	Time Alignment using Lip Images for Frame-based Electrolaryngeal Voice Conversion by Yi-Syuan Liou et al
09-08-2021	Adaptive Few-Shot Learning PoC Ultrasound COVID-19 Diagnostic System by Michael Karnes et al
09-09-2021	EVOQUER: Enhancing Temporal Grounding with Video-Pivoted BackQuery Generation by Yanjun Gao et al
09-07-2021	Melatect: A Machine Learning Model Approach For Identifying Malignant Melanoma in Skin Growths by Vidushi Meel et al
09-08-2021	Pose-guided Inter- and Intra-part Relational Transformer for Occluded Person Re-Identification by Zhongxing Ma et al
09-08-2021	Shuffled Patch-Wise Supervision for Presentation Attack Detection by Alperen Kantarcı et al
09-09-2021	NEAT: Neural Attention Fields for End-to-End Autonomous Driving by Kashyap Chitta et al
09-10-2021	Automatic Displacement and Vibration Measurement in Laboratory Experiments with A Deep Learning Method by Yongsheng Bai et al
09-08-2021	Scaled ReLU Matters for Training Vision Transformers by Pichao Wang et al
09-08-2021	fastMRI+: Clinical Pathology Annotations for Knee and Brain Fully Sampled Multi-Coil MRI Data by Ruiyang Zhao et al
09-09-2021	Fair Conformal Predictors for Applications in Medical Imaging by Charles Lu et al
09-10-2021	PIP: Physical Interaction Prediction via Mental Imagery with Span Selection by Jiafei Duan et al
09-08-2021	Improving Building Segmentation for Off-Nadir Satellite Imagery by Hanxiang Hao et al
09-10-2021	EfficientCLIP: Efficient Cross-Modal Pre-training by Ensemble Confident Learning and Language Modeling by Jue Wang et al
09-10-2021	Face-NMS: A Core-set Selection Approach for Efficient Face Recognition by Yunze Chen et al
09-07-2021	CovarianceNet: Conditional Generative Model for Correct Covariance Prediction in Human Motion Prediction by Aleksey Postnikov et al
09-09-2021	HSMD: An object motion detection algorithm using a Hybrid Spiking Neural Network Architecture by Pedro Machado et al
09-07-2021	Learning to Combine the Modalities of Language and Video for Temporal Moment Localization by Jungkyoo Shin et al
09-10-2021	TADA: Taxonomy Adaptive Domain Adaptation by Rui Gong et al
09-10-2021	View Blind-spot as Inpainting: Self-Supervised Denoising with Mask Guided Residual Convolution by Yuhongze Zhou et al
09-10-2021	Mesh convolutional neural networks for wall shear stress estimation in 3D artery models by Julian Suk et al
09-07-2021	Resolving gas bubbles ascending in liquid metal from low-SNR neutron radiography images by Mihails Birjukovs et al
09-09-2021	Automatic Portrait Video Matting via Context Motion Network by Qiqi Hou et al
09-10-2021	Line as a Visual Sentence: Context-aware Line Descriptor for Visual Localization by Sungho Yoon et al
09-08-2021	Panoptic nuScenes: A Large-Scale Benchmark for LiDAR Panoptic Segmentation and Tracking by Whye Kit Fong et al
09-09-2021	Dynamic Modeling of Hand-Object Interactions via Tactile Sensing by Qiang Zhang et al
09-09-2021	Preservational Learning Improves Self-supervised Medical Image Models by Reconstructing Diverse Contexts by Hong-Yu Zhou et al
09-09-2021	Object recognition for robotics from tactile time series data utilising different neural network architectures by Wolfgang Bottcher et al
09-09-2021	Taming Self-Supervised Learning for Presentation Attack Detection: In-Image De-Folding and Out-of-Image De-Mixing by Haozhe Liu et al
09-08-2021	Axial multi-layer perceptron architecture for automatic segmentation of choroid plexus in multiple sclerosis by Marius Schmidt-Mengin et al
09-07-2021	Improving Phenotype Prediction using Long-Range Spatio-Temporal Dynamics of Functional Connectivity by Simon Dahan et al
09-08-2021	Identification of Social-Media Platform of Videos through the Use of Shared Features by Luca Maiano et al
09-10-2021	An Empirical Study of GPT-3 for Few-Shot Knowledge-Based VQA by Zhengyuan Yang et al
09-09-2021	ConvMLP: Hierarchical Convolutional MLPs for Vision by Jiachen Li et al
09-08-2021	Digitize-PID: Automatic Digitization of Piping and Instrumentation Diagrams by Shubham Paliwal et al
09-09-2021	Towards Transferable Adversarial Attacks on Vision Transformers by Zhipeng Wei et al
09-10-2021	Detection of GAN-synthesized street videos by Omran Alamayreh et al
09-09-2021	Single Image 3D Object Estimation with Primitive Graph Networks by Qian He et al
09-08-2021	Improving Deep Metric Learning by Divide and Conquer by Artsiom Sanakoyeu et al
09-07-2021	Simple Video Generation using Neural ODEs by David Kanaa et al
09-07-2021	Self-Supervised Representation Learning using Visual Field Expansion on Digital Pathology by Joseph Boyd et al
09-07-2021	Certifiable Outlier-Robust Geometric Perception: Exact Semidefinite Relaxations and Scalable Global Optimization by Heng Yang et al
09-09-2021	IFBiD: Inference-Free Bias Detection by Ignacio Serna et al
09-10-2021	Saliency Guided Experience Packing for Replay in Continual Learning by Gobinda Saha et al
09-09-2021	Neural-IMLS: Learning Implicit Moving Least-Squares for Surface Reconstruction from Unoriented Point clouds by Zixiong Wang et al
09-09-2021	Is Attention Better Than Matrix Decomposition? by Zhengyang Geng et al
09-08-2021	Modified Supervised Contrastive Learning for Detecting Anomalous Driving Behaviours by Shehroz S. Khan et al
09-08-2021	Deriving Explanation of Deep Visual Saliency Models by Sai Phani Kumar Malladi et al
09-10-2021	Emerging AI Security Threats for Autonomous Cars -- Case Studies by Shanthi Lekkala et al
09-07-2021	DeepFakes: Detecting Forged and Synthetic Media Content Using Machine Learning by Sm Zobaed et al
09-07-2021	GCsT: Graph Convolutional Skeleton Transformer for Action Recognition by Ruwen Bai et al
09-07-2021	Journalistic Guidelines Aware News Image Captioning by Xuewen Yang et al
09-07-2021	Capturing the objects of vision with neural networks by Benjamin Peters et al
09-08-2021	Learning Local-Global Contextual Adaptation for Fully End-to-End Bottom-Up Human Pose Estimation by Nan Xue et al
09-10-2021	ReconfigISP: Reconfigurable Camera Image Processing Pipeline by Ke Yu et al
09-09-2021	Copy-Move Image Forgery Detection Based on Evolving Circular Domains Coverage by Shilin Lu et al
09-08-2021	Panoptic SegFormer by Zhiqi Li et al
09-08-2021	Multi-Tensor Network Representation for High-Order Tensor Completion by Chang Nie et al
09-08-2021	Disentangling Alzheimers disease neurodegeneration from typical brain aging using machine learning by Gyujoon Hwang et al
09-08-2021	LiDARTouch: Monocular metric depth estimation with a few-beam LiDAR by Florent Bartoccioni et al
09-10-2021	Temporally Coherent Person Matting Trained on Fake-Motion Dataset by Ivan Molodetskikh et al
09-08-2021	SSEGEP: Small SEGment Emphasized Performance evaluation metric for medical image segmentation by Ammu R et al
09-07-2021	RoadAtlas: Intelligent Platform for Automated Road Defect Detection and Asset Management by Zhuoxiao Chen et al
09-09-2021	ACFNet: Adaptively-Cooperative Fusion Network for RGB-D Salient Object Detection by Jinchao Zhu
09-07-2021	Fair Comparison: Quantifying Variance in Resultsfor Fine-grained Visual Categorization by Matthew Gwilliam et al
09-09-2021	Vision-and-Language or Vision-for-Language? On Cross-Modal Influence in Multimodal Transformers by Stella Frank et al
09-08-2021	Unsupervised clothing change adaptive person ReID by Ziyue Zhang et al
09-09-2021	PIMNet: A Parallel, Iterative and Mimicking Network for Scene Text Recognition by Zhi Qiao et al
09-07-2021	Efficient ADMM-based Algorithms for Convolutional Sparse Coding by Farshad G. Veshki et al
09-07-2021	Learning to Discriminate Information for Online Action Detection: Analysis and Application by Sumin Lee et al
09-08-2021	RGB-D Salient Object Detection with Ubiquitous Target Awareness by Yifan Zhao et al
09-08-2021	Mask is All You Need: Rethinking Mask R-CNN for Dense and Arbitrary-Shaped Scene Text Detection by Xugong Qin et al
09-07-2021	Master Face Attacks on Face Recognition Systems by Huy H. Nguyen et al
09-09-2021	CrowdDriven: A New Challenging Dataset for Outdoor Visual Localization by Ara Jafarzadeh et al
09-08-2021	SORNet: Spatial Object-Centric Representations for Sequential Manipulation by Wentao Yuan et al
09-09-2021	Reconstructing and grounding narrated instructional videos in 3D by Dimitri Zhukov et al
09-09-2021	Application of the Singular Spectrum Analysis on electroluminescence images of thin-film photovoltaic modules by Evgenii Sovetkin et al
09-09-2021	ACP++: Action Co-occurrence Priors for Human-Object Interaction Detection by Dong-Jin Kim et al
09-09-2021	Towards Robust Cross-domain Image Understanding with Unsupervised Noise Removal by Lei Zhu et al
09-08-2021	Energy-Efficient Mobile Robot Control via Run-time Monitoring of Environmental Complexity and Computing Workload by Sherif A. S. Mohamed et al
09-07-2021	YouRefIt: Embodied Reference Understanding with Language and Gesture by Yixin Chen et al
09-10-2021	Unsupervised Change Detection in Hyperspectral Images using Feature Fusion Deep Convolutional Autoencoders by Debasrita Chakraborty et al
09-08-2021	On Recognizing Occluded Faces in the Wild by Mustafa Ekrem Erakın et al
09-10-2021	Temporal Pyramid Transformer with Multimodal Interaction for Video Question Answering by Min Peng et al
09-08-2021	Automated LoD-2 Model Reconstruction from Very-HighResolution Satellite-derived Digital Surface Model and Orthophoto by Shengxi Gui et al
09-09-2021	Leveraging Local Domains for Image-to-Image Translation by Anthony Dell'Eva et al
09-07-2021	Fine-grained Hand Gesture Recognition in Multi-viewpoint Hand Hygiene by Huy Q. Vo et al
09-09-2021	Improving Video-Text Retrieval by Multi-Stream Corpus Alignment and Dual Softmax Loss by Xing Cheng et al
09-09-2021	Fine-grained Data Distribution Alignment for Post-Training Quantization by Yunshan Zhong et al
09-09-2021	Towards Fully Automated Segmentation of Rat Cardiac MRI by Leveraging Deep Learning Frameworks by Daniel Fernandez-Llaneza et al
09-07-2021	GTT-Net: Learned Generalized Trajectory Triangulation by Xiangyu Xu et al
09-10-2021	Negative Sample Matters: A Renaissance of Metric Learning for Temporal Grounding by Zhenzhi Wang et al
09-10-2021	Spatio-Temporal Recurrent Networks for Event-Based Optical Flow Estimation by Ziluo Ding et al
09-09-2021	Efficiently Identifying Task Groupings for Multi-Task Learning by Christopher Fifty et al
09-10-2021	Panoptic Narrative Grounding by C. González et al
09-07-2021	Support Vector Machine for Handwritten Character Recognition by Jomy John
09-08-2021	Cross-Site Severity Assessment of COVID-19 from CT Images via Domain Adaptation by Geng-Xin Xu et al
09-09-2021	Learning Cross-Scale Visual Representations for Real-Time Image Geo-Localization by Tianyi Zhang et al
09-07-2021	Knowledge Distillation Using Hierarchical Self-Supervision Augmented Distribution by Chuanguang Yang et al
09-09-2021	Continuous Event-Line Constraint for Closed-Form Velocity Initialization by Peng Xin et al
09-09-2021	Deep Hough Voting for Robust Global Registration by Junha Lee et al
09-09-2021	S3G-ARM: Highly Compressive Visual Self-localization from Sequential Semantic Scene Graph Using Absolute and Relative Measurements by Mitsuki Yoshida et al
09-09-2021	Self Supervision to Distillation for Long-Tailed Visual Recognition by Tianhao Li et al
09-09-2021	M5Product: A Multi-modal Pretraining Benchmark for E-commercial Product Downstream Tasks by Xiao Dong et al
09-08-2021	Tactile Image-to-Image Disentanglement of Contact Geometry from Motion-Induced Shear by Anupam K. Gupta et al
09-08-2021	Level Set Binocular Stereo with Occlusions by Jialiang Wang et al
09-08-2021	Recalibrating the KITTI Dataset Camera Setup for Improved Odometry Accuracy by Igor Cvišić et al
09-08-2021	Elastic Significant Bit Quantization and Acceleration for Deep Neural Networks by Cheng Gong et al
09-08-2021	Matching in the Dark: A Dataset for Matching Image Pairs of Low-light Scenes by W. Song et al
09-10-2021	LibFewShot: A Comprehensive Library for Few-shot Learning by Wenbin Li et al
09-07-2021	MRI Reconstruction Using Deep Energy-Based Model by Yu Guan et al
09-07-2021	FDA: Feature Decomposition and Aggregation for Robust Airway Segmentation by Minghui Zhang et al
09-09-2021	Energy Attack: On Transferring Adversarial Examples by Ruoxi Shi et al

Craig SmithSeptember 13, 2021