Cross-Domain Representation Learning for Clothes Unfolding in Robot-Assisted Dressing

Abstract

Assistive robots can significantly reduce the burden of daily activities by providing services such as unfolding clothes and dressing assistance. For robotic clothes manipulation tasks, grasping point recognition is one of the core steps, and it is usually achieved by supervised deep learning methods that require large amounts of labeled training data. Because collecting annotated real data in this field is extremely labor-intensive and time-consuming, synthetic data generated by physics engines is typically adopted for data enrichment. However, there exists an inherent discrepancy between the real and synthetic domains. It is therefore desirable to leverage synthetic data together with real data to jointly train models for grasping point recognition. In this paper, we propose a Cross-Domain Representation Learning (CDRL) framework that adaptively extracts domain-specific features from the synthetic and real domains respectively, and then fuses these domain-specific features to produce more informative and robust cross-domain representations, thereby improving the prediction accuracy of grasping points. Experimental results show that our CDRL framework recognizes grasping points more precisely than five baseline methods. Based on our CDRL framework, we enable a Baxter humanoid robot to unfold a hanging white coat with a 92% success rate and to successfully assist 6 users with dressing.
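To make the two-branch idea concrete, below is a minimal PyTorch sketch of the pattern the abstract describes: one encoder per domain extracting domain-specific features, followed by a fusion step that produces a cross-domain representation for grasping-point prediction. All module names, layer sizes, and the concatenation-plus-1x1-convolution fusion strategy are illustrative assumptions, not the paper's actual architecture.

```python
# Sketch of a CDRL-style two-branch model: domain-specific encoders for the
# real and synthetic domains, feature fusion, and a heatmap head for
# grasping-point prediction. Layer choices here are hypothetical.
import torch
import torch.nn as nn


class DomainEncoder(nn.Module):
    """Small CNN extracting domain-specific features from one domain."""

    def __init__(self, out_channels=64):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(3, 32, kernel_size=3, padding=1), nn.ReLU(),
            nn.Conv2d(32, out_channels, kernel_size=3, padding=1), nn.ReLU(),
        )

    def forward(self, x):
        return self.net(x)


class CDRLSketch(nn.Module):
    """Fuses real- and synthetic-domain features into a cross-domain
    representation, then predicts a grasping-point heatmap."""

    def __init__(self, feat_channels=64):
        super().__init__()
        self.real_encoder = DomainEncoder(feat_channels)
        self.syn_encoder = DomainEncoder(feat_channels)
        # Fuse the concatenated domain-specific features with a 1x1 conv
        # (one simple fusion choice; the paper's fusion may differ).
        self.fuse = nn.Conv2d(2 * feat_channels, feat_channels, kernel_size=1)
        # Single-channel heatmap of candidate grasping-point locations.
        self.head = nn.Conv2d(feat_channels, 1, kernel_size=1)

    def forward(self, real_img, syn_img):
        f_real = self.real_encoder(real_img)   # real-domain features
        f_syn = self.syn_encoder(syn_img)      # synthetic-domain features
        fused = torch.relu(self.fuse(torch.cat([f_real, f_syn], dim=1)))
        return self.head(fused)                # grasping-point heatmap logits


if __name__ == "__main__":
    model = CDRLSketch()
    real = torch.randn(2, 3, 128, 128)  # batch of real images
    syn = torch.randn(2, 3, 128, 128)   # batch of synthetic images
    print(model(real, syn).shape)       # torch.Size([2, 1, 128, 128])
```

The separate encoders let each branch specialize in its domain's appearance statistics, while the shared fusion and head force the model to learn a joint representation that transfers across domains.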

Publication
In European Conference on Computer Vision Workshops (ECCV 2022 Workshops)
Runyang Feng (封润洋)
PhD Student in Computer Science and Technology

Runyang Feng is currently a PhD student in the School of Artificial Intelligence at Jilin University. His research interests include 2D Human Pose Estimation, Video Understanding, Computer Vision, and Deep Learning.