Structured Bayesian Meta-Learning for Data-Efficient Visual-Tactile Model Estimation

Shaoxiong Yao; Yifan Zhu; Kris Hauser

Structured Bayesian Meta-Learning for Data-Efficient Visual-Tactile Model Estimation

Shaoxiong Yao, Yifan Zhu, Kris Hauser

Proceedings of The 8th Conference on Robot Learning, PMLR 270:3072-3093, 2025.

Abstract

Estimating visual-tactile models of deformable objects is challenging because vision suffers from occlusion, while touch data is sparse and noisy. We propose a novel data-efficient method for dense heterogeneous model estimation by leveraging experience from diverse training objects. The method is based on Bayesian Meta-Learning (BML), which can mitigate overfitting high-capacity visual-tactile models by meta-learning an informed prior and naturally achieves few-shot online estimation via posterior estimation. However, BML requires a shared parametric model across tasks but visual-tactile models for diverse objects have different parameter spaces. To address this issue, we introduce Structured Bayesian Meta-Learning (SBML) that incorporates heterogeneous physics models, enabling learning from training objects with varying appearances and geometries. SBML performs zero-shot vision-only prediction of deformable model parameters and few-shot adaptation after a handful of touches. Experiments show that in two classes of heterogeneous objects, namely plants and shoes, SBML outperforms existing approaches in force and torque prediction accuracy in zero- and few-shot settings.

Cite this Paper

BibTeX

@InProceedings{pmlr-v270-yao25a,
  title = 	 {Structured Bayesian Meta-Learning for Data-Efficient Visual-Tactile Model Estimation},
  author =       {Yao, Shaoxiong and Zhu, Yifan and Hauser, Kris},
  booktitle = 	 {Proceedings of The 8th Conference on Robot Learning},
  pages = 	 {3072--3093},
  year = 	 {2025},
  editor = 	 {Agrawal, Pulkit and Kroemer, Oliver and Burgard, Wolfram},
  volume = 	 {270},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {06--09 Nov},
  publisher =    {PMLR},
  pdf = 	 {https://raw.githubusercontent.com/mlresearch/v270/main/assets/yao25a/yao25a.pdf},
  url = 	 {https://proceedings.mlr.press/v270/yao25a.html},
  abstract = 	 {Estimating visual-tactile models of deformable objects is challenging because vision suffers from occlusion, while touch data is sparse and noisy. We propose a novel data-efficient method for dense heterogeneous model estimation by leveraging experience from diverse training objects. The method is based on Bayesian Meta-Learning (BML), which can mitigate overfitting high-capacity visual-tactile models by meta-learning an informed prior and naturally achieves few-shot online estimation via posterior estimation. However, BML requires a shared parametric model across tasks but visual-tactile models for diverse objects have different parameter spaces. To address this issue, we introduce Structured Bayesian Meta-Learning (SBML) that incorporates heterogeneous physics models, enabling learning from training objects with varying appearances and geometries. SBML performs zero-shot vision-only prediction of deformable model parameters and few-shot adaptation after a handful of touches. Experiments show that in two classes of heterogeneous objects, namely plants and shoes, SBML outperforms existing approaches in force and torque prediction accuracy in zero- and few-shot settings.}
}

Endnote

%0 Conference Paper
%T Structured Bayesian Meta-Learning for Data-Efficient Visual-Tactile Model Estimation
%A Shaoxiong Yao
%A Yifan Zhu
%A Kris Hauser
%B Proceedings of The 8th Conference on Robot Learning
%C Proceedings of Machine Learning Research
%D 2025
%E Pulkit Agrawal
%E Oliver Kroemer
%E Wolfram Burgard	
%F pmlr-v270-yao25a
%I PMLR
%P 3072--3093
%U https://proceedings.mlr.press/v270/yao25a.html
%V 270
%X Estimating visual-tactile models of deformable objects is challenging because vision suffers from occlusion, while touch data is sparse and noisy. We propose a novel data-efficient method for dense heterogeneous model estimation by leveraging experience from diverse training objects. The method is based on Bayesian Meta-Learning (BML), which can mitigate overfitting high-capacity visual-tactile models by meta-learning an informed prior and naturally achieves few-shot online estimation via posterior estimation. However, BML requires a shared parametric model across tasks but visual-tactile models for diverse objects have different parameter spaces. To address this issue, we introduce Structured Bayesian Meta-Learning (SBML) that incorporates heterogeneous physics models, enabling learning from training objects with varying appearances and geometries. SBML performs zero-shot vision-only prediction of deformable model parameters and few-shot adaptation after a handful of touches. Experiments show that in two classes of heterogeneous objects, namely plants and shoes, SBML outperforms existing approaches in force and torque prediction accuracy in zero- and few-shot settings.

APA

Yao, S., Zhu, Y. & Hauser, K.. (2025). Structured Bayesian Meta-Learning for Data-Efficient Visual-Tactile Model Estimation. Proceedings of The 8th Conference on Robot Learning, in Proceedings of Machine Learning Research 270:3072-3093 Available from https://proceedings.mlr.press/v270/yao25a.html.

Structured Bayesian Meta-Learning for Data-Efficient Visual-Tactile Model Estimation

Abstract

Cite this Paper

Related Material