Fleet Supervisor Allocation: A Submodular Maximization Approach

Oguzhan Akcin; Ahmet Ege Tanriverdi; Kaan Kale; Sandeep P. Chinchali

Fleet Supervisor Allocation: A Submodular Maximization Approach

Oguzhan Akcin, Ahmet Ege Tanriverdi, Kaan Kale, Sandeep P. Chinchali

Proceedings of The 8th Conference on Robot Learning, PMLR 270:4603-4630, 2025.

Abstract

In real-world scenarios, the data collected by robots in diverse and unpredictable environments is crucial for enhancing their perception and decision-making models. This data is predominantly collected under human supervision, particularly through imitation learning (IL), where robots learn complex tasks by observing human supervisors. However, the deployment of multiple robots and supervisors to accelerate the learning process often leads to data redundancy and inefficiencies, especially as the scale of robot fleets increases. Moreover, the reliance on teleoperation for supervision introduces additional challenges due to potential network connectivity issues. To address these issues in data collection, we introduce an Adaptive Submodular Allocation policy, ASA, designed for efficient human supervision allocation within multi-robot systems under uncertain connectivity conditions. Our approach reduces data redundancy by balancing the informativeness and diversity of data collection, and is capable of accommodating connectivity variances. We evaluate the effectiveness of ASA in simulations with 100 robots across four different environments and various network settings, including a real-world teleoperation scenario over a 5G network. We train and test our policy, ASA, and state-of-the-art policies utilizing NVIDIA’s Isaac Gym. Our results show that ASA enhances the return on human effort by up to $3.37\times$, outperforming current baselines in all simulated scenarios and providing robustness against connectivity disruptions.

Cite this Paper

BibTeX

@InProceedings{pmlr-v270-akcin25a,
  title = 	 {Fleet Supervisor Allocation: A Submodular Maximization Approach},
  author =       {Akcin, Oguzhan and Tanriverdi, Ahmet Ege and Kale, Kaan and Chinchali, Sandeep P.},
  booktitle = 	 {Proceedings of The 8th Conference on Robot Learning},
  pages = 	 {4603--4630},
  year = 	 {2025},
  editor = 	 {Agrawal, Pulkit and Kroemer, Oliver and Burgard, Wolfram},
  volume = 	 {270},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {06--09 Nov},
  publisher =    {PMLR},
  pdf = 	 {https://raw.githubusercontent.com/mlresearch/v270/main/assets/akcin25a/akcin25a.pdf},
  url = 	 {https://proceedings.mlr.press/v270/akcin25a.html},
  abstract = 	 {In real-world scenarios, the data collected by robots in diverse and unpredictable environments is crucial for enhancing their perception and decision-making models. This data is predominantly collected under human supervision, particularly through imitation learning (IL), where robots learn complex tasks by observing human supervisors. However, the deployment of multiple robots and supervisors to accelerate the learning process often leads to data redundancy and inefficiencies, especially as the scale of robot fleets increases. Moreover, the reliance on teleoperation for supervision introduces additional challenges due to potential network connectivity issues. To address these issues in data collection, we introduce an Adaptive Submodular Allocation policy, ASA, designed for efficient human supervision allocation within multi-robot systems under uncertain connectivity conditions. Our approach reduces data redundancy by balancing the informativeness and diversity of data collection, and is capable of accommodating connectivity variances. We evaluate the effectiveness of ASA in simulations with 100 robots across four different environments and various network settings, including a real-world teleoperation scenario over a 5G network. We train and test our policy, ASA, and state-of-the-art policies utilizing NVIDIA’s Isaac Gym. Our results show that ASA enhances the return on human effort by up to $3.37\times$, outperforming current baselines in all simulated scenarios and providing robustness against connectivity disruptions.}
}

Endnote

%0 Conference Paper
%T Fleet Supervisor Allocation: A Submodular Maximization Approach
%A Oguzhan Akcin
%A Ahmet Ege Tanriverdi
%A Kaan Kale
%A Sandeep P. Chinchali
%B Proceedings of The 8th Conference on Robot Learning
%C Proceedings of Machine Learning Research
%D 2025
%E Pulkit Agrawal
%E Oliver Kroemer
%E Wolfram Burgard	
%F pmlr-v270-akcin25a
%I PMLR
%P 4603--4630
%U https://proceedings.mlr.press/v270/akcin25a.html
%V 270
%X In real-world scenarios, the data collected by robots in diverse and unpredictable environments is crucial for enhancing their perception and decision-making models. This data is predominantly collected under human supervision, particularly through imitation learning (IL), where robots learn complex tasks by observing human supervisors. However, the deployment of multiple robots and supervisors to accelerate the learning process often leads to data redundancy and inefficiencies, especially as the scale of robot fleets increases. Moreover, the reliance on teleoperation for supervision introduces additional challenges due to potential network connectivity issues. To address these issues in data collection, we introduce an Adaptive Submodular Allocation policy, ASA, designed for efficient human supervision allocation within multi-robot systems under uncertain connectivity conditions. Our approach reduces data redundancy by balancing the informativeness and diversity of data collection, and is capable of accommodating connectivity variances. We evaluate the effectiveness of ASA in simulations with 100 robots across four different environments and various network settings, including a real-world teleoperation scenario over a 5G network. We train and test our policy, ASA, and state-of-the-art policies utilizing NVIDIA’s Isaac Gym. Our results show that ASA enhances the return on human effort by up to $3.37\times$, outperforming current baselines in all simulated scenarios and providing robustness against connectivity disruptions.

APA

Akcin, O., Tanriverdi, A.E., Kale, K. & Chinchali, S.P.. (2025). Fleet Supervisor Allocation: A Submodular Maximization Approach. Proceedings of The 8th Conference on Robot Learning, in Proceedings of Machine Learning Research 270:4603-4630 Available from https://proceedings.mlr.press/v270/akcin25a.html.

Fleet Supervisor Allocation: A Submodular Maximization Approach

Abstract

Cite this Paper

Related Material