System-Agnostic Meta-Learning for MDP-based Dynamic Scheduling via Descriptive Policy

Hyun-Suk Lee
Proceedings of The 25th International Conference on Artificial Intelligence and Statistics, PMLR 151:169-187, 2022.

Abstract

Dynamic scheduling is an important problem in applications from queuing to wireless networks. It addresses how to choose an item among multiple scheduling items in each timestep to achieve a long-term goal. Most of the conventional approaches for dynamic scheduling find the optimal policy for a given specific system so that the policy from these approaches is usable only for the corresponding system characteristics. Hence, it is hard to use such approaches for a practical system in which system characteristics dynamically change. This paper proposes a novel policy structure for MDP-based dynamic scheduling, a descriptive policy, which has a system-agnostic capability to adapt to unseen system characteristics for an identical task (dynamic scheduling). To this end, the descriptive policy learns a system-agnostic scheduling principle–in a nutshell, “which condition of items should have a higher priority in scheduling”. The scheduling principle can be applied to any system so that the descriptive policy learned in one system can be used for another system. Experiments with simple explanatory and realistic application scenarios demonstrate that it enables system-agnostic meta-learning with very little performance degradation.

Cite this Paper


BibTeX
@InProceedings{pmlr-v151-lee22a, title = { System-Agnostic Meta-Learning for MDP-based Dynamic Scheduling via Descriptive Policy }, author = {Lee, Hyun-Suk}, booktitle = {Proceedings of The 25th International Conference on Artificial Intelligence and Statistics}, pages = {169--187}, year = {2022}, editor = {Camps-Valls, Gustau and Ruiz, Francisco J. R. and Valera, Isabel}, volume = {151}, series = {Proceedings of Machine Learning Research}, month = {28--30 Mar}, publisher = {PMLR}, pdf = {https://proceedings.mlr.press/v151/lee22a/lee22a.pdf}, url = {https://proceedings.mlr.press/v151/lee22a.html}, abstract = { Dynamic scheduling is an important problem in applications from queuing to wireless networks. It addresses how to choose an item among multiple scheduling items in each timestep to achieve a long-term goal. Most of the conventional approaches for dynamic scheduling find the optimal policy for a given specific system so that the policy from these approaches is usable only for the corresponding system characteristics. Hence, it is hard to use such approaches for a practical system in which system characteristics dynamically change. This paper proposes a novel policy structure for MDP-based dynamic scheduling, a descriptive policy, which has a system-agnostic capability to adapt to unseen system characteristics for an identical task (dynamic scheduling). To this end, the descriptive policy learns a system-agnostic scheduling principle–in a nutshell, “which condition of items should have a higher priority in scheduling”. The scheduling principle can be applied to any system so that the descriptive policy learned in one system can be used for another system. Experiments with simple explanatory and realistic application scenarios demonstrate that it enables system-agnostic meta-learning with very little performance degradation. } }
Endnote
%0 Conference Paper %T System-Agnostic Meta-Learning for MDP-based Dynamic Scheduling via Descriptive Policy %A Hyun-Suk Lee %B Proceedings of The 25th International Conference on Artificial Intelligence and Statistics %C Proceedings of Machine Learning Research %D 2022 %E Gustau Camps-Valls %E Francisco J. R. Ruiz %E Isabel Valera %F pmlr-v151-lee22a %I PMLR %P 169--187 %U https://proceedings.mlr.press/v151/lee22a.html %V 151 %X Dynamic scheduling is an important problem in applications from queuing to wireless networks. It addresses how to choose an item among multiple scheduling items in each timestep to achieve a long-term goal. Most of the conventional approaches for dynamic scheduling find the optimal policy for a given specific system so that the policy from these approaches is usable only for the corresponding system characteristics. Hence, it is hard to use such approaches for a practical system in which system characteristics dynamically change. This paper proposes a novel policy structure for MDP-based dynamic scheduling, a descriptive policy, which has a system-agnostic capability to adapt to unseen system characteristics for an identical task (dynamic scheduling). To this end, the descriptive policy learns a system-agnostic scheduling principle–in a nutshell, “which condition of items should have a higher priority in scheduling”. The scheduling principle can be applied to any system so that the descriptive policy learned in one system can be used for another system. Experiments with simple explanatory and realistic application scenarios demonstrate that it enables system-agnostic meta-learning with very little performance degradation.
APA
Lee, H.. (2022). System-Agnostic Meta-Learning for MDP-based Dynamic Scheduling via Descriptive Policy . Proceedings of The 25th International Conference on Artificial Intelligence and Statistics, in Proceedings of Machine Learning Research 151:169-187 Available from https://proceedings.mlr.press/v151/lee22a.html.

Related Material