검색 상세

On Supervised Online Rolling-Horizon Control for Infinite-Horizon Discounted Markov Decision Processes