The talk will be based on the paper "Policy Optimization for Robust Average Cost MDPs", 38th Conference on Neural Information Processing Systems (NeurIPS 2024).
Meeting ID: 251 152 4038 Passcode: 276466