Personal profile
-
Offline RL in Regular Decision Processes: Sample Efficiency via Language Metrics
Deb, A., Cipollone, R., Jonsson, A., Ronca, A. & Talebi, M. S., 2025. 25 p.
Research output: Contribution to conference › Paper › Research
Open Access, File, 40 Downloads (Pure)
-
Differentially Private No-regret Exploration in Adversarial Markov Decision Processes
Bai, S., Zeng, L., Zhao, C., Duan, X., Talebi, M. S., Cheng, P. & Chen, J., 2024, Proceedings of the 40th Conference on Uncertainty in Artificial Intelligence (UAI 2024). PMLR, Vol. 244. p. 235-272 38 p. (Proceedings of Machine Learning Research, Vol. 244).
Research output: Chapter in Book/Report/Conference proceeding › Article in proceedings › Research › peer-review
Open Access, File, 24 Downloads (Pure)
-
Logarithmic regret in communicating MDPs: Leveraging known dynamics with bandits
Saber, H., Pesquerel, F., Maillard, O.-A. & Talebi, M. S., 2024, Proceedings of the 15th Asian Conference on Machine Learning. PMLR, p. 1167-1182 (Proceedings of Machine Learning Research, Vol. 222).
Research output: Chapter in Book/Report/Conference proceeding › Article in proceedings › Research › peer-review
Open Access, File, 31 Downloads (Pure)
-
Scaling Power Management in Cloud Data Centers: A Multi-Level Continuous-Time MDP Approach
Chitsaz, B., Khonsari, A., Moradian, M., Dadlani, A. & Talebi, M. S., 2024, In: IEEE Transactions on Services Computing. 17, 4, p. 1753-1765 12 p.
Research output: Contribution to journal › Journal article › Research › peer-review
Open Access, File, 1 Citation (Scopus), 26 Downloads (Pure)
-
Double Graph Attention Networks for Visual Semantic Navigation
Lyu, Y. & Talebi, M. S., 2023, In: Neural Processing Letters. 55, 7, p. 9019-9040
Research output: Contribution to journal › Journal article › Research › peer-review
Open Access, File, 5 Citations (Scopus), 18 Downloads (Pure)
-
Exploration in Reward Machines with Low Regret
Bourel, H., Jonsson, A., Maillard, O. A. & Talebi, M. S., 2023, Proceedings of The 26th International Conference on Artificial Intelligence and Statistics. PMLR, Vol. 206. p. 4114-4146 33 p. (Proceedings of Machine Learning Research, Vol. 206).
Research output: Chapter in Book/Report/Conference proceeding › Article in proceedings › Research › peer-review
Open Access, File, 7 Citations (Scopus), 75 Downloads (Pure)
-
Provably Efficient Offline Reinforcement Learning in Regular Decision Processes
Cipollone, R., Jonsson, A., Ronca, A. & Talebi, M. S., 2023, Advances in Neural Information Processing Systems 36 (NeurIPS 2023). NeurIPS Proceedings, 34 p. (Advances in Neural Information Processing Systems, Vol. 36).
Research output: Chapter in Book/Report/Conference proceeding › Article in proceedings › Research › peer-review
Open Access, File, 45 Downloads (Pure)
-
Scaling Up Q-Learning via Exploiting State–Action Equivalence
Lyu, Y., Côme, A., Zhang, Y. & Talebi, M. S., 2023, In: Entropy. 25, 4, 584.
Research output: Contribution to journal › Journal article › Research › peer-review
Open Access, File, 4 Citations (Scopus), 55 Downloads (Pure)