Sadegh Talebi

Sadegh Talebi

  • Universitetsparken 1

    2100 København Ø

Filter
Article in proceedings

Search results

  • 2024

    Differentially Private No-regret Exploration in Adversarial Markov Decision Processes

    Bai, S., Zeng, L., Zhao, C., Duan, X., Talebi, M. S., Cheng, P. & Chen, J., 2024, Proceedings of the 40th Conference on Uncertainty in Artificial Intelligence (UAI 2024). PMLR, Vol. 244. p. 235-272 38 p. (Proceedings of Machine Learning Research, Vol. 244).

    Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

    Open Access
    File
  • Logarithmic regret in communicating MDPs: Leveraging known dynamics with bandits

    Saber, H., Pesquerel, F., Maillard, O.-A. & Talebi, M. S., 2024, Proceedings of the 15th Asian Conference on Machine Learning. PMLR, p. 1167-1182 (Proceedings of Machine Learning Research, Vol. 222).

    Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

    Open Access
    File
    10 Downloads (Pure)
  • 2023

    Exploration in Reward Machines with Low Regret

    Bourel, H., Jonsson, A., Maillard, O. A. & Talebi, M. S., 2023, Proceedings of The 26th International Conference on Artificial Intelligence and Statistics. PMLR, Vol. 206. p. 4114-4146 33 p. (Proceedings of Machine Learning Research, Vol. 206).

    Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

    Open Access
    File
    3 Citations (Scopus)
    16 Downloads (Pure)
  • Provably Efficient Offline Reinforcement Learning in Regular Decision Processes

    Cipollone, R., Jonsson, A., Ronca, A. & Talebi, M. S., 2023, Advances in Neural Information Processing Systems 36 (NeurIPS 2023). NeurIPS Proceedings, 34 p. (Advances in Neural Information Processing Systems, Vol. 36).

    Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

    Open Access
    File
    12 Downloads (Pure)
  • 2021

    Improved Exploration in Factored Average-Reward MDPs

    Talebi, S., Jonsson, A. & Maillard, O.-A., 2021, Proceedings of the 24th International Conference on Artificial Intelligence and Statistics (AISTATS). PMLR, p. 3988-3996 (Proceedings of Machine Learning Research, Vol. 130).

    Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

    Open Access
    File
    4 Citations (Scopus)
    23 Downloads (Pure)
  • 2020

    Adversarial Bandits with Corruptions

    Yang, L., Hajiesmaili, M. H., Talebi, S., Lui, J. C. S. & Wong, W. S., 2020, Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, NeurIPS 2020, December 6-12, 2020, virtua. NeurIPS Proceedings, 10 p. (Advances in Neural Information Processing Systems, Vol. 33).

    Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

    Open Access
    File
    61 Downloads (Pure)
  • Bandit-based relay selection in cooperative networks over unknown stationary channels

    Nomikos, N., Talebi, S., Wichman, R. & Charalambous, T., 2020, Proceedings of the 2020 IEEE 30th International Workshop on Machine Learning for Signal Processing, MLSP 2020. IEEE, 9231604

    Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

    Open Access
    File
    6 Citations (Scopus)
    14 Downloads (Pure)
  • Tightening Exploration in Upper Confidence Reinforcement Learning

    Bourel, H., Maillard, O. & Talebi, S., 2020, Proceedings of the 37th International Conference on Machine Learning. PMLR, p. 1056-1066 (Proceedings of Machine Learning Research, Vol. 119).

    Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

    Open Access
    File
    51 Downloads (Pure)