publications

Publications by category, in reverse chronological order.

2024

  1. Logarithmic regret in communicating MDPs: Leveraging known dynamics with bandits
    Hassan Saber, Fabien Pesquerel, Odalric-Ambrym Maillard, and 1 more author
    In Proceedings of the 15th Asian Conference on Machine Learning, 11–14 Nov 2024

2023

  1. Fast Asymptotically Optimal Algorithms for Non-Parametric Stochastic Bandits
    Dorian Baudry, Fabien Pesquerel, Rémy Degenne, and 1 more author
    In Advances in Neural Information Processing Systems, 11–14 Nov 2023
  2. PhD thesis
    Information per unit of interaction in stochastic sequential decision making
    Fabien Pesquerel
    Dec 2023

2022

  1. IMED-RL: Regret optimal learning of ergodic Markov decision processes
    Fabien Pesquerel and Odalric-Ambrym Maillard
    In Advances in Neural Information Processing Systems, Dec 2022

2021

  1. Stochastic bandits with groups of similar arms
    Fabien Pesquerel, Hassan Saber, and Odalric-Ambrym Maillard
    In Advances in Neural Information Processing Systems, Dec 2021