publications

Publications in reverse chronological order.

2023

  1. Fast Asymptotically Optimal Algorithms for Non Parametric Stochastic Bandits
    Rémy Degenne, Dorian Baudry, and Odalric-Ambrym Maillard
    In Thirty-Seventh Conference on Neural Information Processing Systems 2023
  2. Logarithmic Regret in Communicating MDPs: Leveraging Known Dynamics with Bandits
    Odalric-Ambrym Maillard, Hassan Saber, and Mohammad Sadegh Talebi
    In Asian Conference on Machine Learning 2023

2022

  1. IMED-RL: Regret optimal learning of ergodic Markov decision processes
    Fabien Pesquerel and Odalric-Ambrym Maillard
    In Thirty-Sixth Conference on Neural Information Processing Systems 2022

2021

  1. Stochastic bandits with groups of similar arms
    Fabien Pesquerel, Hassan Saber, and Odalric-Ambrym Maillard
    In Advances in Neural Information Processing Systems 2021