Recent Changes

Monday, April 25

  1. page Lecture Notes and references edited ... Application of Optimistic tree-search algorithm UCT to Computer Go Lecture 21-22 (Wed April 6…
    ...
    Application of Optimistic tree-search algorithm UCT to Computer Go
    Lecture 21-22 (Wed April 6, Mon April 11): UCRL2 algorithm.
    ...
    {UCRL2slidesComplete.pdf} (Complete slide deck for
    ...
    part I (April{Lecture 21.pdf} (April 6) Algorithm
    ...
    part II (April{Lecture 22.pdf} (April 11) Regret
    Main Reference
    Near-optimal Regret Bounds for Reinforcement Learning [ pdf {jaksch10aJournal.pdf} ]
    (view changes)
    1:25 pm
  2. file Lecture 21.pdf uploaded
    1:23 pm
  3. file Lecture 22.pdf uploaded
    1:23 pm
  4. page Lecture Notes and references edited ... Remi Munos. From Bandits to Monte-Carlo Tree Search: The Optimistic Principle Applied to Opti…
    ...
    Remi Munos. From Bandits to Monte-Carlo Tree Search: The Optimistic Principle Applied to Optimization and Planning. 2014. <hal-00747575v5> [ pdf {MCTSMunos.pdf} ]
    Other related references
    ...
    by Levente Kocsis.
    Csaba
    Kocsis, Csaba Szepesvári, ECML
    UCT algorithm: first optimistic (UCB like) approach to Monte Carlo Tree Search
    Modification of UCT with Patterns in Monte-Carlo Go.
    ...
    Lecture 21-22 (Wed April 6, Mon April 11): UCRL2 algorithm.
    Lecture Slides {UCRL2slidesComplete.pdf} (Complete deck for two lectures: April 6 and 11, includes algorithm description and regret analysis)
    Lecture notes part I (April 6) Algorithm description
    Lecture notes part II (April 11) Regret analysis

    Main Reference
    Near-optimal Regret Bounds for Reinforcement Learning [ pdf {jaksch10aJournal.pdf} ]
    (view changes)
    1:22 pm

Tuesday, April 19

  1. page Lecture Notes and references edited Lecture 23 23-24 (Wed April 13): 13, Mon April 18): X-armed bandits, ... Lecture Slides {M…
    Lecture 2323-24 (Wed April 13):13, Mon April 18): X-armed bandits,
    ...

    Lecture Slides {MTCSSlidesPartI.pdf}{XarmedMTCS slides complete.pdf}
    Figures in these slides were copied from Munos, 2014 (reference below).
    Main Reference
    (view changes)
    10:00 am
  2. page Lecture Notes and references edited ... Lecture 4 (Monday Feb 1, 2016): Thompson Sampling Lecture Notes {Lecture 4.pdf} References…
    ...
    Lecture 4 (Monday Feb 1, 2016): Thompson Sampling
    Lecture Notes {Lecture 4.pdf}
    References
    S. Agrawal, N. Goyal, ”Further optimal regret bounds for Thompson Sampling”, In Proceedings of the 16th International Conference on Artificial Intelligence and Statistics (AISTATS), 2013

    Lecture 3 (Wednesday Jan 27, 2016) : UCB algorithm, lower bounds
    Lecture notes {Lecture 3 part 1.pdf} (Last edited on 1/31/2016)
    (view changes)
    2:48 am

Monday, April 18

  1. page home edited Announcements 4/1/2016 4/18/2016 Here {ProjectPresentationsSlots.pdf} is the final schedule…

    Announcements
    4/1/2016 4/18/2016
    Here {ProjectPresentationsSlots.pdf} is the final schedule for project presentations. Please check.
    4/1/2016

    The deadline for submitting project report is updated, it is May 2, in class.
    3/24/2016 Sign up for a project presentation slot here
    (view changes)
    8:05 am

Friday, April 15

  1. file MCTS-Zooming.pdf (deleted) uploaded Deleted File
    1:10 pm

More