Reinforcement Learning blog

Satinder Singh started a new blog named Reinforcement Learning blog. Now, I can see that Michael Littman is one of the authors of the blog – though he hasn’t published any post yet. This is a good news for RL. Good luck to them!

This entry was posted in miscellaneous. Bookmark the permalink.

2 Responses to Reinforcement Learning blog

  1. Hi,

    I wanted to leave a comment to respond to your comment about the “rationality” of minimax-Q, but comments are closed at http://thesilog.sologen.net/?p=76, so I decided to leave my comment here. :-)

    The “non-rationality” of minimax-Q follows from Bowling and Veloso’s (idiosyncratic?) definition of rationality. Specifically, they define it to be converging to best response against a stationary strategy (even a suboptimal one). Minimax-Q actually ignores the opponent’s strategy and assumes a worst-case opponent, so, indeed minimax-Q fails to satisfy their definition.

    Your alternative definition is interesting. It says that a “rational” learning algorithm should adopt a best response to any Nash-equilibrium opponent. In zero-sum games (which is where minimax-Q makes the most sense), this definition is equivalent to saying that the learner should adopt a minimax policy. Of course, that’s exactly what minimax-Q does, so it passes your rationality test in this case.

    -Michael

  2. Amir massoud Farahmand says:

    Thanks Michael for your clarification!
    I have ignored this blog for a long time.

Leave a Reply

Your email address will not be published. Required fields are marked *

You may use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>