Active Learning and Reinforcement Learning

The problem of active learning can be viewed as a special case of reinforcement learning, as Sanjoy Dasgupta has noted. The learner's task is to learn a policy that selects which data point to label next, so as to maximize the resulting improvement in some measure of classification performance, e.g. a reduction in empirical risk, in our estimate of structural risk, or in a similar criterion.
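
To make the analogy concrete, here is a minimal sketch under stated assumptions. It is a toy, not a method taken from Dasgupta's work. The state is the pair (labeled set, unlabeled pool), the action is the point to query, and the reward is the change in accuracy after the query. The 1-D threshold classifier, the names fit_threshold, uncertainty_policy and run_episode, and the value TRUE_THRESHOLD are all hypothetical choices made for illustration. The policy is a fixed uncertainty-sampling heuristic rather than a learned one, and the reward is computed from true labels, which a real learner would have to estimate.

```python
# Toy framing of active learning as a sequential decision problem.
# Assumptions (not from the original text): 1-D inputs in [0, 1], a threshold
# classifier, and a reward equal to the accuracy gain measured with true labels.

TRUE_THRESHOLD = 0.37  # hypothetical ground truth, hidden from the learner


def oracle(x):
    """Return the true label of x (the labeling step that costs budget)."""
    return int(x >= TRUE_THRESHOLD)


def fit_threshold(labeled):
    """Midpoint between the largest labeled negative and smallest labeled positive."""
    negatives = [x for x, y in labeled if y == 0]
    positives = [x for x, y in labeled if y == 1]
    lo = max(negatives) if negatives else 0.0
    hi = min(positives) if positives else 1.0
    return (lo + hi) / 2.0


def accuracy(threshold, data):
    """Fraction of (x, y) pairs that the threshold classifier gets right."""
    return sum((x >= threshold) == bool(y) for x, y in data) / len(data)


def uncertainty_policy(state):
    """Action selection: query the unlabeled point closest to the current boundary."""
    labeled, unlabeled = state
    boundary = fit_threshold(labeled)
    return min(unlabeled, key=lambda x: abs(x - boundary))


def run_episode(policy, pool, eval_set, budget):
    """Roll out the policy for `budget` queries and record per-step rewards."""
    labeled, unlabeled = [], list(pool)
    performance = accuracy(fit_threshold(labeled), eval_set)
    rewards = []
    for _ in range(budget):
        x = policy((labeled, unlabeled))      # action chosen from the state
        unlabeled.remove(x)
        labeled.append((x, oracle(x)))        # state transition
        new_performance = accuracy(fit_threshold(labeled), eval_set)
        rewards.append(new_performance - performance)  # reward = gain
        performance = new_performance
    return rewards, performance


pool = [i / 199 for i in range(200)]
eval_set = [(x, oracle(x)) for x in pool]
initial_acc = accuracy(fit_threshold([]), eval_set)
rewards, final_acc = run_episode(uncertainty_policy, pool, eval_set, budget=10)
print(initial_acc, final_acc, rewards)
```

Because each reward is a difference of successive accuracies, the rewards sum to the total improvement over the episode, so maximizing cumulative reward is the same as maximizing final performance for a fixed labeling budget. A reinforcement-learning treatment would replace the fixed heuristic in uncertainty_policy with a policy learned from such episodes.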

