Lecture, four hours; discussion, one hour; outside study, seven hours. Requisite: course 131A. Key concepts, principles, and algorithms of online learning and learning how to make decisions under uncertainty in broad context, including Markov decision processes, optimal stopping, reinforcement learning, structural results for online learning, multiarmed bandits learning, multiagent learning, multiagent deep learning. Letter grading.
Click on any course to view its details