Reinforcement Learning
This page collects and organizes material on reinforcement learning, grouped by topic.
Reference
- K. Murphy, “Reinforcement learning: An overview,” arXiv:2412.05265, Dec. 2024.
Abstract: This manuscript gives a big-picture, up-to-date overview of the field of (deep) reinforcement learning and sequential decision making, covering value-based RL, policy-gradient methods, model-based methods, and various other topics (including a very brief discussion of RL+LLMs).
@article{murphy2024reinforcement,
  author = {Kevin Murphy},
  eprint = {2412.05265},
  month = {12},
  title = {Reinforcement Learning: An Overview},
  url = {https://arxiv.org/pdf/2412.05265.pdf},
  year = {2024}
}