数学理论与应用 ›› 2022, Vol. 42 ›› Issue (3): 71-.doi: 10.3969/j.issn.1006-8074.2022.03.006

• • 上一篇    下一篇

一类随机最优控制问题的拟蒙特卡洛方法研究

周洪敏, 罗贤兵*, 叶昌伦   

  1. 贵州大学数学与统计学院, 贵阳 550025
  • 出版日期:2022-09-30 发布日期:2022-09-28
  • 通讯作者: 罗贤兵(1978−), 博士, 教授, 从事微分方程数值解、不确定量化及最优控制问题的数值研究; E−mail:luoxb121@163.com
  • 基金资助:
    国家自然科学基金项目(11961008)资助

Quasi-Monte Carlo Method for a Class of Stochastic Optimal Control Problems

Zhou Hongmin,Luo Xianbing*, Ye Changlun   

  1. School of Mathematics and Statistics, Gui Zhou University, Guiyang 550025, China
  • Online:2022-09-30 Published:2022-09-28

摘要: 本文使用梯度投影优化方法求解一类随机最优控制问题. 蒙特卡洛方法是处理随机最优控制问题的一种常用方法, 但收敛速度慢. 我们选取收敛速度较快的拟蒙特卡洛方法. 为使随机抽样维数和时间离散点独立, 我们对 Brown 运动进行 Karhunen-Lo${\grave{\rm e}}$ve 截断, 用拟蒙特卡洛方法中 Sobol 点序列抽样, 得出数值近似的误差, 并通过数值实验验证方法的有效性.

关键词: 随机微分方程, 最优控制, $Karhunen-Lo\grave{e}ve$展开, 拟蒙特卡洛方法

Abstract: In this paper, a gradient projection optimization method is applied to solve a class of stochastic optimal control problems. The Monte Carlo method is a common method to deal with stochastic optimal control problems, but it has a notoriously slow convergence rate. We choose the Quasi-Monte Carlo method with faster convergence.  In order to make the random sampling dimensions and time discrete points independent, we use the Karhunen-Lo${\grave{\rm e}}$ve truncation for the Brown motion. Sobol sequences of  the Quasi-Monte Carlo method are used for sampling. The error of numerical approximation is presented, and the effectiveness of the method is verified by numerical experiments.

Key words: Stochastic differential equation, Optimal control , $Karhunen-Lo\grave{e}ve$ expansion, Quasi-Monte Carlo method