Old Web
English
Sign In
Acemap
>
Paper
>
Policy Optimization Based on Bayesian Decision Theory in Learning Period on Markov Decision Process
Policy Optimization Based on Bayesian Decision Theory in Learning Period on Markov Decision Process
2020
Naoki Ichijo
Yuta Nakahara
Yuto Motomura
Toshiyasu Matsushima
Keywords:
Dynamic programming
bayes decision theory
period
Computer science
Reinforcement learning
Artificial intelligence
Bayes estimator
Markov decision process
Correction
Source
Cite
Save
Machine Reading By IdeaReader
0
References
0
Citations
NaN
KQI
[]