A Libertine of Computer Science

Tag: Probability

Reinforcement Learning--A3C

Reinforcement Learning--REINFORCE

Reinforcement Learning--Deep Q Network [DQN]

Reinforcement Learning--Temporal-Difference

Reinforcement Learning--Monte-Carlo

Reinforcement Learning--Markov Decision Process

Reinforcement Learning--Taxonomy

Reinforcement Learning--Element

Multi-Armed Bandit Problem

Prior Probability and Posterior Probability

Maximum-Likelihood Estimation

Likelihood Function