A Libertine of Computer Science
Categories
Book (8)
Development (66)
Research (68)
Writing (5)
Tags
Adjustment (12)
Algorithm (13)
Big Data (16)
Blockchain (1)
C&CPP (14)
Compile (6)
Concurrency (5)
CPU (5)
CUDA (5)
Database (25)
Distribute Computing (19)
Docker (5)
DuckDB (4)
FileSystem (12)
Git (1)
GPU (5)
Hash (1)
Idiom (1)
Java (10)
Latency (4)
Linux (1)
LLM (1)
Makefile (2)
Machine Learning (16)
NoSQL (1)
Note (12)
OS (8)
Paper (3)
Parallelism (7)
Pointer (3)
Probability (12)
Python (12)
RDMA (2)
Recommendation (1)
Reinforcement Learning (10)
Shell (3)
TensorFlow (5)
Virtualization (4)
Tag: Probability
Reinforcement Learning--A3C
Reinforcement Learning--REINFORCE
Reinforcement Learning--Deep Q Network[DQN]
Reinforcement Learning--Temporal-Difference
Reinforcement Learning--Monte-Carlo
Reinforcement Learning--Markov Decision Process
Reinforcement Learning--Taxonomy
Reinforcement Learning--Element
Multi-Armed Bandit Problem [多臂赌博机问题]
Prior Probability and Posterior Probability [先验概率和后验概率]
Maximum-Likelihood Estimation [最大似然估计]
Likelihood Function [似然函数]