From a probability density function to random samples Photo by Moritz Kindler on UnsplashT here are different methods for updating a reinforcement learning agent’s policy at each iteration. A few weeks ago we started experimenting with replacing our current method with a Bayesian inference step. Some of the data workloads within our agent are written…
