-
Policy Gradients Visual Tutorial
A visualization and tutorial on reinforcement learning algorithms, starting with good old gradient descent and ending with policy gradient algorithms.
The algorithms run in the browser using TensorFlow.js and are visualized using pure D3.js.
-
RL as Inference
A case study on solving reinforcement learning (RL) tasks by posing them as inference problems.
-
Review: Semi-Supervised Learning in GANs
A review of semi-supervised learning methods using GANs. A prevalent theme in these methods is combining a supervised classification loss with the unsupervised GAN loss to make use of the large pool of unlabeled data.
-
FOPPL/HOPPL Compiler
Metropolis-within-Gibbs and HMC inference engines for a first-order probabilistic programming language (FOPPL) compiler, as well as a likelihood-weighting interpreter for a higher-order probabilistic programming language (HOPPL).
-
Pendulum Simulation
A simple simulation of a multi-link pendulum.
Tags: Simulation, Computer Graphics
-