Farzad

Policy Gradients Visual Tutorial

A visualization as well as a tutorial on reinforcement learning algorithms, starting with the good old gradient descent and ending with the policy gradients algorithms.
The algorithms run in the browser using Tensorflow.js and are visualized using pure D3.js.

Tags: RL, Policy Gradients, Visualization, Tensorflow.js

RL as Inference

A case study on the applicability of solving reinforcement learning (RL) tasks when posed as an inference problem.

Tags: RL, Probabilistic Programming, Inference

Review: Semi-Supervised Learning in GANs

A review of semi-supervised learning methods using GANs. A prevalent theme in these methods was simply combining a supervised classification loss with the unsupervised GAN loss in order to make use of the huge set of unlabeled data.

Tags: GAN, Semi-Supervised Learning, Machine Learning

FOPPL/HOPPL Compiler

Metropolis within Gibbs and HMC inference engines for a first-order probabilistic programming language (FOPPL) compiler as well as a likelihood weighting interpreter for a higher-order probabilistic programming language (HOPPL).

Tags: Probabilistic Programming, Automatic Differentiation, MCMC, Compiler

Snake Locomotion

Simulating snake locomotion using PyBullet.

Tags: Simulation, PyBullet, RL

Pendulum Simulation

A simple simulation of a multi-link pendulum.

Tags: Simulation, Computer Graphics

(mini) Java Compiler

Compiler for a restricted version of the Java language, written in C++.

Tags: Compilers, C++, Systems