Hands-on Tutorials

A Practical Guide to Deep Q-Networks


An Introduction to the Advantage Actor Critic Algorithm


Figure 1: The Policy Gradients Algorithm

1. What is Reinforcement Learning?


1. Exponential Smoothing (or Exponentially Weighted Averages)


1. Which activation function should you choose: Sigmoid or Softmax?



Mike Wang

Hi there, I write and teach about cool topics in Data Science.
