We agreed to learn about Policy gradients from either the sutton and barto textbook or the UC Berkeley lectures. Also read SpinningUp's Policy Gradient sections.