Policy Gradient is a technique used in reinforcement learning where the goal is to optimize a policy that dictates the action to be taken by an agent. It involves adjusting the parameters of the policy in a way that maximizes the expected reward over time.