
A Very Short Introduction of Q-Learning
Q-Learning, introduced in 1989 by Chris Watkins, is a model-free reinforcement learning algorithm that discovers optimal decision-making strategies by evaluating actions in a given state. It is widely applied for scalable problem-solving, from fraud detection to energy grid optimisation and public transport scheduling.