Policy Iteration
25 December

A Very Short Introduction of Policy Iteration

Policy iteration, first introduced in the 1950s by Richard Bellman and refined by Andrew Barto and Richard Sutton, is a fundamental method in Reinforcement Learning for optimising decision-making strategies. By iteratively evaluating and improving policies, it ensures efficient and adaptive solutions for complex sequential decision problems.

MRF
25 December

A Very Short Introduction of Markov Random Fields (MRF)

Markov Random Fields (MRFs), introduced through Andrey Markov’s early 20th-century work and formalised by Julian Besag in the 1970s, are probabilistic graphical models for representing contextual dependencies. Widely used in applications like image processing, natural language processing, and environmental modeling, MRFs capture relationships within structured data.

25 December

A Very Short Introduction of Deep Belief Networks (DBNs)

Deep Belief Networks (DBNs), introduced in 2006 by Geoffrey Hinton and colleagues, revolutionised unsupervised learning by enabling hierarchical feature extraction and robust data representation. Widely used in industries like healthcare, finance, and transport, DBNs enhance tasks such as image recognition, NLP, and time-series prediction.