<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:content="http://purl.org/rss/1.0/modules/content/"><channel><title>Q-Learning on Mateusz Pieniak</title><link>https://mateuszpieniak.com/tags/q-learning/</link><description/><generator>Hugo -- 0.163.3</generator><language>en-us</language><lastBuildDate>Tue, 23 Jun 2026 00:00:00 +0000</lastBuildDate><atom:link href="https://mateuszpieniak.com/tags/q-learning/index.xml" rel="self" type="application/rss+xml"/><item><title>Reinforcement Learning 103: Approximate Methods</title><link>https://mateuszpieniak.com/courses/reinforcement-learning/103-approximate-methods/</link><pubDate>Tue, 23 Jun 2026 00:00:00 +0000</pubDate><guid>https://mateuszpieniak.com/courses/reinforcement-learning/103-approximate-methods/</guid><description>Approximate model-free RL: function approximation, regression targets, loss functions, semi-gradient TD, approximate SARSA, Expected SARSA, Q-learning, and the deadly triad.</description></item><item><title>Reinforcement Learning 102: Q-learning &amp; SARSA</title><link>https://mateuszpieniak.com/courses/reinforcement-learning/102-q-learning-sarsa/</link><pubDate>Sun, 21 Jun 2026 00:00:00 +0000</pubDate><guid>https://mateuszpieniak.com/courses/reinforcement-learning/102-q-learning-sarsa/</guid><description>Model-free RL: Monte Carlo and Temporal Difference control, Q-learning, SARSA, Expected SARSA, and exploration.</description></item></channel></rss>