Some Notes on Generalization in RL

This is a personal note on some of the ideas in RL the writer learned so far. Many of them could be incorrect and I’d be happy if you could let me know. In this article, we are going to include some of these ideas proposed in such field. Note that most of the content actually comes from this book. Background First of all, let’s define the notations and settings. $$ \begin{align*} {\cal S}\quad&\text{state space}\\ {\cal A}\quad&\text{action space}\\ H\quad&\text{horizon}\\ s\quad&\text{state}\\ a\quad&\text{action}\\ h\quad&\text{step}\\ r_h(s,a)\quad&\text{reward}\\ \Bbb P_h(\cdot\vert s,a)\quad&\text{transition probability}\\ K\quad&\text{number of episodes}\\ k\quad&\text{episode}\\ \end{align*} $$ We say we are in episodic setting and consider the finite horizon MDP ${\cal M}=({\cal S}, {\cal A}, H, \Bbb P, r)$. We are also going to use this setting in most parts of this article. ...

February 10, 2025

Settings in Theoretical Machine Learning

This is a personal learing note on some of the baisc settings I have encountered so far in a learning theory course. If there is any typo or other kind of mistake, I’d be happy if you could let me know. One of the main idea behind learing theory is to figure out why machine learning algorithms work from a statistical or mathematical perspective. This article talks about the basic settings used in learning theory. ...

December 10, 2024

Build Your Blog: One Possible Example

This article talks about the building process of this blog. Lots of people in both academic and engineering field have their own blog. But as known to all, building your personal website is a somewhat tedious work, especially for those who unfamiliar with front-end tools. This article introduces one possible example to build your blog in a both efficient and cost-free way in perspective from a non-frontend worker. Framework: Hugo Since we do not wish to write raw html, the first thing to do is to pick a blog framework. Modern blog frameworks help us build our websites more efficiently. They can convert markdown files into blog posts automatically. Also, there are lots of templates that we can use which reduce the necessity to design a layout by ourselves. ...

November 30, 2024