Michael Pilosov 2022-03-20 20:05:49 -06:00
parent 177c1ff006
commit f6c2a1e630

@@ -22,10 +22,7 @@ python main.py
The inputs are the parameters to a `1x4` matrix which is multiplied against the observations of the state in order to make a decision for the next action (push left or right). The output of the vector inner-product is binarized by comparing it to zero as a threshold value.
The parameter space is standard normal.
-There is no assumed error in observations; the "data variance" is designed to reflect the acceptable ranges for the parameters:
-From [gym](https://www.gymlibrary.ml/pages/environments/classic_control/cart_pole):
+There is no assumed error in observations; the "data variance" is designed to reflect the acceptable [ranges for the observations](https://www.gymlibrary.ml/pages/environments/classic_control/cart_pole):
- The cart x-position (index 0) can take values between (-4.8, 4.8), but the episode terminates if the cart leaves the (-2.4, 2.4) range.
- The pole angle can be observed between (-.418, .418) radians (or ±24°), but the episode terminates if the pole angle is not in the range (-.2095, .2095) radians (or ±12°).
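
The decision rule and termination ranges described above can be sketched in a few lines of plain Python. This is a minimal illustration, not the code in `main.py`: the names `choose_action`, `is_terminal`, `X_LIMIT`, and `THETA_LIMIT` are hypothetical, the observation order `[x, x_dot, theta, theta_dot]` follows the gym documentation, and mapping a positive inner product to action `1` (push right) is an assumption.

```python
import random

# Termination thresholds quoted in the list above (from the gym CartPole docs).
X_LIMIT = 2.4          # cart x-position bound
THETA_LIMIT = 0.2095   # pole angle bound in radians (about 12 degrees)


def choose_action(params, observation):
    """Binarize the inner product of the 4 parameters and the 4 observations.

    Returns 1 if the inner product is above zero, else 0. Which of the two
    values means "push right" is an assumption made for this sketch.
    """
    score = sum(p * o for p, o in zip(params, observation))
    return int(score > 0)


def is_terminal(observation):
    """Check the two termination ranges on x-position and pole angle."""
    x, _, theta, _ = observation  # assumed order: x, x_dot, theta, theta_dot
    return abs(x) > X_LIMIT or abs(theta) > THETA_LIMIT


# Draw one parameter vector from the standard normal parameter space.
rng = random.Random(0)
params = [rng.gauss(0.0, 1.0) for _ in range(4)]

observation = [0.0, 0.0, 0.05, 0.0]
action = choose_action(params, observation)
```

Because the policy is a single thresholded inner product, scaling `params` by any positive constant leaves every decision unchanged; only the direction of the parameter vector matters.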