style
This commit is contained in:
parent
177c1ff006
commit
f6c2a1e630
@ -22,10 +22,7 @@ python main.py
|
||||
The inputs are the parameters to a `1x4` matrix which is multiplied against the observations of the state in order to make a decision for the next action (push left or right). The output of the vector inner-product is binarized by comparing it to zero as a threshold value.
|
||||
|
||||
The parameter space is standard normal.
|
||||
There is no assumed error in observations; the "data variance" is designed to reflect the acceptable ranges for the parameters:
|
||||
|
||||
From [gym](https://www.gymlibrary.ml/pages/environments/classic_control/cart_pole):
|
||||
|
||||
There is no assumed error in observations; the "data variance" is designed to reflect the acceptable [ranges for the observations](https://www.gymlibrary.ml/pages/environments/classic_control/cart_pole):
|
||||
- The cart x-position (index 0) can be take values between (-4.8, 4.8), but the episode terminates if the cart leaves the (-2.4, 2.4) range.
|
||||
- The pole angle can be observed between (-.418, .418) radians (or ±24°), but the episode terminates if the pole angle is not in the range (-.2095, .2095) (or ±12°)
|
||||
|
||||
|
Loading…
Reference in New Issue
Block a user