Open
Hi,
Thanks a lot for this extremely useful implementation.
I just wanted to ask what the ZFilter class is. Is it used to standardize the observed state using the running mean and standard deviation of the observed states?
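For concreteness, here is a minimal sketch of the kind of running standardization described above. It is not the repository's ZFilter: the class name `RunningStandardizer`, the `clip` and `eps` parameters, and the choice of Welford's online update are all assumptions made for illustration.

```python
import math


class RunningStandardizer:
    """Hypothetical illustration of running-mean/std state normalization."""

    def __init__(self, dim, clip=10.0, eps=1e-8):
        self.n = 0                  # number of states seen so far
        self.mean = [0.0] * dim     # running mean per dimension
        self.m2 = [0.0] * dim       # running sum of squared deviations
        self.clip = clip            # assumed clipping range for the output
        self.eps = eps              # avoids division by zero

    def update(self, x):
        # Welford's online update of the mean and squared deviations.
        self.n += 1
        for i, xi in enumerate(x):
            delta = xi - self.mean[i]
            self.mean[i] += delta / self.n
            self.m2[i] += delta * (xi - self.mean[i])

    def std(self):
        # With fewer than two samples the variance is undefined; use 1.0.
        if self.n < 2:
            return [1.0] * len(self.mean)
        return [math.sqrt(m / (self.n - 1)) for m in self.m2]

    def __call__(self, x, update=True):
        if update:
            self.update(x)
        out = []
        for xi, mu, sd in zip(x, self.mean, self.std()):
            z = (xi - mu) / (sd + self.eps)
            out.append(max(-self.clip, min(self.clip, z)))
        return out
```

In a sketch like this, each observed state would be passed through the filter before being fed to the policy, with `update=False` when the statistics should stay frozen (for example during evaluation).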
In addition, the GAIL paper includes, in the TRPO update, a step in the direction of the gradient of the entropy. Is that term included here? I cannot find it in the code.
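To make the entropy question concrete, here is a minimal sketch of how an entropy term could be added to a reward-maximizing surrogate objective. It assumes a diagonal Gaussian policy with a state-independent log standard deviation; the function names and the coefficient `lam` are hypothetical and are not taken from this repository.

```python
import math


def gaussian_entropy(log_std):
    """Entropy of a diagonal Gaussian given its per-dimension log std."""
    d = len(log_std)
    return sum(log_std) + 0.5 * d * (1.0 + math.log(2.0 * math.pi))


def surrogate_with_entropy(ratios, advantages, log_std, lam=1e-3):
    """Mean importance-weighted advantage plus lam times the policy entropy.

    ratios:     pi_new(a|s) / pi_old(a|s) for each sample
    advantages: advantage estimate for each sample
    lam:        assumed entropy coefficient (hypothetical default)
    """
    surrogate = sum(r * a for r, a in zip(ratios, advantages)) / len(ratios)
    return surrogate + lam * gaussian_entropy(log_std)
```

With `lam=0` this reduces to the plain surrogate, which would be consistent with the entropy term simply being absent from the update.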
Thank you in advance.
Luca