Release version 2.1.3b2
- Extract frequently used functions into BaseEnv and ensure that the step functions of the base environments call reset once an episode has terminated.
- Fix agents to conform to the stable_baselines 1.6 specification (BaseAlgorithm has no policy_base keyword argument).
- Fix the configuration of the normalization wrapper during environment vectorization (remove the norm_wrapper_clip_obs option).
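
The auto-reset behavior described in the BaseEnv entry can be illustrated with a minimal sketch. The class below is a hypothetical stand-in, not the project's actual BaseEnv: the `episode_length` parameter, the `reset_count` attribute, and the integer observations are assumptions made only for this example. It assumes a gym-style `step` that returns `(obs, reward, done, info)`.

```python
class BaseEnv:
    """Hypothetical stand-in for the project's BaseEnv (illustration only)."""

    def __init__(self, episode_length=3):
        self.episode_length = episode_length  # assumed fixed episode length
        self.t = 0
        self.reset_count = 0

    def reset(self):
        # Start a new episode and return its initial observation.
        self.t = 0
        self.reset_count += 1
        return 0

    def step(self, action):
        # Advance one time step; the observation is simply the step counter.
        self.t += 1
        obs, reward = self.t, 1.0
        done = self.t >= self.episode_length
        if done:
            # Auto-reset: hand back the first observation of the next episode.
            obs = self.reset()
        return obs, reward, done, {}


env = BaseEnv(episode_length=3)
env.reset()

trace = []
for _ in range(4):
    obs, reward, done, info = env.step(action=None)
    trace.append((obs, done))
```

With this pattern the caller never has to call reset between episodes: the step that ends an episode reports `done=True` and already returns the initial observation of the next one.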