Observation passing with interaction environments is not correct
Currently only observations from the interaction env are used an observations from the actual environment are thrown out.
Currently only observations from the interaction env are used an observations from the actual environment are thrown out.