reinforcement learning limits