Actor-critic reinforcement learning with simultaneous human control and feedback

Publication: arXiv preprint arXiv:1703.01274