Learning feature relevance through step size adaptation in temporal-difference learning

Publication
arXiv preprint arXiv:1903.03252