54 entries in total
[1]
Abadi M., 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
[3]
Abdolmaleki A., 2018, Relative entropy regularized policy iteration
[4]
Abdolmaleki A., MULTIOBJECTIVE POLIC
[5]
Akkaya I., 2019, Solving Rubik's Cube with a robot hand
[9]
Andrychowicz M., 2021, ICLR 2021 9 INT C LE
[10]
Aslanides John, 2019, DM ENV PYTHON INTERF