Adversarially Trained Weighted Actor-Critic for Safe Offline Reinforcement Learning
Honghao Wei, Xiyue Peng, Arnob Ghosh, Xin Liu
arXiv:2401.00629v2 »Full PDF »
Honghao Wei, Xiyue Peng, Arnob Ghosh, Xin Liu
arXiv:2401.00629v2 »Full PDF »Xiyue Peng, Hengquan Guo, Jiawei Zhang, Dongqing Zou, Ziyu Shao, Honghao Wei, Xin Liu
arXiv:2410.19933v1 »Full PDF »Jonathan Booher, Khashayar Rohanimanesh, Junhong Xu, Vladislav Isenbaev, Ashwin Balakrishna, Ishan Gupta, Wei Liu, Aleksandr Petiushko
arXiv:2406.08878v4 »Full PDF »Chia Xin Liang, Pu Tian, Caitlyn Heqi Yin, Yao Yua, Wei An-Hou, Li Ming, Tianyang Wang, Ziqian Bi, Ming Liu
arXiv:2411.06284v1 »Full PDF »Siming Huang, Tianhao Cheng, J. K. Liu, Jiaran Hao, Liuyihan Song, Yang Xu, J. Yang, J. H. Liu, Chenchen Zhang, Linzheng Chai, Ruifeng Yuan, Zhaoxiang Zhang, Jie Fu, Qian Liu, Ge Zhang, Zili Wang, Yuan Qi, Yinghui Xu, Wei Chu
arXiv:2411.04905v2 »Full PDF »Jakub Łucki, Boyi Wei, Yangsibo Huang, Peter Henderson, Florian Tramèr, Javier Rando
arXiv:2409.18025v3 »Full PDF »Spotlight paper at Neurips 2024 SoLaR workshop
Qiang Hu, Jin Wen, Maxime Cordy, Yuheng Huang, Wei Ma, Xiaofei Xie, Lei Ma
arXiv:2404.14419v2 »Full PDF »Zhipeng Wei, Yuqi Liu, N. Benjamin Erichson
arXiv:2411.01077v1 »Full PDF »Shiji Zhao, Ranjie Duan, Xizhe Wang, Xingxing Wei
arXiv:2312.05508v3 »Full PDF »Accepted by NeurIPS2024
Yutao Mou, Shikun Zhang, Wei Ye
arXiv:2410.21965v1 »Full PDF »Accepted by NeurIPS2024 (Dataset and Benchmark Track)