1.Coevolving with the Other You: Fine-Tuning LLM with Sequential Cooperative Multi-Agent Reinforcement Learning
Ma, Hao, Hu, Tianyi,
More...
Ma, Hao, Hu, Tianyi, Pu, Zhiqiang, Liu, Boyin, Ai, Xiaolin, Liang, Yanyan, Chen, Min
Less
Advances in Neural Information Processing Systems[1049-5258],
Published 2024,
Volume 37,
收錄情况:
SCOPUS