2026
Agent Data Protocol: Unifying Datasets for Diverse, Effective Fine-Tuning of LLM Agents
Yueqi Song, Ketan Ramaneti, Zaid Sheikh, Ziru Chen, Boyu Gou, Tianbao Xie, Yiheng Xu, Danyang Zhang, Apurva Gandhi, Fan Yang, Joseph Liu, Tianyue Ou, Zhihao Yuan, Frank Xu, Shuyan Zhou, Xiang Yue, Tao Yu, Huan Sun, Yu Su, Graham Neubig
Cite Score
0
AI summary
This paper introduces the Agent Data Protocol (ADP), a light-weight representation language, to unify 13 existing agent training datasets, achieving an average performance gain of ~20% over base models and state-of-the-art or near-SOTA results on coding, browsing, tool use, and research benchmarks without domain-specific tuning.
Main Contributions
Abstract
Public research results on large-scale supervised finetuning of AI agents remain relatively rare, since the collection of agent training data presents unique challenges. In this work, we argue that the bottleneck is not a lack of underlying data sources, but that a large variety of data is fragmented across heterogeneous formats, tools, and interfaces. To this end, we introduce the Agent Data Protocol (ADP), a light-weight representation language that serves as an "interlingua" between agent datasets in diverse formats and unified agent training pipelines downstream. The design of ADP is expressive enough to capture a large variety of tasks, including API/tool use, browsing, coding, software engineering, and general agentic workflows, while remaining simple to parse and train on without engineering at a per-dataset level. In experiments, we unified a broad collection of 13 existing agent training datasets into ADP format, and converted the standardized ADP data into training-ready formats for multiple agent frameworks. We performed supervised finetuning on the unified data, and demonstrated an average performance gain of ~20% over corresponding base models, and delivers state-of-the-art or near-SOTA performance on standard coding, browsing, tool use, and research benchmarks, without domain-specific tuning. All code and data are released publicly, in the hope that ADP could help lower the barrier to standardized, scalable, and reproducible agent training.
Citation Graph
References [58]
P. Rajpurkar, J. Zhang, K. Lopyrev, Percy Liang - 2016
37 papers in library cite
A. L. Maas, R. E. Daly, P. T. Pham, Dong Huang, Andrew Y. Ng, Christopher Potts - 2011
12 papers in library cite
Reiichiro Nakano, Jacob Hilton, Suchir Balaji, Jeffrey Wu, Long Ouyang, Christina Kim, Christopher Hesse, Shantanu Jain, Vineet Kosaraju, William Saunders, Xu Jiang, Karl Cobbe, Tyna Eloundou, Gretchen Krueger, Kevin Button, Matthew Knight, Benjamin Chess, John Schulman - 2021
7 papers in library cite
S. Yao, J. Zhao, D. Yu, N. Du, I. Shafran, K. R. Narasimhan, Yue Cao - 2022
3 papers in library cite
Q. Team - 2024
1 paper in library cites
A. Yang, A. Li, B. Yang, B. Zhang, B. Hui, Bo Zheng, B. Yu, C. Gao, C. Huang, C. Lv, C. Zheng, D. Liu, F. Zhou, F. Huang, F. Hu, H. Ge, H. Wei, Haowei Lin, Jie Tang, Jihan Yang, J. Tu, J. Zhang, Jihan Yang, Jihan Yang, Jingren Zhou, Jingren Zhou, Junyang Lin, K. Dang, K. Bao, K. Yang, Longhui Yu, L. Deng, M. Li, M. Xue, M. Li, Peizhao Zhang, Peng Wang, Qihao Zhu, R. Men, R. Gao, Shuming Liu, S. Luo, Tao Li, T. Tang, W. Yin, Xiang Ren, Xinpeng Wang, X. Zhang, Xiang Ren, Yu Fan, Yu Su, Y. Z. Zhang, Y. Z. Zhang, Y. Wan, Yibo Liu, Zhengtao Wang, Z. Cui, Zhengyou Zhang, Zijian Zhou, Z. Qiu - 2025
5 papers in library cite
C. E. Jimenez, Jihan Yang, A. Wettig, S. Yao, K. Pei, O. Press, K. R. Narasimhan - 2024
2 papers in library cite
E. Nijkamp, Bo Pang, H. Hayashi, L. Tu, Haiming Wang, Y. Zhou, Silvio Savarese, Caiming Xiong - 2023
1 paper in library cites
Y. Zheng, Robert Zhang, J. Zhang, Yanfang Ye, Z. Luo, Z. Feng, Yi Ma - 2024
1 paper in library cites
B. Hui, Jihan Yang, Z. Cui, Jihan Yang, D. Liu, Li Zhang, T. Liu, J. Zhang, B. Yu, K. Lu - 2024
1 paper in library cites
Jihan Yang, C. E. Jimenez, A. Wettig, K. Lieret, S. Yao, K. Narasimhan, O. Press - 2024
1 paper in library cites
Z. Luo, Chenfeng Xu, P. Zhao, Q. Sun, X. Geng, W. Hu, C. Tao, J. Ma, Q. Lin, D. Jiang - 2023
1 paper in library cites
Shuyan Zhou, F. F. Xu, H. Zhu, Xinyu Zhou, R. Lo, A. Sridhar, X. Cheng, Tianyue Ou, Yonatan Bisk, D. Fried, U. Alon, Graham Neubig - 2024
1 paper in library cites
G. Mialon, C. Fourrier, Thomas Wolf, Yann Lecun, T. Scialom - 2023
1 paper in library cites
A. H. Ai - 2024
1 paper in library cites
Xiaodong Liu, H. Yu, Haowei Zhang, Yiheng Xu, X. Lei, H. Lai, Y. Gu, H. Ding, K. Men, K. Yang, S. Zhang, Xiang Deng, A. Zeng, Z. Du, Chiyuan Zhang, S. Shen, Tong Zhang, Yu Su, Huan Sun, M. Huang, Y. Dong, Jie Tang - 2024
1 paper in library cites
A. Zeng, Mickel Liu, R. Lu, B. Wang, Xiaodong Liu, Y. Dong, Jie Tang - 2023
1 paper in library cites
Shuming Liu, H. Cheng, Haozhe Liu, Haowei Zhang, F. Li, T. Ren, X. Zou, Jihan Yang, H. Su, Jiacheng Zhu - 2024
1 paper in library cites
A. Drouin, M. Gasse, M. Caccia, I. H. Laradji, M. D. Verme, T. Marty, D. Vazquez, N. Chapados, A. Lacoste - 2024
1 paper in library cites
J. Luo, Wenxuan Zhang, Y. Yuan, Y. Zhao, Jihan Yang, Y. Gu, Bo Wu, Berlin Chen, Z. Qiao, Q. Long - 2025
1 paper in library cites
J. Pan, Xinpeng Wang, Graham Neubig, Navdeep Jaitly, H. Ji, A. Suhr, Y. Z. Zhang - 2025
1 paper in library cites
Jihan Yang, K. Leret, C. E. Jimenez, A. Wettig, K. Khandpur, Y. Z. Zhang, B. Hui, O. Press, Ludwig Schmidt, Diyi Yang - 2025
1 paper in library cites
Z. Xi, Y. Ding, Weizhu Chen, B. Hong, H. Guo, J. Wang, X. Guo, Diyi Yang, C. Liao, Weiran He, S. Gao, L. C. Chen, R. Zheng, Y. Zou, T. Gui, Q. Zhang, X. Qiu, X. Huang, Ziyi Wu, Y. G. Jiang - 2025
1 paper in library cites
M. Mohammadi, Yiwei Li, J. Lo, W. Yip - 2025
1 paper in library cites
T. L. S. D. Chezelles, M. Gasse, A. Lacoste, M. Caccia, A. Drouin, L. Boisvert, M. Thakkar, T. Marty, R. Assouel, S. O. Shayegan, L. K. Jang, X. H. Lu, O. Yoran, D. Kong, F. F. Xu, Siva Reddy, Graham Neubig, Q. Cappart, Ruslan Salakhutdinov, N. Chapados - 2025
1 paper in library cites
Yiheng Xu, Di Lu, Z. Shen, J. Wang, Zhengtao Wang, Y. Mao, Caiming Xiong, Tao Yu - 2024
1 paper in library cites
A. Team - 2024
1 paper in library cites
A. Mitra, L. D. Corro, G. Zheng, S. Mahajan, D. Rouhana, A. Codas, Y. Lu, W. G. Chen, O. Vrousgos, C. Rosset - 2024
1 paper in library cites
Timo Schick, J. D. Yu, R. Dessi, Roberta Raileanu, M. Lomeli, Eric Hambro, Luke Zettlemoyer, N. Cancedda, T. Scialom - 2023
2 papers in library cite
Gloria Wang, Y. Xie, Y. Jiang, A. Mandlekar, C. Xiao, Yuxuan Zhu, L. Fan, A. Anandkumar - 2024
2 papers in library cite
L. Ning, Z. Liang, Zhejun Jiang, H. Qu, Y. Ding, W. Fan, X. Y. Wei, Stephen Lin, Haozhe Liu, P. S. Yu - 2025
1 paper in library cites
Ziru Chen, K. Liu, Q. Wang, Wenxuan Zhang, Joseph Liu, D. Lin, K. Chen, F. Zhao - 2024
1 paper in library cites
Yueqi Song, W. Xiong, Xuandong Zhao, D. Zhu, Wenhao Wu, K. Wang, Chun-Liang Li, W. Peng, Shanda Li - 2024
1 paper in library cites
J. Zhang, T. Lan, R. Murthy, Ze Liu, W. Yao, M. Zhu, J. Tan, T. Hoang, Ze Liu, L. Yang - 2024
1 paper in library cites
C. Rawles, A. Li, D. Rodriguez, O. Riva, T. Lillicrap - 2023
1 paper in library cites
A. Paullada, I. D. Raji, E. M. Bender, E. Denton, A. Hanna - 2021
1 paper in library cites
D. Zha, Z. P. Bhat, K. H. Lai, Fan Yang, Zhejun Jiang, S. Zhong, X. Hu - 2025
1 paper in library cites
Xinpeng Wang, Yanru Chen, Lifan Yuan, Y. Z. Zhang, Yiwei Li, H. Peng, H. Ji - 2024
1 paper in library cites
Apurva Gandhi, Graham Neubig - 2025
1 paper in library cites
I. M. Putrama, P. Martinek - 2024
1 paper in library cites
N. Chowdhury, J. Aung, C. J. Shern, O. Jaffe, D. Sherburn, G. Starace, E. Mays, R. Dias, M. Aljubeh, M. Glaese - 2024
1 paper in library cites
A. Golubev, S. Polezhaev, K. Zainullina, M. Trofimova, I. Badertdinov, Y. Anapolskiy, D. Litvintseva, S. Karasik, F. Fisin, S. Skvortsov - 2024
1 paper in library cites
E. Bhardwaj, H. Gujral, S. Wu, C. Zogheib, T. Maharaj, C. Becker - 2024
1 paper in library cites
Xiang Deng, Y. Gu, Bo Zheng, S. Chen, S. Stevens, B. Wang, Huan Sun, Yu Su - 2023
1 paper in library cites
D. Mueller, Mark Dredze, N. Andrews - 2024
1 paper in library cites
S. Murty, H. Zhu, D. Bahdanau, Christopher D. Manning - 2024
1 paper in library cites
R. Kapoor, Y. P. Butala, M. Russak, J. Y. Koh, K. Kamble, W. Alshikh, Ruslan Salakhutdinov - 2024
1 paper in library cites
T. Zheng, G. Zhang, T. Shen, Xiaodong Liu, Bill Yuchen Lin, J. Fu, Weizhu Chen, Xiang Yue - 2024
1 paper in library cites
Xinpeng Wang, Boxuan Li, Yueqi Song, F. F. Xu, X. Tang, Mingchen Zhuge, J. Pan, Yueqi Song, Boxuan Li, J. Singh, H. H. Tran, F. Li, R. Ma, M. Zheng, B. Qian, Y. Shao, Niklas Muennighoff, Y. Z. Zhang, B. Hui, Junyang Lin, R. Brennan, H. Peng, H. Ji, Graham Neubig - 2025
1 paper in library cites
M. Hu, Y. Zhou, W. Fan, Y. Nie, B. Xia, T. Sun, Z. Ye, Z. Jin, Yiwei Li, Qinlang Chen, Zhengyou Zhang, Yuzhi Wang, Q. Ye, B. Ghanem, P. Luo, G. Li - 2025
1 paper in library cites
H. Li, L. Ding, M. Fang, D. Tao - 2024
1 paper in library cites
Tianyue Ou, F. F. Xu, Aman Madaan, Joseph Liu, R. Lo, A. Sridhar, S. Sengupta, Dan Roth, Graham Neubig, Shuyan Zhou - 2024
1 paper in library cites
T. Masterman, S. Besen, M. Sawtell, A. Chao - 2024
1 paper in library cites
K. Xu, Y. Kordi, T. Nayak, A. Asija, Yuzhi Wang, K. Sanders, A. Byerly, J. Zhang, B. V. Durme, Daniel Khashabi - 2024
1 paper in library cites
Suhas Kotha, J. M. Springer, A. Raghunathan - 2024
1 paper in library cites
X. H. Lu, Z. Kasner, Siva Reddy - 2024
1 paper in library cites
S. Yao, H. Chen, Jihan Yang, K. Narasimhan - 2022
1 paper in library cites
J. Zhang, T. Lan, M. Zhu, Ze Liu, T. Q. Hoang, S. Kokane, W. Yao, J. Tan, A. Prabhakar, H. Chen, Ze Liu, Y. Feng, T. M. Awalgaonkar, R. N. Rithesh, Ziru Chen, Runxin Xu, J. C. Niebles, S. Heinecke, Haiming Wang, Silvio Savarese, Caiming Xiong - 2025
1 paper in library cites
Cited by
0
papers in your library
Cites
27
papers in your library
Read
on April 16, 2026
Your review
Tags
Paper Aliases
No aliases