2026

Agent Data Protocol: Unifying Datasets for Diverse, Effective Fine-Tuning of LLM Agents

Yueqi Song, Ketan Ramaneti, Zaid Sheikh, Ziru Chen, Boyu Gou, Tianbao Xie, Yiheng Xu, Danyang Zhang, Apurva Gandhi, Fan Yang, Joseph Liu, Tianyue Ou, Zhihao Yuan, Frank Xu, Shuyan Zhou, Xiang Yue, Tao Yu, Huan Sun, Yu Su, Graham Neubig

citations

Cite Score

0

AI summary

This paper introduces the Agent Data Protocol (ADP), a light-weight representation language, to unify 13 existing agent training datasets, achieving an average performance gain of ~20% over base models and state-of-the-art or near-SOTA results on coding, browsing, tool use, and research benchmarks without domain-specific tuning.

Main Contributions

  • Introduction of the Agent Data Protocol (ADP), a standardized, expressive representation language for agent data.
  • Unification of 13 diverse existing agent training datasets into the ADP format, creating the largest publicly available dataset for agent training (ADP Dataset V1) with 1.3M trajectories.
  • Demonstration of significant performance improvements (average ~20% gain over base models) through supervised fine-tuning on ADP-unified data across various domains.
  • Achievement of state-of-the-art or near-SOTA performance on coding, browsing, tool use, and research benchmarks using ADP, without domain-specific tuning.
  • Release of all code and datasets in open source to promote standardized, scalable, and reproducible agent training.

Abstract

Public research results on large-scale supervised finetuning of AI agents remain relatively rare, since the collection of agent training data presents unique challenges. In this work, we argue that the bottleneck is not a lack of underlying data sources, but that a large variety of data is fragmented across heterogeneous formats, tools, and interfaces. To this end, we introduce the Agent Data Protocol (ADP), a light-weight representation language that serves as an "interlingua" between agent datasets in diverse formats and unified agent training pipelines downstream. The design of ADP is expressive enough to capture a large variety of tasks, including API/tool use, browsing, coding, software engineering, and general agentic workflows, while remaining simple to parse and train on without engineering at a per-dataset level. In experiments, we unified a broad collection of 13 existing agent training datasets into ADP format, and converted the standardized ADP data into training-ready formats for multiple agent frameworks. We performed supervised finetuning on the unified data, and demonstrated an average performance gain of ~20% over corresponding base models, and delivers state-of-the-art or near-SOTA performance on standard coding, browsing, tool use, and research benchmarks, without domain-specific tuning. All code and data are released publicly, in the hope that ADP could help lower the barrier to standardized, scalable, and reproducible agent training.

Citation Graph

Loading graph...

References [58]

Sort:
Filter:

P. Rajpurkar, J. Zhang, K. Lopyrev, Percy Liang - 2016

37 papers in library cite

A. L. Maas, R. E. Daly, P. T. Pham, Dong Huang, Andrew Y. Ng, Christopher Potts - 2011

12 papers in library cite

Reiichiro Nakano, Jacob Hilton, Suchir Balaji, Jeffrey Wu, Long Ouyang, Christina Kim, Christopher Hesse, Shantanu Jain, Vineet Kosaraju, William Saunders, Xu Jiang, Karl Cobbe, Tyna Eloundou, Gretchen Krueger, Kevin Button, Matthew Knight, Benjamin Chess, John Schulman - 2021

7 papers in library cite

S. Yao, J. Zhao, D. Yu, N. Du, I. Shafran, K. R. Narasimhan, Yue Cao - 2022

3 papers in library cite

Q. Team - 2024

1 paper in library cites

A. Yang, A. Li, B. Yang, B. Zhang, B. Hui, Bo Zheng, B. Yu, C. Gao, C. Huang, C. Lv, C. Zheng, D. Liu, F. Zhou, F. Huang, F. Hu, H. Ge, H. Wei, Haowei Lin, Jie Tang, Jihan Yang, J. Tu, J. Zhang, Jihan Yang, Jihan Yang, Jingren Zhou, Jingren Zhou, Junyang Lin, K. Dang, K. Bao, K. Yang, Longhui Yu, L. Deng, M. Li, M. Xue, M. Li, Peizhao Zhang, Peng Wang, Qihao Zhu, R. Men, R. Gao, Shuming Liu, S. Luo, Tao Li, T. Tang, W. Yin, Xiang Ren, Xinpeng Wang, X. Zhang, Xiang Ren, Yu Fan, Yu Su, Y. Z. Zhang, Y. Z. Zhang, Y. Wan, Yibo Liu, Zhengtao Wang, Z. Cui, Zhengyou Zhang, Zijian Zhou, Z. Qiu - 2025

5 papers in library cite

C. E. Jimenez, Jihan Yang, A. Wettig, S. Yao, K. Pei, O. Press, K. R. Narasimhan - 2024

2 papers in library cite

E. Nijkamp, Bo Pang, H. Hayashi, L. Tu, Haiming Wang, Y. Zhou, Silvio Savarese, Caiming Xiong - 2023

1 paper in library cites

Y. Zheng, Robert Zhang, J. Zhang, Yanfang Ye, Z. Luo, Z. Feng, Yi Ma - 2024

1 paper in library cites

B. Hui, Jihan Yang, Z. Cui, Jihan Yang, D. Liu, Li Zhang, T. Liu, J. Zhang, B. Yu, K. Lu - 2024

1 paper in library cites

Jihan Yang, C. E. Jimenez, A. Wettig, K. Lieret, S. Yao, K. Narasimhan, O. Press - 2024

1 paper in library cites

Z. Luo, Chenfeng Xu, P. Zhao, Q. Sun, X. Geng, W. Hu, C. Tao, J. Ma, Q. Lin, D. Jiang - 2023

1 paper in library cites

Shuyan Zhou, F. F. Xu, H. Zhu, Xinyu Zhou, R. Lo, A. Sridhar, X. Cheng, Tianyue Ou, Yonatan Bisk, D. Fried, U. Alon, Graham Neubig - 2024

1 paper in library cites

G. Mialon, C. Fourrier, Thomas Wolf, Yann Lecun, T. Scialom - 2023

1 paper in library cites

A. H. Ai - 2024

1 paper in library cites

Xiaodong Liu, H. Yu, Haowei Zhang, Yiheng Xu, X. Lei, H. Lai, Y. Gu, H. Ding, K. Men, K. Yang, S. Zhang, Xiang Deng, A. Zeng, Z. Du, Chiyuan Zhang, S. Shen, Tong Zhang, Yu Su, Huan Sun, M. Huang, Y. Dong, Jie Tang - 2024

1 paper in library cites

A. Zeng, Mickel Liu, R. Lu, B. Wang, Xiaodong Liu, Y. Dong, Jie Tang - 2023

1 paper in library cites

Shuming Liu, H. Cheng, Haozhe Liu, Haowei Zhang, F. Li, T. Ren, X. Zou, Jihan Yang, H. Su, Jiacheng Zhu - 2024

1 paper in library cites

A. Drouin, M. Gasse, M. Caccia, I. H. Laradji, M. D. Verme, T. Marty, D. Vazquez, N. Chapados, A. Lacoste - 2024

1 paper in library cites

J. Luo, Wenxuan Zhang, Y. Yuan, Y. Zhao, Jihan Yang, Y. Gu, Bo Wu, Berlin Chen, Z. Qiao, Q. Long - 2025

1 paper in library cites

J. Pan, Xinpeng Wang, Graham Neubig, Navdeep Jaitly, H. Ji, A. Suhr, Y. Z. Zhang - 2025

1 paper in library cites

Jihan Yang, K. Leret, C. E. Jimenez, A. Wettig, K. Khandpur, Y. Z. Zhang, B. Hui, O. Press, Ludwig Schmidt, Diyi Yang - 2025

1 paper in library cites

Z. Xi, Y. Ding, Weizhu Chen, B. Hong, H. Guo, J. Wang, X. Guo, Diyi Yang, C. Liao, Weiran He, S. Gao, L. C. Chen, R. Zheng, Y. Zou, T. Gui, Q. Zhang, X. Qiu, X. Huang, Ziyi Wu, Y. G. Jiang - 2025

1 paper in library cites

M. Mohammadi, Yiwei Li, J. Lo, W. Yip - 2025

1 paper in library cites

T. L. S. D. Chezelles, M. Gasse, A. Lacoste, M. Caccia, A. Drouin, L. Boisvert, M. Thakkar, T. Marty, R. Assouel, S. O. Shayegan, L. K. Jang, X. H. Lu, O. Yoran, D. Kong, F. F. Xu, Siva Reddy, Graham Neubig, Q. Cappart, Ruslan Salakhutdinov, N. Chapados - 2025

1 paper in library cites

Yiheng Xu, Di Lu, Z. Shen, J. Wang, Zhengtao Wang, Y. Mao, Caiming Xiong, Tao Yu - 2024

1 paper in library cites

A. Team - 2024

1 paper in library cites

A. Mitra, L. D. Corro, G. Zheng, S. Mahajan, D. Rouhana, A. Codas, Y. Lu, W. G. Chen, O. Vrousgos, C. Rosset - 2024

1 paper in library cites

Timo Schick, J. D. Yu, R. Dessi, Roberta Raileanu, M. Lomeli, Eric Hambro, Luke Zettlemoyer, N. Cancedda, T. Scialom - 2023

2 papers in library cite

Gloria Wang, Y. Xie, Y. Jiang, A. Mandlekar, C. Xiao, Yuxuan Zhu, L. Fan, A. Anandkumar - 2024

2 papers in library cite

L. Ning, Z. Liang, Zhejun Jiang, H. Qu, Y. Ding, W. Fan, X. Y. Wei, Stephen Lin, Haozhe Liu, P. S. Yu - 2025

1 paper in library cites

Ziru Chen, K. Liu, Q. Wang, Wenxuan Zhang, Joseph Liu, D. Lin, K. Chen, F. Zhao - 2024

1 paper in library cites

Yueqi Song, W. Xiong, Xuandong Zhao, D. Zhu, Wenhao Wu, K. Wang, Chun-Liang Li, W. Peng, Shanda Li - 2024

1 paper in library cites

J. Zhang, T. Lan, R. Murthy, Ze Liu, W. Yao, M. Zhu, J. Tan, T. Hoang, Ze Liu, L. Yang - 2024

1 paper in library cites

C. Rawles, A. Li, D. Rodriguez, O. Riva, T. Lillicrap - 2023

1 paper in library cites

A. Paullada, I. D. Raji, E. M. Bender, E. Denton, A. Hanna - 2021

1 paper in library cites

D. Zha, Z. P. Bhat, K. H. Lai, Fan Yang, Zhejun Jiang, S. Zhong, X. Hu - 2025

1 paper in library cites

Xinpeng Wang, Yanru Chen, Lifan Yuan, Y. Z. Zhang, Yiwei Li, H. Peng, H. Ji - 2024

1 paper in library cites

Apurva Gandhi, Graham Neubig - 2025

1 paper in library cites

I. M. Putrama, P. Martinek - 2024

1 paper in library cites

N. Chowdhury, J. Aung, C. J. Shern, O. Jaffe, D. Sherburn, G. Starace, E. Mays, R. Dias, M. Aljubeh, M. Glaese - 2024

1 paper in library cites

A. Golubev, S. Polezhaev, K. Zainullina, M. Trofimova, I. Badertdinov, Y. Anapolskiy, D. Litvintseva, S. Karasik, F. Fisin, S. Skvortsov - 2024

1 paper in library cites

E. Bhardwaj, H. Gujral, S. Wu, C. Zogheib, T. Maharaj, C. Becker - 2024

1 paper in library cites

Xiang Deng, Y. Gu, Bo Zheng, S. Chen, S. Stevens, B. Wang, Huan Sun, Yu Su - 2023

1 paper in library cites

D. Mueller, Mark Dredze, N. Andrews - 2024

1 paper in library cites

S. Murty, H. Zhu, D. Bahdanau, Christopher D. Manning - 2024

1 paper in library cites

R. Kapoor, Y. P. Butala, M. Russak, J. Y. Koh, K. Kamble, W. Alshikh, Ruslan Salakhutdinov - 2024

1 paper in library cites

T. Zheng, G. Zhang, T. Shen, Xiaodong Liu, Bill Yuchen Lin, J. Fu, Weizhu Chen, Xiang Yue - 2024

1 paper in library cites

Xinpeng Wang, Boxuan Li, Yueqi Song, F. F. Xu, X. Tang, Mingchen Zhuge, J. Pan, Yueqi Song, Boxuan Li, J. Singh, H. H. Tran, F. Li, R. Ma, M. Zheng, B. Qian, Y. Shao, Niklas Muennighoff, Y. Z. Zhang, B. Hui, Junyang Lin, R. Brennan, H. Peng, H. Ji, Graham Neubig - 2025

1 paper in library cites

M. Hu, Y. Zhou, W. Fan, Y. Nie, B. Xia, T. Sun, Z. Ye, Z. Jin, Yiwei Li, Qinlang Chen, Zhengyou Zhang, Yuzhi Wang, Q. Ye, B. Ghanem, P. Luo, G. Li - 2025

1 paper in library cites

H. Li, L. Ding, M. Fang, D. Tao - 2024

1 paper in library cites

Tianyue Ou, F. F. Xu, Aman Madaan, Joseph Liu, R. Lo, A. Sridhar, S. Sengupta, Dan Roth, Graham Neubig, Shuyan Zhou - 2024

1 paper in library cites

T. Masterman, S. Besen, M. Sawtell, A. Chao - 2024

1 paper in library cites

K. Xu, Y. Kordi, T. Nayak, A. Asija, Yuzhi Wang, K. Sanders, A. Byerly, J. Zhang, B. V. Durme, Daniel Khashabi - 2024

1 paper in library cites

Suhas Kotha, J. M. Springer, A. Raghunathan - 2024

1 paper in library cites

X. H. Lu, Z. Kasner, Siva Reddy - 2024

1 paper in library cites

S. Yao, H. Chen, Jihan Yang, K. Narasimhan - 2022

1 paper in library cites

J. Zhang, T. Lan, M. Zhu, Ze Liu, T. Q. Hoang, S. Kokane, W. Yao, J. Tan, A. Prabhakar, H. Chen, Ze Liu, Y. Feng, T. M. Awalgaonkar, R. N. Rithesh, Ziru Chen, Runxin Xu, J. C. Niebles, S. Heinecke, Haiming Wang, Silvio Savarese, Caiming Xiong - 2025

1 paper in library cites

Cited by

0

papers in your library

Cites

27

papers in your library

Read

on April 16, 2026

Your review

Tags

ICLR2026

Paper Aliases

No aliases