AgentOhana: Design Unified Data and Training Pipeline for Effective Agent Learning
Abstract
Autonomous agents powered by large language models (LLMs) have garnered significant research attention. However, fully harnessing the potential of LLMs for agent-based tasks presents inherent challenges due to the heterogeneous nature of diverse data sources featuring multi-turn trajectories. In this paper, we introduce AgentOhana as a comprehensive solution to address these challenges. AgentOhana aggregates agent trajectories from distinct environments, spanning a wide array of scenarios. It meticulously standardizes and unifies these trajectories into a consistent format, streamlining the creation of a generic data loader optimized for agent training. Leveraging the data unification, our training pipeline maintains equilibrium across different data sources and preserves independent randomness across devices during dataset partitioning and model training. Additionally, we present xLAM-v0.1, a large action model tailored for AI agents, which demonstrates exceptional performance across various benchmarks.
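The abstract describes two core ideas: mapping heterogeneous multi-turn trajectories into one shared schema, and a loader that samples evenly across data sources with its own randomness. The sketch below illustrates those ideas only; the field names, schema, and sampling scheme are hypothetical and are not AgentOhana's actual format or implementation.

```python
import random

def unify_trajectory(raw_turns, source):
    """Map a raw multi-turn trajectory into a shared turn-based schema.

    The keys ("source", "turns", "role", "content") are illustrative
    placeholders, not the paper's actual unified format.
    """
    return {
        "source": source,
        "turns": [{"role": t["who"], "content": t["text"]} for t in raw_turns],
    }

def balanced_batches(datasets, batch_size, seed=0):
    """Yield batches that draw uniformly across data sources.

    `datasets` maps a source name to a list of unified trajectories.
    A per-call RNG (seeded independently, e.g. per device) keeps sampling
    decoupled from global random state, mirroring the abstract's point
    about independent randomness across devices.
    """
    rng = random.Random(seed)
    sources = list(datasets)
    while True:
        # Pick a source first, then a trajectory, so small sources are
        # not drowned out by large ones (equal weight per source).
        yield [rng.choice(datasets[rng.choice(sources)])
               for _ in range(batch_size)]
```

A usage sketch: convert each environment's trajectories with `unify_trajectory`, then iterate `balanced_batches` during training so every source contributes at the same expected rate regardless of its size.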
Community
The following similar papers were recommended by the Semantic Scholar API (via the automated Librarian Bot):
- Large Language Model Agent for Hyper-Parameter Optimization (2024)
- Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects (2024)
- An Interactive Agent Foundation Model (2024)
- Breaking Data Silos: Cross-Domain Learning for Multi-Agent Perception from Independent Private Sources (2024)
- Neeko: Leveraging Dynamic LoRA for Efficient Multi-Character Role-Playing Agent (2024)
Nice paper! Thanks for the insights.
Did you also compare two versions of xLAM, one trained on data in your standardized format and one trained on the original, heterogeneous formats? I am curious how much performance gain comes from standardizing the tool-usage trajectories.
Models citing this paper: 7
Datasets citing this paper: 0