AutoGen_Studio-12
AutoGen_Studio-12
If everything looks good, that completes our task. If you need any
Message
further assistance or modifications, please let me know.
Hydrate workflow Results (7 files) Total messages
specifications into AutoGen Userproxy
agents and run tasks Groupchat manager
Content
Image Generator
autogenstudio.cli Quality Assurance
0 5 10 15 20
Command Line
Seattle_Weather_Childrens_Book.pdf Tool call Success Failure
Figure 2: AUTO G EN S TUDIO provides a backend api (web, python, cli) and a UI which implements a playground
(shown), build and gallery view. In the playground view, users can run tasks in a session based on a workflow. Users
can also observe actions taken by agents, reviewing agent messages and metrics based on a profiler module.
how often agents use tools and the status of tool declarative (JSON), users can also easily export,
use (success or failure), for each agent. version and reshare them.
4.1.3 Deploying Workflows 4.2 Backend API - Web, Python, and
AUTO G EN S TUDIO enables users to export work- Command Line
flows as a JSON configuration file. An exported The backend API comprises three main compo-
workflow can be seamlessly integrated into any nents: a web API, a Python API, and a command-
Python application (listing 2), executed as an API line interface. The web API consists of REST
endpoint using the AUTO G EN S TUDIO command endpoints built using the FastAPI library2 , sup-
line interface (figure 2a), or wrapped in a Docker porting HTTP GET, POST, and DELETE methods.
container for large-scale deployment on various These endpoints interact with several key classes:
platforms (Azure, GCP, Amazon, etc.). A DBM anager performs CRUD (Create, Read,
Update, Delete) operations on various entities such
from autogenstudio import as skills, models, agents, memory, workflows, and
WorkflowManager sessions. The W orkf lowM anager class handles
wm = WorkflowManager ( " workflow . the ingestion of declarative agent workflows, con-
json " ) verts them into AUTO G EN agent objects, and exe-
wm . run ( message = " What is the cutes tasks (see listing 2). A P rof iler class parses
height of the Eiffel Tower " ) agent messages to compute metrics. When a user
initiates a task within a session, the system retrieves
the session history, instantiates agents based on
listing 2: Workflows can be imported in python apps. their serialized representations from the database,
executes the task, streams intermediate messages to
4.1.4 Template Gallery the UI via websocket, and returns the final results.
AUTO G EN S TUDIO also provides a command-line
The UI also features a gallery view - a repository
interface with utilities for launching the bundled UI
of components (skills, models, agents, workflows)
and running exported workflows as API endpoints.
that users can import, extend, and reuse in their own
2
workflows. Since each component specification is FastAPI: https://fastapi.tiangolo.com/
AutoGen Studio GitHub Issue Visualization (UMAP)
5 Usage and Evaluation
Issues with API Keys,
Model Configuration, and
In this project, we have adopted an in-situ, iterative Local Server Connections
(27)
evaluation approach. Since its release on GitHub Issues with AutoGen
Studio: Docker access,
(5 months), the AUTO G EN S TUDIO package has validation errors, and
compatibility (17)
Issues with Autogen
been installed over 200 K times and has been itera- Studio: Skills not
updating, Code execution,
tively improved based on feedback from usage (> and Group Chat (21)
135 GitHub issues). Issues highlighted several user
AutoGen Studio 2 Issues with Group Chat
pain points that were subsequently addressed in- Compatibility, API Workflow, Agent Creation,
Issues, and Documentation and Model Changes (18)
cluding: (a) challenges in defining, persisting, and Updates (10)
reusing components, resolved by implementing a Accessibility and
Multimodality in Autogen
Studio, UI Improvements,
database layer; (b) difficulties in authoring compo- Group Chat Support, and
Test Suite (14)
nents, resolved by supporting automated tool gener-
ation from descriptions and integrating an IDE for AutoGen Studio Feature AutoGen Studio: Database
Requests: Workflow Implementation, Custom
Sharing, File Uploads, UI Configurations, and
editing tools; (c) frustrations caused by components Improvements, and Model Performance Enhancements
Testing (14) (14)
failing during end-to-end tests, addressed by incor-
porating a test button for components (e.g.,models)
and workflows in the build view. Figure 3 displays Figure 3: Plot of GitHub issues (n = 8 clusters) from
a plot of all AUTO G EN S TUDIO issues. Each point the AUTO G EN S TUDIO repo. User feedback ranged
represents an issue, based on an embedding of its from support with workflow authoring tools (e.g., the
text (title + body) using OpenAI’s text-embedding- ability configure and test models) to general installation.
3-large model. The embeddings were reduced to
two dimensions using UMAP, clustered with K- workflow, where entities are first defined and per-
Means (k = 8), and cluster labels generated using sisted independently, and then composed ultimately
GPT-4 (grounded on 10 samples from its centroid). into multi-agent workflows, provides a good de-
Finally, in Appendix A, we demonstrate how AU - veloper experience. This includes providing tools
TO G EN S TUDIO can effectively be used to support to support authoring entities e.g., the ability de-
an engineer persona in rapidly prototyping, testing, fine and test models, an IDE for generating/editing
and iteratively debugging a MULTI - AGENT work- tools (code), and a a canvas-based visual layout
flow, and deploying it as an API endpoint to address of workflows with drag-and-drop interaction for
a concrete task (generating books). associating entities in the workflow.
References
Harrison Chase. 2022. LangChain. Github.