Popular repositories Loading
-
leaderboard
leaderboard PublicOpen execution logs, trajectories, and results from evaluation runs on the Χ-Bench task.
Python 1
Repositories
Showing 2 of 2 repositories
- chi-bench Public
Χ-Bench: Can AI Agents Automate End-to-End, Long-Horizon, Policy-Rich Healthcare Workflows?
actava-ai/chi-bench’s past year of commit activity - leaderboard Public
Open execution logs, trajectories, and results from evaluation runs on the Χ-Bench task.
actava-ai/leaderboard’s past year of commit activity
People
This organization has no public members. You must be a member to see who’s a part of this organization.
Top languages
Loading…
Most used topics
Loading…