Config
⚠️ Validation Errors:
BASE CONFIG
_
_
Edit the middle part. Prefix (parent config) and suffix (auto-increment) are fixed.
MODEL SETTINGS
Provider:
DATASET
Select from available datasets in datasets/ directory
Select a base config to view dataset fields
💡 Use field names like
{problem_description} in messages below
INITIAL MESSAGES
ENVIRONMENT SETTINGS
DATASET SETTINGS
0
10
Use -1 for unlimited turns
Show live LLM token streaming in the output (recommended for dev loop)
TOOLS
Select a base config to view and edit tool descriptions
ENVIRONMENT
View the full environment implementation (prepare_messages, get_tools, on_assistant_message, etc.)
Results
Loading results...
Welcome to Agent Dev Loop
Select a result to view or configure and launch a new agent evaluation.