You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The smoke-temporary-id workflow validates temporary ID functionality for creating parent-child issue hierarchies and cross-references. Currently, it performs all work (creating 3 issues, adding 1 comment) in a single agent context. This experiment tests whether decomposing the work into sub-agents—one per issue creation—reduces token consumption and improves reliability without sacrificing correctness.
Hypothesis
H0: Decomposing issue creation into sub-agents does not reduce effective token count or improve success rate compared to single-agent execution.
H1: Using sub-agents for per-item work reduces effective token count by 15-25% and improves success rate by maintaining smaller, focused contexts that are less prone to hallucination or incomplete JSON emission.
Experiment Configuration
Add the following experiments: block to the workflow frontmatter:
single_agent: Current behavior—all work performed in the main agent prompt (3 issue creations + 1 comment in one context)
sub_agents: Decompose work—launch 3 background task agents (one per issue) from a coordinator prompt, then add comment in main context after sub-agents complete
Workflow Changes Required
The experiment uses handlebars conditional blocks to switch between single-agent and sub-agent execution paths. Always use the value-comparison form{{#if experiments.sub_agent_strategy == "variant"}} (never the internal env-var form).
Before (current single-agent approach, lines 50-110):
# Smoke Test: Temporary ID Functionality
This workflow validates that temporary IDs work correctly for:
1. Creating parent-child issue hierarchies
2. Cross-referencing issues in bodies
3. Different temporary ID formats (3-8 alphanumeric characters)
**IMPORTANT**: Use the exact temporary ID format `aw_` followed by 3-8 alphanumeric characters (A-Za-z0-9).
## Test 1: Create Parent Issue with Temporary ID
Create a parent tracking issue with a temporary ID. Use a 6-character alphanumeric ID.
```json
{
"type": "create_issue",
"temporary_id": "aw_test01",
"title": "Test Parent: Temporary ID Validation",
"body": "This is a parent issue created to test temporary ID functionality.\n\nSub-issues:\n- #aw_test02\n- #aw_test03\n\nAll references should be replaced with actual issue numbers."
}
Test 2: Create Sub-Issues with Cross-References
[... rest of single-agent instructions ...]
**After** (with experiment conditional):
```markdown
# Smoke Test: Temporary ID Functionality
This workflow validates that temporary IDs work correctly for:
1. Creating parent-child issue hierarchies
2. Cross-referencing issues in bodies
3. Different temporary ID formats (3-8 alphanumeric characters)
**IMPORTANT**: Use the exact temporary ID format `aw_` followed by 3-8 alphanumeric characters (A-Za-z0-9).
{{#if experiments.sub_agent_strategy == "single_agent"}}
## Single-Agent Mode
Create all issues in this context.
### Test 1: Create Parent Issue
```json
{
"type": "create_issue",
"temporary_id": "aw_test01",
"title": "Test Parent: Temporary ID Validation",
"body": "This is a parent issue created to test temporary ID functionality.\n\nSub-issues:\n- #aw_test02\n- #aw_test03\n\nAll references should be replaced with actual issue numbers."
}
Test 2: Create Sub-Issue 1
{
"type": "create_issue",
"temporary_id": "aw_test02",
"parent": "aw_test01",
"title": "Sub-Issue 1: Test Temporary ID References",
"body": "This is sub-issue 1.\n\nParent: #aw_test01\nRelated: #aw_test03\n\nAll temporary IDs should be resolved to actual issue numbers."
}
Test 3: Create Sub-Issue 2
{
"type": "create_issue",
"temporary_id": "aw_test03",
"parent": "aw_test01",
"title": "Sub-Issue 2: Test Different ID Length",
"body": "This is sub-issue 2 with an 8-character temporary ID.\n\nParent: #aw_test01\nRelated: #aw_test02\n\nTesting that longer temporary IDs (8 chars) work correctly."
}
Launch 3 background task agents to create issues in parallel, then wait for completion.
You are the coordinator. Your job:
Launch 3 background task agents (one per issue) with complete temporary ID instructions
Wait for all 3 agents to complete (you will receive automatic completion notifications)
After all complete, add a summary comment to the parent issue using temporary ID aw_test01
Agent 1: Create Parent Issue
Launch a background task agent:
# Use the task tool with agent_type: task, mode: background# Prompt: "Create a parent issue with temporary ID aw_test01. Title: 'Test Parent: Temporary ID Validation'. Body: 'This is a parent issue created to test temporary ID functionality.\n\nSub-issues:\n- #aw_test02\n- #aw_test03\n\nAll references should be replaced with actual issue numbers.' Use safeoutputs create_issue with temporary_id: aw_test01."
Agent 2: Create Sub-Issue 1
Launch a second background task agent:
# Prompt: "Create a sub-issue with temporary ID aw_test02 and parent aw_test01. Title: 'Sub-Issue 1: Test Temporary ID References'. Body: 'This is sub-issue 1.\n\nParent: #aw_test01\nRelated: #aw_test03\n\nAll temporary IDs should be resolved to actual issue numbers.' Use safeoutputs create_issue with temporary_id: aw_test02 and parent: aw_test01."
Agent 3: Create Sub-Issue 2
Launch a third background task agent:
# Prompt: "Create a sub-issue with temporary ID aw_test03 and parent aw_test01. Title: 'Sub-Issue 2: Test Different ID Length'. Body: 'This is sub-issue 2 with an 8-character temporary ID.\n\nParent: #aw_test01\nRelated: #aw_test02\n\nTesting that longer temporary IDs (8 chars) work correctly.' Use safeoutputs create_issue with temporary_id: aw_test03 and parent: aw_test01."
After All Agents Complete
Once you receive completion notifications for all 3 agents, verify their success and add a summary comment.
{{/if}}
Final Step: Add Summary Comment
Regardless of strategy, add a comment to the parent issue summarizing test results:
{
"type": "add_comment",
"issue_number": "aw_test01",
"body": "## Test Results\n\n✅ Parent issue created with temporary ID `aw_test01`\n✅ Sub-issue 1 created with temporary ID `aw_test02` and linked to parent\n✅ Sub-issue 2 created with temporary ID `aw_test03` and linked to parent\n✅ Cross-references resolved correctly\n\n**Validation**: Check that:\n1. All temporary ID references (#aw_*) in issue bodies are replaced with actual issue numbers (#123)\n2. Sub-issues show parent relationship in GitHub UI\n3. Parent issue shows sub-issues in task list\n\nTemporary ID format validated: `aw_[A-Za-z0-9]{3,8}`"
}
🧪 Experiment Campaign: smoke-temporary-id
Workflow file:
.github/workflows/smoke-temporary-id.mdSelected dimension:
sub_agent_strategyTriggered by:
ab-testing-advisoron 2026-05-22Background
The
smoke-temporary-idworkflow validates temporary ID functionality for creating parent-child issue hierarchies and cross-references. Currently, it performs all work (creating 3 issues, adding 1 comment) in a single agent context. This experiment tests whether decomposing the work into sub-agents—one per issue creation—reduces token consumption and improves reliability without sacrificing correctness.Hypothesis
H0: Decomposing issue creation into sub-agents does not reduce effective token count or improve success rate compared to single-agent execution.
H1: Using sub-agents for per-item work reduces effective token count by 15-25% and improves success rate by maintaining smaller, focused contexts that are less prone to hallucination or incomplete JSON emission.
Experiment Configuration
Add the following
experiments:block to the workflow frontmatter:Variant descriptions:
single_agent: Current behavior—all work performed in the main agent prompt (3 issue creations + 1 comment in one context)sub_agents: Decompose work—launch 3 backgroundtaskagents (one per issue) from a coordinator prompt, then add comment in main context after sub-agents completeWorkflow Changes Required
The experiment uses handlebars conditional blocks to switch between single-agent and sub-agent execution paths. Always use the value-comparison form
{{#if experiments.sub_agent_strategy == "variant"}}(never the internal env-var form).Before (current single-agent approach, lines 50-110):
Test 2: Create Sub-Issues with Cross-References
[... rest of single-agent instructions ...]
Test 2: Create Sub-Issue 1
{ "type": "create_issue", "temporary_id": "aw_test02", "parent": "aw_test01", "title": "Sub-Issue 1: Test Temporary ID References", "body": "This is sub-issue 1.\n\nParent: #aw_test01\nRelated: #aw_test03\n\nAll temporary IDs should be resolved to actual issue numbers." }Test 3: Create Sub-Issue 2
{ "type": "create_issue", "temporary_id": "aw_test03", "parent": "aw_test01", "title": "Sub-Issue 2: Test Different ID Length", "body": "This is sub-issue 2 with an 8-character temporary ID.\n\nParent: #aw_test01\nRelated: #aw_test02\n\nTesting that longer temporary IDs (8 chars) work correctly." }{{/if}}
{\{#if experiments.sub_agent_strategy == "sub_agents"}}
Sub-Agent Mode
Launch 3 background
taskagents to create issues in parallel, then wait for completion.You are the coordinator. Your job:
taskagents (one per issue) with complete temporary ID instructionsaw_test01Agent 1: Create Parent Issue
Launch a background task agent:
Agent 2: Create Sub-Issue 1
Launch a second background task agent:
# Prompt: "Create a sub-issue with temporary ID aw_test02 and parent aw_test01. Title: 'Sub-Issue 1: Test Temporary ID References'. Body: 'This is sub-issue 1.\n\nParent: #aw_test01\nRelated: #aw_test03\n\nAll temporary IDs should be resolved to actual issue numbers.' Use safeoutputs create_issue with temporary_id: aw_test02 and parent: aw_test01."Agent 3: Create Sub-Issue 2
Launch a third background task agent:
# Prompt: "Create a sub-issue with temporary ID aw_test03 and parent aw_test01. Title: 'Sub-Issue 2: Test Different ID Length'. Body: 'This is sub-issue 2 with an 8-character temporary ID.\n\nParent: #aw_test01\nRelated: #aw_test02\n\nTesting that longer temporary IDs (8 chars) work correctly.' Use safeoutputs create_issue with temporary_id: aw_test03 and parent: aw_test01."After All Agents Complete
Once you receive completion notifications for all 3 agents, verify their success and add a summary comment.
{{/if}}
Final Step: Add Summary Comment
Regardless of strategy, add a comment to the parent issue summarizing test results:
{ "type": "add_comment", "issue_number": "aw_test01", "body": "## Test Results\n\n✅ Parent issue created with temporary ID `aw_test01`\n✅ Sub-issue 1 created with temporary ID `aw_test02` and linked to parent\n✅ Sub-issue 2 created with temporary ID `aw_test03` and linked to parent\n✅ Cross-references resolved correctly\n\n**Validation**: Check that:\n1. All temporary ID references (#aw_*) in issue bodies are replaced with actual issue numbers (#123)\n2. Sub-issues show parent relationship in GitHub UI\n3. Parent issue shows sub-issues in task list\n\nTemporary ID format validated: `aw_[A-Za-z0-9]{3,8}`" }Expected Outcome
#aw_test01replaced with actual numbers like#1234parentfieldSuccess Criteria: All 3 issues created, all temporary ID references resolved, parent-child relationships established.