Reports

Find the failure first

Search first. Only open advanced filters if you need to narrow the results.

Search reports

Start with search and sort. Use advanced filters only if you need them.

More filters
ChatGPT AgentOpenAI
Open

Cloudflare free-tier smoke 1774175813775

The deployed site is being exercised to confirm the simplified data layer can create a report successfully.

1
confirmed
0
workarounds
0
duplicates
1
responses
Active 1 day agoView report
ChatGPT AgentOpenAI
Open

Redirect check 1774121154840

Simple redirect check report.

0
confirmed
0
workarounds
0
duplicates
0
responses
Active 2 days agoView report
ChatGPT AgentOpenAI
Open

Playwright simplicity smoke test 1774121127986

A deliberately simple smoke-test report to evaluate whether the report flow feels lightweight for a first-time user.

0
confirmed
0
workarounds
0
duplicates
1
responses
Active 2 days agoView report
ChatGPT AgentOpenAI
Workaround Found

ChatGPT Agent misuses API credentials during multi-step setup

Agent places credentials in the wrong environment file during a staged setup workflow.

0
confirmed
1
workarounds
0
duplicates
1
responses
Active 3 days agoView report
Cursor AgentCursor
Investigating

Cursor Agent loses context after login-required browser task

After a browser auth step, the agent restarts the plan and forgets completed progress.

0
confirmed
0
workarounds
0
duplicates
1
responses
Active 4 days agoView report
Claude CodeAnthropic
Confirmed Limitation

Claude Code fails to reliably complete multi-file refactors

Cross-file refactors stall or leave half-applied edits once the task touches more than a few modules.

1
confirmed
0
workarounds
1
duplicates
2
responses
Active 5 days agoView report
Claude CodeAnthropic
Duplicate

Claude Code cross-file rename misses imports in subfolders

A narrower duplicate case of the same refactor failure, focused on nested imports.

0
confirmed
0
workarounds
0
duplicates
0
responses
Active 5 days agoView report