issues Search Results · language:Edge language:Python linked:pr language:JavaScript language:Java language:PHP
Filter by
6.2M results
The agent-eval compare command exists but is not wired into CI. CI should fail when quality regresses against a
committed baseline, and always upload the run s reports.
Goals
- Compare current eval ...
ci
Fix tests
I cleaned up some functions and removed some redundant casting, but now the tests are failing. Fix the tests
Context
- Muay Thaiger: PvP Idle Fighting game
- Cleanup
When tasks in one suite use different trial counts, the suite-level k (a single max) mislabels pass@k / pass^k.
Goals
- Record each task s actual k.
- Report a k range (e.g. pass@2..4) instead of ...
bug
Graders, adapters, and reporters are registered by hand in each package s __init__.py. External packages can t add their
own without editing this repo.
Goals
- Discover plugins via Python importlib.metadata ...
enhancement
Problem
starter-kit/SESSION_RUNNER.md step 3C: Document Learnings instructs:
Update the workstream document and/or the Learnings table below:
…pointing sessions at SESSION_RUNNER s own Learnings (added ...
Suites can only be authored as a single YAML document. Real datasets, logs, and production traces are far easier to emit
as JSONL (one task per line), so the harness should ingest them directly.
Goals ...
enhancement
Updated moves
Add a staminaCost property to each move, and make the use move function take the fighter as well as the opponent.
Context
- Muay Thaiger: PvP Idle Fighting game
- FIGHT!
...
Problem
ensure_collection is annotated with idempotentHint=True, readOnlyHint=False but doesn t explicitly set
destructiveHint=False. Since it only installs to a temp directory and never deletes, making ...
enhancement
Problem
The find_collection prompt was added in #24 but the README Prompts table was not updated.
Fix
Add row to README Prompts table:
| `find_collection` | Guide through search, install, and explore ...
documentation
Problem
skills.py:81 sets execute permissions on generated scripts with S_IEXEC | S_IXGRP | S_IXOTH, making them executable by
all users. On shared/multi-user systems this is unnecessarily permissive. ...
bug
security

Learn how you can use GitHub Issues to plan and track your work.
Save views for sprints, backlogs, teams, or releases. Rank, sort, and filter issues to suit the occasion. The possibilities are endless.Learn more about GitHub IssuesProTip! Restrict your search to the title by using the in:title qualifier.