Issue search results

44M results (769 ms)

anote-ai/Research-IntentSpecification
Proactive clarification agents: optimal question selection when intent is ambiguous

When users provide ambiguous input, LLMs typically either silently pick an interpretation or ask generic clarifying questions — both failure modes that compound downstream errors. Recent work (Ask-before-Plan, ...

ambiguity

clarification

research

jwj2002/agents
feat(usage): Claude transcript collector + activity-miner (§8 step 1)

Spec Reference: Source: specs/fleet-usage-monitor.md. Sections: §2.1, §3, §4.2, §4.3, §4.4, §4.5, §8 step 1. What to Build: Implement claude-config/scripts/usage_collector_claude.py, the Claude transcript ...

build-slice

fleet-usage-monitor

from-spec

tests

anote-ai/Research-IntentSpecification
Closing the intent gap: benchmarks for intent formalization quality

The intent gap — the distance between an informal natural language requirement and a checkable formal specification — is identified as the central challenge for reliable AI-generated code (arXiv:2603.17150), ...

formalization

high-impact

research

anote-ai/Research-CodeBench
Regression detection as a first-class benchmark task

Existing benchmarks like SWE-bench measure whether an agent can fix a reported bug, but not whether the fix introduces regressions — a core real-world concern given that maintenance consumes 70–90% of ...

regression

research

testing

adityait019/orchestrator
waw

anote-ai/Research-CodeBench
Cost-adjusted efficiency metrics: pass@t and token-budget-constrained evaluation

Agentic coding workflows now average 1–3.5M tokens per task including retries and self-correction loops, and reasoning models like DeepSeek-R1 consume substantially more tokens than non-reasoning counterparts ...

cost-efficiency

metrics

research

bhack/mini-eq
[Bug]: I've added many presets, now the saved preset loading failed

Before opening: [x] I searched existing issues for a similar report. [x] This is not a security vulnerability. Affected area: App UI. Mini EQ version: 0.8.6. Install method: Flathub. Linux distribution ...

bug

anote-ai/Research-CodeBench
Security-aware scoring: integrating vulnerability detection into pass@k

Current pass@k metrics treat any test-passing solution as correct, ignoring that 45% of AI-generated code samples fail OWASP Top 10 security tests and AI-generated code introduces 2.74× more vulnerabilities ...

high-impact

research

security

MillenniumDawn/Millennium-Dawn
[CRASH] Game crashes when USA declares war

Reported by mahdation422 via Discord on 2026-06-06 (backfilled). Tags: Crash. Describe the Crash: game crashes when I go to war as the USA. To Reproduce: See Discord thread for details. Screenshots: No screenshots ...

crash

from-discord

politics

p2pool-starter-stack/pithead
Dashboard: config to read workers that use a custom XMRig API token

Problem: The dashboard reads each worker's :8080/1/summary and authenticates a direct miner with only two schemes (build/dashboard/mining_dashboard/client/xmrig_client.py): 1. no token, or 2. Authorization: ...

dashboard

documentation

enhancement

security

setup




issues Search Results · language:Dune language:Python language:Java language:Java language:HTML language:CSS language:Java
