QA is really two tasks, run via mode:
- Vision-QA (mode: img) — rendered views + a numeric question → a number
- Code-QA (mode: code) — CadQuery code + a numeric question → a number
The paper reports ...
Summary
- Switch windows-latest → ubuntu-latest (Linux runners bill at 1× vs 2× for Windows)
- Remove push: branches: [main] trigger — CI now runs on PRs only, not again after merge
- Replace PowerShell-specific ...
This pull request was generated by the mq tool
[test]
flake rate: 0.1
logical conflict every: 100
sleep for: 600s
close stale after: 48 hours
[pullrequest]
requests per hour: 60
target branch: main
...
What was done
- tools/nav_v2_deal_api_smoke_test.mjs now accepts NAV_V2_ACTION for the read smoke test.
- Supported read actions:
- get_deal_card (the default);
- get_deal_card_lite.
- Direct RPC comparison ...
What does this PR do?
This PR documents the second formal evaluation baseline run (2026-06-27) against gpt-5.5, expands the evaluation suite
with six new test cases (OE010-OE015), and significantly revamps ...
Bumps org.apache.tomcat.embed:tomcat-embed-core from 9.0.112 to 11.0.23.
dependencies
Refactor AirCompSim from hard-coded experiment sweeps in main.py into a reusable configuration and runner layer, add a
local Streamlit dashboard, and fix several runtime/result-correctness issues found ...
Summary
- Adds a README.md with a short project description and an overview of the scripts
Test plan
- [ ] README renders correctly on GitHub
Summary
- Replace unique SEC name matching with deterministic SEC candidate ranking and ambiguity handling.
- Treat LLM-extracted ticker, CIK, and legal-name values as verification hints, not authoritative ...
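The ranking idea in this summary can be illustrated with a minimal sketch. Everything below is an assumption made for illustration, not the PR's actual code: the `Candidate` fields, the hint weights (CIK 3, ticker 2, legal name 1), the CIK tie-break, and the placeholder identifiers are all hypothetical. It shows LLM-extracted values used only as scoring hints, a stable sort so the order never depends on input order, and a tie at the top reported as ambiguous rather than silently resolved.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Candidate:
    # Hypothetical record shape; the real PR may store different fields.
    cik: str
    ticker: str
    legal_name: str


def score(candidate, hints):
    """Count agreement between a candidate and LLM-extracted hints.

    Hints only add points; a missing or wrong hint never rules a candidate out.
    The weights are illustrative assumptions.
    """
    points = 0
    if hints.get("cik") and hints["cik"].lstrip("0") == candidate.cik.lstrip("0"):
        points += 3
    if hints.get("ticker") and hints["ticker"].upper() == candidate.ticker.upper():
        points += 2
    if hints.get("legal_name") and hints["legal_name"].casefold() == candidate.legal_name.casefold():
        points += 1
    return points


def rank(candidates, hints):
    """Order by score (descending), then by CIK so ties sort deterministically."""
    return sorted(candidates, key=lambda c: (-score(c, hints), c.cik))


def resolve(candidates, hints):
    """Return (candidate, status); a tie for first place is reported as ambiguous."""
    ranked = rank(candidates, hints)
    if not ranked:
        return None, "no_candidates"
    if len(ranked) > 1 and score(ranked[0], hints) == score(ranked[1], hints):
        return None, "ambiguous"
    return ranked[0], "resolved"


# Placeholder data, not real SEC identifiers.
first = Candidate("0000000001", "AAA", "Example One Inc.")
second = Candidate("0000000002", "BBB", "Example Two Corp.")
match, status = resolve([second, first], {"ticker": "aaa"})
```

With the ticker hint `"aaa"`, only `first` scores points, so `resolve` returns it with status `"resolved"`; with no hints at all, both candidates tie and the status is `"ambiguous"`.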