issues Search Results · language:Dune language:JavaScript language:Java language:PHP language:Python language:JavaScript
Filter by
60.7M results
Create a script that validates provenance coverage.
Proposed script:
scripts/corpus/validate-corpus-provenance.js
Inputs:
corpus/provenance/source-authority-register.json
corpus/provenance/corpus-v4-provenance.json ...
corpus
provenance
v4
validation
Create validation scripts for corpus inventory files.
Proposed script:
scripts/corpus/validate-corpus-inventory.js
Inputs:
schemas/corpus-document.schema.json
schemas/corpus-inventory.schema.json
data/corpus/corpus-v4-core-inventory.json ...
corpus
pipeline
v4
validation
Hi, thanks for the great work on iMeanFlow.
We re training a iMeanFlow model at scale and noticed a pattern we wanted to ask about. Early training is clean, but as
training progresses the u (average-velocity) ...
Create a script that generates or normalizes v4 corpus inventory files.
Proposed script:
scripts/corpus/create-corpus-v4-inventory.js
Inputs:
data/corpus/corpus-v1-inventory.json
data/corpus/corpus-v4-selection-rationale.json ...
corpus
metadata
pipeline
v4
Create an inventory for the search-only reference corpus.
This layer is not fully annotated. It exists to support lexical recurrence, phrase searches, negative checks, and
context.
Expected files:
data/corpus/corpus-v4-reference-inventory.json ...
corpus
metadata
provenance
v4
Create the extended validation corpus inventory.
The validation corpus should include all v4-core documents plus additional documents used to test recurrence, negative
findings, genre coverage, and period ...
corpus
metadata
selection-rationale
v4
Create the v4 core corpus inventory.
The inventory should include all current v1 documents plus the 20 priority additions unless already present.
Expected files:
data/corpus/corpus-v4-core-inventory.json ...
corpus
metadata
selection-rationale
v4
Create a provenance schema and source authority register for document sources.
Expected files:
schemas/corpus-provenance.schema.json
corpus/provenance/source-authority-register.json
corpus/provenance/corpus-v4-provenance.json ...
corpus
provenance
v4
validation
Create a schema for corpus inventory files.
Expected file:
schemas/corpus-inventory.schema.json
Required top-level fields:
{
inventory_id : string ,
corpus_version : string ,
created_date ...
corpus
metadata
v4
validation
Create a schema for document-level metadata.
Expected file:
schemas/corpus-document.schema.json
Required fields:
{
doc_id : string ,
corpus_version : string ,
corpus_tier : v1 | v4-core ...
corpus
metadata
v4
validation

Learn how you can use GitHub Issues to plan and track your work.
Save views for sprints, backlogs, teams, or releases. Rank, sort, and filter issues to suit the occasion. The possibilities are endless.Learn more about GitHub IssuesProTip! Restrict your search to the title by using the in:title qualifier.