Create a light annotation template for the 75–100 document validation corpus.
This is not full Stage 4A annotation.
Expected files:
data/corpus/v4-validation-light-annotation-template.csv
schemas/v4-validation-light-annotation.schema.json ...
corpus
metadata
pipeline
v4
Update the reliability sampling strategy so Stage 4H human reliability and Stage 4M model stress testing can sample from
the expanded v4 corpus.
Proposed script:
scripts/corpus/generate-reliability-sample-update.js ...
corpus
methodology
reliability
v4
Create a report explaining how the v4 expansion affects the project’s claims.
Proposed script:
scripts/corpus/generate-corpus-expansion-impact-report.js
Inputs:
data/corpus/corpus-v1-inventory.json ...
corpus
docs
publication-package
v4
Create a report showing corpus coverage by period, genre, audience, rhetorical function, and research relevance.
Proposed script:
scripts/corpus/generate-corpus-coverage-report.js
Inputs:
data/corpus/corpus-v4-core-inventory.json ...
corpus
docs
pipeline
v4
Create a script to validate that sentence IDs are unique, stable, and non-destructive.
Proposed script:
scripts/corpus/validate-sentence-ids.js
Inputs:
corpus/segmented/
corpus/segmented/v4-core/
data/corpus/corpus-v4-segmentation-manifest.json ...
corpus
sentence-ids
v4
validation
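A minimal sketch of the kind of check this issue asks for, under stated assumptions. The record shape `{ id, text }`, the function name `checkSentenceIds`, and the reading of "stable" and "non-destructive" (every previously issued ID still exists and still points at the same text) are hypothetical; the issue snippet is truncated and does not define them.

```javascript
// Hypothetical helper, not the project's actual script.
// previous: sentence records from an earlier segmentation run (assumed shape { id, text }).
// current:  sentence records from the new segmentation run (same assumed shape).
function checkSentenceIds(previous, current) {
  // Uniqueness: collect any ID that appears more than once in the current run.
  const seen = new Set();
  const duplicates = new Set();
  for (const { id } of current) {
    if (seen.has(id)) duplicates.add(id);
    seen.add(id);
  }

  // Stability / non-destructiveness (assumed meaning): every earlier ID must
  // still exist and still refer to the same sentence text.
  const currentText = new Map(current.map((s) => [s.id, s.text]));
  const missing = [];
  const changed = [];
  for (const { id, text } of previous) {
    if (!currentText.has(id)) missing.push(id);
    else if (currentText.get(id) !== text) changed.push(id);
  }

  return { duplicates: [...duplicates], missing, changed };
}
```

A run would pass when all three returned arrays are empty. Reading the records from `corpus/segmented/` and the segmentation manifest is left out, since their formats are not shown in the issue.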
Create or update segmentation scripts for the v4 corpus.
Proposed script:
scripts/corpus/segment-corpus-v4.js
Inputs:
corpus/normalized/v4-core/
data/corpus/corpus-v4-core-inventory.json
Outputs: ...
corpus
pipeline
segmentation
sentence-ids
v4
Create or update a script to ingest raw v4 corpus documents into the project structure.
Proposed script:
scripts/corpus/ingest-corpus-documents.js
Inputs:
corpus/raw/v4-core/
corpus/raw/v4-validation/ ...
corpus
pipeline
v4
Problem
The ui.localStorageTrigger UI option is currently being persisted in local storage as part of the user interface
settings, but the code that triggers the user interface doesn't look at this saved ...
Add raw text files for the v4 core additions.
Expected directory:
corpus/raw/v4-core/
File naming convention:
doc_###--short-title--yyyy-mm-dd.txt
Example:
doc_029--peoria-speech--1854-10-16.txt ...
corpus
provenance
v4
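The naming convention in this issue can be checked mechanically. The sketch below is one possible validator, not the project's own: the function name `parseRawFileName` is hypothetical, and it assumes that `###` means exactly three digits and that the short title consists of lowercase letters or digits joined by single hyphens, which the issue does not state explicitly.

```javascript
// Hypothetical pattern for doc_###--short-title--yyyy-mm-dd.txt.
// Assumptions: three-digit document number, lowercase hyphenated short title.
const RAW_NAME =
  /^doc_(\d{3})--([a-z0-9]+(?:-[a-z0-9]+)*)--(\d{4})-(\d{2})-(\d{2})\.txt$/;

// Returns the parsed parts of a conforming file name, or null if it does not conform.
function parseRawFileName(name) {
  const m = RAW_NAME.exec(name);
  if (!m) return null;
  const [, num, shortTitle, y, mo, d] = m;

  // Reject impossible calendar dates (for example 1854-02-30) by
  // round-tripping the components through a UTC Date.
  const date = new Date(Date.UTC(Number(y), Number(mo) - 1, Number(d)));
  const sameDate =
    date.getUTCFullYear() === Number(y) &&
    date.getUTCMonth() === Number(mo) - 1 &&
    date.getUTCDate() === Number(d);
  if (!sameDate) return null;

  return { docId: `doc_${num}`, shortTitle, date: `${y}-${mo}-${d}` };
}
```

With the example from the issue, `parseRawFileName("doc_029--peoria-speech--1854-10-16.txt")` yields the document ID `doc_029`, the short title `peoria-speech`, and the date `1854-10-16`.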
Create a script that validates provenance coverage.
Proposed script:
scripts/corpus/validate-corpus-provenance.js
Inputs:
corpus/provenance/source-authority-register.json
corpus/provenance/corpus-v4-provenance.json ...
corpus
provenance
v4
validation
