Helium Trades released an open benchmark that may complement ValueBench on value consistency and political lean.
Helium Model Worldview Benchmark (304 prompts):
- Stated priorities vs forced tradeoffs
- Name-swap and cue-swap consistency
- 50 balanced political Likert items
- Safety refusal profiles across 12 models
Dataset: https://huggingface.co/datasets/HeliumTrades/helium-model-worldview-benchmark
Overview: https://heliumtrades.com/benchmarks/
Helium Trades released an open benchmark that may complement ValueBench on value consistency and political lean.
Helium Model Worldview Benchmark (304 prompts):
Dataset: https://huggingface.co/datasets/HeliumTrades/helium-model-worldview-benchmark
Overview: https://heliumtrades.com/benchmarks/