Support eval with deepspeed zero3

### System Info

```Shell
In-training simulation eval (eval_freq > 0 with a sim env) is not supported under parameter sharding (DeepSpeed ZeRO-3 or FSDP): each rank rolls out independently, so the sharded-param all-gathers in the eval forward desync across ranks and hang at NCCL. Use ZeRO-1/2 (params replicated), or run eval out-of-process on saved checkpoints.
  File "/fss/bot/OpenTau_main/src/opentau/scripts/train.py", line 1494, in <module>
```

### Information

- [x] One of the scripts in the src/opentau/scripts/ folder of OpenTau
- [ ] My own task or dataset (give details below)

### Reproduction

Simpley running cosmos3 on multi node with zero3

### Expected behavior

Should be able to run evals with zero3 sharding while training

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support eval with deepspeed zero3 #435

System Info

Information

Reproduction

Expected behavior

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Support eval with deepspeed zero3 #435

Description

System Info

Information

Reproduction

Expected behavior

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions