We incorporate an inefficient reference PyTorch implementation in gpt_oss/torch/design.py. This code uses standard PyTorch operators to point out the precise design architecture, with a small addition of supporting tensor parallelism in MoE so that the larger product can run with this code (e.Even though it might not be essentially the most visuall