which seems to have eventually been merged into vllm: https://docs.vllm.ai/en/latest/features/batch_invariance/
So you can get determinism locally. On a cursory search I wasn't able to find any LLM provider advertising determism; if you need it for research you might have to rent a dedicated GPU pod and run vllm there with the appropriate settings.