If you’re interested in a simple way to improve throughput when running openfe quickrun on GPUs, take a look at this recent blog post by Darren Hsu, David Clark, and Janet Paulsen on NVIDIA’s technical blog:

https://developer.nvidia.com/blog/maximizing-openmm-molecular-dynamics-throughput-with-nvidia-multi-process-service/
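
For anyone who wants to try this, here is a rough sketch of what sharing one GPU between several quickrun jobs might look like. It assumes the NVIDIA MPS control daemon is already running on the node (typically started with `nvidia-cuda-mps-control -d`) and that each transformation has its own JSON file. The file names, the `results` directory layout, and the helper functions `build_quickrun_commands` and `run_concurrently` are made up for illustration and are not part of openfe. How many jobs to run at once is something to benchmark on your own hardware; the blog post covers that.

```python
import shlex
import subprocess
from pathlib import Path


def build_quickrun_commands(transformations, results_dir):
    """Return one `openfe quickrun` argument list per transformation JSON file.

    Hypothetical helper: the output layout (one results file and one working
    directory per transformation) is an assumption, not an openfe convention.
    """
    results_dir = Path(results_dir)
    commands = []
    for tf in map(Path, transformations):
        commands.append([
            "openfe", "quickrun", str(tf),
            "-o", str(results_dir / f"{tf.stem}_results.json"),  # results file
            "-d", str(results_dir / tf.stem),                    # working directory
        ])
    return commands


def run_concurrently(commands, dry_run=True):
    """Launch all commands at once and wait for them to finish.

    With MPS active, the processes can share a single GPU concurrently.
    With dry_run=True the commands are only printed, nothing is started.
    """
    if dry_run:
        for cmd in commands:
            print(shlex.join(cmd))
        return []
    procs = [subprocess.Popen(cmd) for cmd in commands]
    return [p.wait() for p in procs]  # exit codes, in launch order


if __name__ == "__main__":
    # Placeholder file names; replace with your own transformation files.
    cmds = build_quickrun_commands(["rbfe_a.json", "rbfe_b.json"], "results")
    run_concurrently(cmds, dry_run=True)
```

Setting `dry_run=False` actually starts the jobs. When they finish, the MPS daemon can be stopped with `echo quit | nvidia-cuda-mps-control`.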