Because the model isn't dynamic (it doesn't learn) it is stateless and can be scaled elastically.
How many users come with the same prompt?