Hi there Hannah
Thanks for using QandA platform
Azure OpenAI's quota feature enables assignment of rate limits to your deployments, up-to a global limit called your “quota”. Quota is assigned to your subscription on a per-region, per-model basis in units of Tokens-per-Minute (TPM). Your subscription is onboarded with a default quota for most models.
reference - https://learn.microsoft.com/en-us/answers/questions/2124071/azure-ai-studio
Have a look at this similar question
If this helps kindly accept the answer thanks much.