Hi ca
Yes, rate limits could be causing the timeouts when uploading large PDFs to Azure OpenAI.
Here are some steps:
- Try smaller chunks (e.g., 500–1000 tokens) instead of large sections.
- You can upload files up to 8 GB in total, but larger files need the Uploads API. It is also good practice to break the file into parts smaller than 512 MB.
- Increase quota: submit a quota increase request via the Azure Quota Increase Portal.
- Azure OpenAI Studio (Playground) supports larger uploads and longer processing times than the API.
- Use the Metrics tab in the Azure portal to track token and request usage.
- OpenAI models have a maximum token limit per request. If the text from your PDF exceeds it, the request may trigger rate limits.
- The GPT-4.1 model has a limit of 1 million tokens per minute and 1,000 requests per minute for the default tier.
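The chunking and rate-limit points above can be sketched in a few lines of Python. This is only a rough illustration, not an official Azure sample: the function names `chunk_text` and `min_seconds_between_requests` are made up for this example, the "about 4 characters per token" ratio is an assumed estimate (use a real tokenizer for exact counts), and the default limits are the GPT-4.1 figures quoted above, which may differ for your deployment.

```python
# Assumption: roughly 4 characters per token for English text.
# This is a rough estimate only; a real tokenizer gives exact counts.
CHARS_PER_TOKEN = 4


def chunk_text(text, max_tokens=800):
    """Split text into word-aligned chunks of roughly max_tokens tokens or fewer.

    A single word longer than the limit becomes its own oversized chunk.
    """
    max_chars = max_tokens * CHARS_PER_TOKEN
    chunks, current, current_len = [], [], 0
    for word in text.split():
        # +1 accounts for the space joining this word to the previous one
        added = len(word) + (1 if current else 0)
        if current and current_len + added > max_chars:
            chunks.append(" ".join(current))
            current, current_len = [word], len(word)
        else:
            current.append(word)
            current_len += added
    if current:
        chunks.append(" ".join(current))
    return chunks


def min_seconds_between_requests(tokens_per_request,
                                 tpm_limit=1_000_000,
                                 rpm_limit=1_000):
    """Smallest delay that stays under both the tokens-per-minute
    and requests-per-minute limits (defaults are assumed values)."""
    max_requests_by_tokens = tpm_limit / tokens_per_request
    allowed_per_minute = min(rpm_limit, max_requests_by_tokens)
    return 60.0 / allowed_per_minute


if __name__ == "__main__":
    sample = "word " * 5000
    parts = chunk_text(sample, max_tokens=500)
    print(len(parts), "chunks")
    print(min_seconds_between_requests(10_000), "seconds between requests")
```

Sleeping for the returned delay between chunk requests (for example with `time.sleep`) is one simple way to stay under the quota; retrying with backoff when a rate-limit response comes back is a common complement.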
Please refer to the link below: quotas-limits
Thank you.