How to resolve Azure OpenAI rate limit errors when uploading large PDFs

ca 0 Reputation points
2025-06-11T01:08:26.2+00:00

I am attempting to use the OpenAI GPT-4.1 model to upload a large PDF document. Despite sending the document in several smaller chunks, timeouts are still occurring. Could this issue be related to request limits?

Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.

1 answer

  1. Saideep Anchuri 8,320 Reputation points Microsoft External Staff Moderator
    2025-06-11T01:42:13.43+00:00

    Hi ca

    Yes, rate limits could be causing the timeouts when uploading large PDFs to Azure OpenAI.

    Here are some steps to try:

    1. Try smaller chunks (e.g., 500–1000 tokens) instead of large sections.
    2. You can upload files up to 8 GB in total, but larger files need the Uploads API. It is also good practice to break the file into parts smaller than 512 MB.
    3. Increase quota: submit a quota increase request via the Azure quota increase portal.
    4. Azure OpenAI Studio (Playground) supports larger uploads and longer processing times than the API.
    5. Use the Metrics tab in the Azure portal to track token and request usage.
    6. OpenAI models have a maximum token limit per request. If your PDF, or a single chunk of it, exceeds this limit, it may trigger rate limits.
    7. The GPT-4.1 model has a limit of 1 million tokens per minute and 1,000 requests per minute for the default tier.
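
    As a rough illustration of steps 1 and 7, here is a minimal Python sketch that splits extracted PDF text into chunks under a token budget and estimates how far apart requests should be spaced to stay under the per-minute limits. The function names are hypothetical, and the words-to-tokens ratio of 1.3 is only an assumption standing in for a real tokenizer. The pacing estimate counts input tokens only, so treat it as a lower bound.

```python
from typing import List


def chunk_by_token_budget(text: str, max_tokens: int = 1000,
                          tokens_per_word: float = 1.3) -> List[str]:
    """Split text into chunks whose *estimated* token count is <= max_tokens.

    tokens_per_word is a rough assumption, not a real tokenizer; use the
    model's tokenizer if you need exact counts.
    """
    if max_tokens <= 0 or tokens_per_word <= 0:
        raise ValueError("max_tokens and tokens_per_word must be positive")
    words_per_chunk = max(1, int(max_tokens / tokens_per_word))
    words = text.split()
    return [" ".join(words[i:i + words_per_chunk])
            for i in range(0, len(words), words_per_chunk)]


def min_seconds_between_requests(tokens_per_request: int,
                                 tpm_limit: int, rpm_limit: int) -> float:
    """Smallest steady delay that respects both per-minute limits."""
    by_tokens = 60.0 * tokens_per_request / tpm_limit   # tokens-per-minute cap
    by_requests = 60.0 / rpm_limit                      # requests-per-minute cap
    return max(by_tokens, by_requests)


if __name__ == "__main__":
    sample = "word " * 2000
    chunks = chunk_by_token_budget(sample, max_tokens=1000)
    # Limits below are the default-tier figures quoted in step 7.
    delay = min_seconds_between_requests(1000, 1_000_000, 1_000)
    print(len(chunks), "chunks, wait at least", delay, "seconds between requests")
```

    Check the quota actually assigned to your deployment before relying on these numbers, since it may differ from the default tier.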

    Please refer to the link below: quotas-limits
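
    If requests still hit the limits, a common pattern is to retry with exponential backoff rather than resending immediately. The sketch below is one possible way to do that, not an official recipe: `RateLimited` is a hypothetical stand-in for whatever error your client raises on HTTP 429, and `send` is any function that performs one request.

```python
import random
import time
from typing import Callable, Optional, TypeVar

T = TypeVar("T")


class RateLimited(Exception):
    """Hypothetical stand-in for the error your client raises on HTTP 429."""

    def __init__(self, retry_after: Optional[float] = None):
        super().__init__("rate limited")
        self.retry_after = retry_after  # seconds suggested by the service, if any


def call_with_backoff(send: Callable[[], T], max_retries: int = 5,
                      base_delay: float = 1.0, max_delay: float = 60.0,
                      sleep: Callable[[float], None] = time.sleep) -> T:
    """Call send(); on RateLimited, wait and retry up to max_retries times."""
    for attempt in range(max_retries + 1):
        try:
            return send()
        except RateLimited as err:
            if attempt == max_retries:
                raise  # retries exhausted: surface the error to the caller
            if err.retry_after is not None:
                delay = err.retry_after  # prefer the service's own hint
            else:
                # 1s, 2s, 4s, ... capped at max_delay, plus up to 10% jitter
                delay = min(max_delay, base_delay * 2 ** attempt)
                delay += random.uniform(0, delay * 0.1)
            sleep(delay)
    raise RuntimeError("unreachable")
```

    In real code you would catch your SDK's own rate limit exception in place of `RateLimited` and pass the function that sends one chunk as `send`.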

    Thank you.

