Welcome to the Microsoft Q&A Platform! Thank you for asking your question here.
We apologize for the inconvenience caused.
As we understand that, your application encountered a service interruption with Cosmos DB on 2025-02-13 between 3:00 AM and 3:30 AM Pacific Time. Unfortunately, I couldn't find any specific internal documentation or emails regarding an outage during that time.
However, I can provide some general information and troubleshooting steps that might help.
- Verify Substatus Code: The error message you received includes a substatus code (20001), which indicates client-side connectivity issues. This can be due to network conditions or transient connectivity problems.
- Check Network Configuration: Ensure that your network configuration is correct and that all required ports are enabled. Transient connectivity issues can cause timeouts and can be safely retried following the design recommendations.
- Service Health Dashboard: Although the Service Health dashboard didn't show any events at the time of the outage, it's always a good idea to check for any ongoing issues. You can also monitor the Azure status page for updates. https://azure.status.microsoft/en-gb/status
- Retry Policies: With Server Side Retry enabled, you can increase the retry parameters to handle transient errors better. Ensure that your application design follows the guide for designing resilient applications with Azure Cosmos DB SDKs.
To receive more timely alerts, you can consider the following options:
- Azure Monitor Alerts: Configure Azure Monitor to send alerts based on specific metrics or logs. You can set up alerts to trigger immediately when certain conditions are met, rather than waiting for hourly Application Insights alerts.
- Custom Alerts: Implement custom alerting mechanisms within your application to detect and notify you of service interruptions in real-time.
Please refer to the below mentioned link for more information:
https://learn.microsoft.com/en-us/azure/cosmos-db/nosql/troubleshoot-service-unavailable
I hope, This response will address your query and helped you to overcome on your challenges.
If this answers your query, do click Accept Answer and Yes for was this answer helpful. And, if you have any further query do let us know.