Amazon’s cloud infrastructure, AWS, is coming back online after a worldwide outage that crippled dozens of popular apps and online services. The disruption, which began in the late‑night hours of June 28, left millions of users on platforms like DoorDash, FTX, and other major web services unable to access their data.
What caused the problem?
Amazon’s systems reported an internal error in one of its data centers that triggered a cascade of failures across the public cloud network. The root cause was traced to a mis‑configured software update that caused certain servers to shut down unexpectedly.
How many businesses were hit?
More than 200 companies reported outages, ranging from food‑delivery platforms to finance apps and even a few large e‑commerce sites. According to independent monitoring sites, roughly 35 % of the global AWS traffic was down for several hours.
Where did the outages happen?
The most severe impact was felt in the US, Europe, and Asia Pacific. Users in the eastern US and parts of Canada experienced the longest downtime, while customers in the UK and Australia saw a quicker recovery.
What did Amazon do?
Amazon launched a full‑scale incident response. Engineers rolled back the software change, restored system health, and shifted workloads to healthy regions. After about eight hours, the majority of services returned to normal.
What’s next for AWS?
Amazon said it will revise its deployment procedures to add extra safety checks before updates. Customers were offered “thank‑you” credits for any downtime they experienced. The company also announced a new “resilience‑first” policy that is aimed at preventing a repeat of this event.
Who should watch this?
- Businesses that rely on cloud services for critical operations
- Startups that use AWS for API and database hosting
- Developers who want to understand best‑practice fallback strategies
Who can help you?
If you run a site on AWS, double‑check your health‑check alerts and consider adding redundancy in multiple regions. AWS offers a “multi‑region failover” feature that can keep your apps running even if one data center goes offline.
The incident is a reminder that even the biggest tech players can hit a snag. Still, the swift recovery shows the robustness of modern cloud infrastructures and the importance of quick incident response. Whether you’re a casual user or a tech company, staying informed about these incidents helps you prepare for any future hiccups.
Stay informed on all the latest news, real-time breaking news updates, and follow all the important headlines in world News on Latest NewsX. Follow us on social media Facebook, Twitter(X), Gettr and subscribe our Youtube Channel.