Keboola - Notice history

AWS EU (eu-central-1) - Operational

100% - uptime
October 2025 · 99.72%November · 99.96%December · 100.0%
October 2025
November 2025
December 2025

GCP EU (europe-west3) - Operational

100% - uptime
October 2025 · 99.69%November · 99.92%December · 100.0%
October 2025
November 2025
December 2025

AWS US (us-east-1) - Operational

100% - uptime
October 2025 · 99.04%November · 99.98%December · 100.0%
October 2025
November 2025
December 2025

GCP US (us-east4) - Operational

100% - uptime
October 2025 · 99.65%November · 99.98%December · 100.0%
October 2025
November 2025
December 2025

Azure NE (north-europe) - Operational

100% - uptime
October 2025 · 99.54%November · 99.98%December · 100.0%
October 2025
November 2025
December 2025

Notice history

December 2025

November 2025

Data Streams data are not propagated to Storage
  • Resolved
    Resolved

    Data Streams functionality has been fully restored as of 10:30 UTC.


    The Data Streams metadata layer on GCP EU experienced an outage starting 27 November at 22:05 UTC. We restored it from a snapshot taken at 22:00 UTC on 27 November.

    No existing Storage data was affected. The issue was isolated to incoming Data Streams data.


    For most customers, no data was lost. Data continued to be received during the outage and was automatically imported once the metadata layer was restored.

    Unfortunately, 3 customers were affected by data loss for the period between 27 November 22:00 UTC and 28 November 10:30 UTC. We will contact these customers directly.


    We sincerely apologize for the inconvenience this has caused. We're reviewing our systems to prevent similar issues in the future and improve the reliability of Data Streams service.

  • Update
    Update

    We are still investigating the root cause of the issue and working on restoring the Data Streams data from snapshots.

    Next update in 60 minutes.

  • Update
    Update

    Initial issues with Data Streams data propagation began yesterday, 27 November, at 22:32 UTC.

    The next update will be provided in 30 minutes.

  • Investigating
    Investigating

    We are currently investigating issues with Date Streams.

    Our team has been notified and is actively investigating.

    Next update in 30 minutes or when new information is available.

October 2025

Microsoft Azure Issue
  • Resolved
    Resolved

    Automatic scaling is turned back on and all platforms are fully operational and we'll continue monitoring all stacks. We're sorry for this inconvenience.

  • Update
    Update

    We're seeing full operations on all Azure stacks. We'll be enabling automatic scaling shortly and all jobs are currently processing as usual.

  • Monitoring
    Monitoring

    Azure is promising full mitigation within next four hours. We are closely monitoring the situation and once the issue is fixed we will resume all operations and turn autoscaling back on.

    At this stage, we anticipate full mitigation within the next four hours as we continue to recover nodes. This means we expect recovery to happen by 23:20 UTC on 29 October 2025. We will provide another update on our progress within two hours, or sooner if warranted.

  • Update
    Update

    We have limited autoscaling of all Azure stacks to limit cascading the issue to new nodes and thus limiting the number of waiting jobs. We have seen partial improvement in certain regions so our hopes are up.

  • Update
    Update

    Microsoft has updated the impact and states that network infrastructure in all Azure regions is affected.

  • Identified
    Identified

    These issues can propagate downstream to AWS and GCP stacks, where the symptoms can include:

    • Limited operations of some AI features (error explains, config description generator)

  • Investigating
    Investigating

    We are aware of performance degradation affecting our Azure infrastructure. This appears to be related to an upstream issue with Azure's announced Azure Portal Access Issues (https://azure.status.microsoft/en-us/status).

    Symptoms include:

    • Failures to prepare new nodes to accept workload, causing some jobs to be in the waiting state.

    Our team is monitoring the situation.

    We apologize for the disruption and will provide an update within 30 minutes or when new information is available.

Scheduled Partial Maintenance of all Azure stacks – October 25, 2025
  • Completed
    2025-10-25 at 06:15
    Completed
    2025-10-25 at 06:15
    Maintenance has completed successfully
  • In progress
    2025-10-25 at 05:50
    In progress
    2025-10-25 at 05:50
    Maintenance is now in progress
  • Planned
    2025-10-25 at 05:50
    Planned
    2025-10-25 at 05:50

    We would like to inform you about the planned maintenance of all Keboola stacks hosted on Azure.

    During the database upgrades there will be a short service outage on all Azure stacks, including all single-tenant stacks and Azure North Europe multi-tenant stack (connection.north-europe.azure.keboola.com). This will take place on Saturday, October 25, 2025 between 05:50 and 06:30 UTC.

    Effects of the Maintenance

    During the above period, services will be scaled down and the processing of jobs may be delayed. For a very brief period (at around 06:00 UTC) the service will be unavailable for up to 10 minutes and APIs may respond with a 500 error code. After that, all services will scale up and start processing all jobs. No running jobs, data apps, or workspaces will be affected. Delayed scheduled flows and queued jobs will resume after the maintenance is completed.

    Detailed Schedule

    • 05:50–06:00 UTC: processing of new jobs stops.

    • 06:00–06:15 UTC: service enhancement period.

    • 06:15 UTC: processing of jobs resumes.

Failing jobs on all stacks due to an outage in AWS US east region
  • Resolved
    Resolved

    AWS reports continued improvement in recovery. All previously affected stacks are now operational. This incident has been resolved.

  • Update
    Update

    AWS reports progress with the mitigation and we are seeing signs of recovery. Jobs are successfully starting and finishing. We continue monitoring the issue.

  • Update
    Update

    The AWS outage is still ongoing and continues to affect our AWS US stack. We’re actively monitoring the situation and will share updates as soon as we have more details.

  • Monitoring
    Monitoring

    AWS reports another "significant API errors and connectivity issues across multiple services in the US-EAST-1 Region" which has impact on AWS US stack. Jobs are delayed.

  • Investigating
    Investigating

    We still see degraded performance in AWS US stack, the jobs are currently failing start.

  • Update
    Update

    The AWS incident in the US region is still not fully resolved. AWS is reporting errors leading to limited capacity to schedule new workloads, which may continue to affect the AWS US stack performance.

  • Update
    Update

    All stacks are operational except AWS US. The AWS US stack shows degraded performance, primarily affecting job listing visibility in the UI. Job execution is not impacted and continues to run as expected.

  • Update
    Update

    We’re still seeing issues on the AWS US stack following the AWS incident. Some jobs are stuck in processing and may not complete. We’re working on mitigation. Other stacks are running normally, and we’re continuing to monitor their performance.

  • Update
    Update

    Jobs are now running successfully on all stacks. AWS has reported significant signs of recovery. Orchestration has been unpaused on the AWS US stack, and all orchestrations scheduled between 10:30 and 11:30 (CET) will be gradually triggered until synchronization is restored.

  • Update
    Update

    We’re seeing jobs complete successfully on all stacks except the AWS US stack, so we’re resuming orchestrations for those unaffected stacks and continue monitoring the issue.

  • Monitoring
    Monitoring

    We’ve paused job scheduling to prevent further impact while the AWS incident is being resolved.

  • Update
    Update

    We identified multiple stacks are experiencing degraded job execution performance. The degradation is linked to platform images hosted in the affected AWS US region.

  • Identified
    Identified

    Based on report from AWS https://health.aws.amazon.com/health/status that this issue is affecting cloud on AWS us east 1 region only.

    We will report back if we have more information

  • Investigating
    Investigating

    We are currently investigating issue with jobs not starting on AWS US east stack.

    Our team has been notified and is actively investigating.

    Next update in 20 minutes or when new information is available.

October 2025 to December 2025

Next