Keboola - Notice history

AWS EU (eu-central-1) - Operational

100% - uptime
September 2025 · 100.0%
October 2025 · 99.72%
November 2025 · 99.98%

GCP EU (europe-west3) - Operational

100% - uptime
September 2025 · 100.0%
October 2025 · 99.69%
November 2025 · 100.0%

AWS US (us-east-1) - Operational

100% - uptime
September 2025 · 100.0%
October 2025 · 99.04%
November 2025 · 100.0%

GCP US (us-east4) - Operational

100% - uptime
September 2025 · 99.99%
October 2025 · 99.65%
November 2025 · 100.0%

Azure NE (north-europe) - Operational

100% - uptime
September 2025 · 99.91%
October 2025 · 99.54%
November 2025 · 100.0%
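
The monthly uptime percentages above can be converted into approximate downtime. The following is a minimal sketch, assuming each figure covers the full calendar month and is rounded to two decimals, so the results are estimates only. `downtime_minutes` is a hypothetical helper written for illustration and is not part of any Keboola tooling.

```python
import calendar


def downtime_minutes(uptime_percent: float, year: int, month: int) -> float:
    """Approximate downtime (in minutes) implied by a monthly uptime percentage."""
    days_in_month = calendar.monthrange(year, month)[1]
    total_minutes = days_in_month * 24 * 60
    return total_minutes * (100.0 - uptime_percent) / 100.0


# October 2025 uptime figures as listed on this page.
october_uptime = {
    "AWS EU (eu-central-1)": 99.72,
    "GCP EU (europe-west3)": 99.69,
    "AWS US (us-east-1)": 99.04,
    "GCP US (us-east4)": 99.65,
    "Azure NE (north-europe)": 99.54,
}

for stack, pct in october_uptime.items():
    print(f"{stack}: ~{downtime_minutes(pct, 2025, 10):.0f} min")
```

Under these assumptions, 99.72% in October corresponds to roughly 125 minutes, and 99.04% to roughly 429 minutes.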

Notice history

November 2025

EU Central Job Issues
  • Resolved
    This incident has been resolved.
  • Update
    We are currently investigating this incident.
  • Investigating

    We are currently investigating reports of an outage affecting the Job Service on our https://connection.eu-central-1.keboola.com/ stack. Users may experience slow response times, connection errors, and UI errors.

    Our engineering team has been alerted and is actively investigating the root cause.

    We apologize for the disruption and will provide an update within 30 minutes or when new information is available.

EU Central Job Issues
  • Resolved
    This incident has been resolved.
  • Monitoring

    The issues should now be resolved. We are carefully monitoring the status.

  • Investigating

    We are currently investigating reports of an outage affecting the Job Service on our https://connection.eu-central-1.keboola.com/ stack. Users may experience slow response times, connection errors, and UI errors.

    Our engineering team has been alerted and is actively investigating the root cause.

    We apologize for the disruption and will provide an update within 30 minutes or when new information is available.

October 2025

Microsoft Azure Issue
  • Resolved

    Automatic scaling has been turned back on and all platforms are fully operational. We will continue monitoring all stacks. We're sorry for the inconvenience.

  • Update

    All Azure stacks are fully operational and all jobs are processing as usual. We will re-enable automatic scaling shortly.

  • Monitoring

    Azure is promising full mitigation within the next four hours. We are closely monitoring the situation. Once the issue is fixed, we will resume all operations and turn autoscaling back on.

    At this stage, we anticipate full mitigation within the next four hours as we continue to recover nodes. This means we expect recovery to happen by 23:20 UTC on 29 October 2025. We will provide another update on our progress within two hours, or sooner if warranted.

  • Update

    We have limited autoscaling on all Azure stacks to keep the issue from cascading to new nodes, which in turn limits the number of waiting jobs. We have seen partial improvement in certain regions, which is encouraging.

  • Update

    Microsoft has updated the impact and states that network infrastructure in all Azure regions is affected.

  • Identified

    These issues can propagate downstream to AWS and GCP stacks, where the symptoms can include:

    • Limited operations of some AI features (error explains, config description generator)

  • Investigating

    We are aware of performance degradation affecting our Azure infrastructure. This appears to be related to an upstream issue that Azure has announced as "Azure Portal Access Issues" (https://azure.status.microsoft/en-us/status).

    Symptoms include:

    • Failures to prepare new nodes to accept workload, causing some jobs to be in the waiting state.

    Our team is monitoring the situation.

    We apologize for the disruption and will provide an update within 30 minutes or when new information is available.

Scheduled Partial Maintenance of all Azure stacks – October 25, 2025
  • Completed
    2025-10-25 at 06:15
    Maintenance has completed successfully
  • In progress
    2025-10-25 at 05:50
    Maintenance is now in progress
  • Planned
    2025-10-25 at 05:50

    We would like to inform you about the planned maintenance of all Keboola stacks hosted on Azure.

    During the database upgrades there will be a short service outage on all Azure stacks, including all single-tenant stacks and Azure North Europe multi-tenant stack (connection.north-europe.azure.keboola.com). This will take place on Saturday, October 25, 2025 between 05:50 and 06:30 UTC.

    Effects of the Maintenance

    During the above period, services will be scaled down and the processing of jobs may be delayed. For a very brief period (at around 06:00 UTC) the service will be unavailable for up to 10 minutes and APIs may respond with a 500 error code. After that, all services will scale up and start processing all jobs. No running jobs, data apps, or workspaces will be affected. Delayed scheduled flows and queued jobs will resume after the maintenance is completed.

    Detailed Schedule

    • 05:50–06:00 UTC: processing of new jobs stops.

    • 06:00–06:15 UTC: service enhancement period.

    • 06:15 UTC: processing of jobs resumes.
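
Because APIs may respond with a 500 error code during the brief unavailability described above, API clients may want to retry rather than fail immediately. The following is a minimal sketch of retrying with exponential backoff. `call_with_retry` and its parameters are hypothetical names chosen for illustration, and `request` stands in for whatever function your client uses to call the API and return a status code and body. With the assumed defaults, the total wait is about 11 minutes, slightly longer than the announced window of up to 10 minutes.

```python
import time
from typing import Callable, Tuple


def call_with_retry(
    request: Callable[[], Tuple[int, str]],
    max_attempts: int = 5,
    base_delay: float = 45.0,
    sleep: Callable[[float], None] = time.sleep,
) -> Tuple[int, str]:
    """Call `request` until it returns a non-5xx status or attempts run out."""
    status, body = request()
    for attempt in range(1, max_attempts):
        if status < 500:
            break
        # Exponential backoff: base_delay, 2 * base_delay, 4 * base_delay, ...
        sleep(base_delay * 2 ** (attempt - 1))
        status, body = request()
    return status, body


# Simulated endpoint: two 500 responses, then success (no real network call).
responses = iter([(500, "maintenance"), (500, "maintenance"), (200, "ok")])
waits = []  # records the delays instead of actually sleeping
result = call_with_retry(lambda: next(responses), sleep=waits.append)
print(result, waits)  # prints (200, 'ok') [45.0, 90.0]
```

Passing `sleep` as a parameter keeps the sketch testable without real delays. In production use, the default `time.sleep` applies.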

Failing jobs on all stacks due to an outage in AWS US east region
  • Resolved

    AWS reports continued improvement in recovery. All previously affected stacks are now operational. This incident has been resolved.

  • Update

    AWS reports progress with the mitigation and we are seeing signs of recovery. Jobs are successfully starting and finishing. We continue monitoring the issue.

  • Update

    The AWS outage is still ongoing and continues to affect our AWS US stack. We’re actively monitoring the situation and will share updates as soon as we have more details.

  • Monitoring

    AWS reports further "significant API errors and connectivity issues across multiple services in the US-EAST-1 Region", which affects the AWS US stack. Jobs are delayed.

  • Investigating

    We still see degraded performance on the AWS US stack. Jobs are currently failing to start.

  • Update

    The AWS incident in the US region is still not fully resolved. AWS is reporting errors leading to limited capacity to schedule new workloads, which may continue to affect the AWS US stack performance.

  • Update

    All stacks are operational except AWS US. The AWS US stack shows degraded performance, primarily affecting job listing visibility in the UI. Job execution is not impacted and continues to run as expected.

  • Update

    We’re still seeing issues on the AWS US stack following the AWS incident. Some jobs are stuck in processing and may not complete. We’re working on mitigation. Other stacks are running normally, and we’re continuing to monitor their performance.

  • Update

    Jobs are now running successfully on all stacks. AWS has reported significant signs of recovery. Orchestration has been unpaused on the AWS US stack, and all orchestrations scheduled between 10:30 and 11:30 (CET) will be gradually triggered until synchronization is restored.

  • Update

    We’re seeing jobs complete successfully on all stacks except the AWS US stack, so we’re resuming orchestrations on the unaffected stacks and continuing to monitor the issue.

  • Monitoring

    We’ve paused job scheduling to prevent further impact while the AWS incident is being resolved.

  • Update

    We have identified that multiple stacks are experiencing degraded job execution performance. The degradation is linked to platform images hosted in the affected AWS US region.

  • Identified

    Based on the AWS report (https://health.aws.amazon.com/health/status), this issue affects only the AWS us-east-1 region.

    We will report back when we have more information.

  • Investigating

    We are currently investigating an issue with jobs not starting on the AWS US East stack.

    Our team has been notified and is actively investigating.

    Next update in 20 minutes or when new information is available.

September 2025

Scheduled Partial Maintenance of all Azure stacks
  • Completed
    2025-09-20 at 04:13

    The maintenance has been completed, and all services have been scaled back up. The platform is fully operational, and jobs are now being processed as usual. All delayed jobs will be processed shortly. Thank you for your patience.

  • In progress
    2025-09-20 at 04:00

    The announced partial maintenance is ongoing, and we are expecting downtime to begin any minute now.

  • Planned
    2025-09-20 at 03:50

    We would like to inform you about the planned maintenance of all Keboola stacks hosted on Azure.

    During the database upgrades there will be a short service outage on all Azure stacks, including all single-tenant stacks and Azure North Europe multi-tenant stack (connection.north-europe.azure.keboola.com). This will take place on Saturday, September 20, 2025 between 05:50 and 06:30 UTC.

    Effects of the Maintenance

    During the above period, services will be scaled down and the processing of jobs may be delayed. For a very brief period (at around 06:00 UTC) the service will be unavailable for up to 10 minutes and APIs may respond with a 500 error code. After that, all services will scale up and start processing all jobs. No running jobs, data apps, or workspaces will be affected. Delayed scheduled flows and queued jobs will resume after the maintenance is completed.

    Detailed Schedule

    • 05:50–06:00 UTC: processing of new jobs stops.

    • 06:00–06:15 UTC: service enhancement period.

    • 06:15 UTC: processing of jobs resumes.

Azure North Europe Stack – Major Degradation
  • Resolved
    We identified issues on one of our Kubernetes nodes. The node was taken out of service, and all systems are now operating normally. No jobs were lost or failed; the impact was limited to visible error messages in the user interface. We apologize for the inconvenience caused.
  • Investigating

    We are observing a major degradation in the Azure North Europe stack (connection.north-europe.azure.keboola.com). The root cause is still unknown and investigation is ongoing. Users may encounter various errors in the UI. Mitigations have been initiated.
