Manufacturing

Optimizing Multicluster Observability and Incident Management for a Semiconductor Giant

A top global semiconductor company, headquartered in Europe with over 50,000 employees, faced challenges managing complex OpenShift deployments across multiple clusters. The Platform Engineering team supported hundreds of developers with varying levels of cloud-native knowledge, using an extensive tech stack of Kubernetes, OpenShift, EKS, Prometheus, Grafana, and Elasticsearch.

Challenges Encountered

Limited Cross-Cluster Visibility: Teams lacked a unified view of multiple clusters, creating challenges in triaging issues and managing incidents across their vast environment.

High Support Demand: Non-Kubernetes-savvy developers relied heavily on the platform team to troubleshoot, creating bottlenecks that limited innovation.

Alert Noise: Unfiltered alerts led to low signal-to-noise ratios, increasing cognitive load for developers and slowing response times.

Security and Compliance Risks: The customer success team needed access to the CLI and control plane for troubleshooting, which posed compliance concerns.

The Solution

Robusta provided the platform team with a one-click rollout solution, empowering developers to self-diagnose application issues and receive alert prioritization enriched with relevant metrics and logs. By adding Robusta’s secure interface, the platform team reduced their MTTR and maintained system reliability without increasing compliance risks. Additionally, Robusta’s cross-cluster observability enabled Christine and her team to gain centralized health insights, enhancing response times.

Robusta offers exactly what they need:

Reduced MTTR and Incident Noise through Prioritized, Context-Rich Alerts: Robusta’s AI-driven alerting system filters and prioritizes alerts based on urgency and relevance, giving each alert context such as historical trends, associated logs, and recent metrics. This contextualized prioritization empowers the team to quickly pinpoint critical issues, leading to faster resolution times (MTTR) and less distraction from minor alerts, enabling faster recovery from incidents and improved system resilience.

Increased Developer Autonomy, Resulting in Fewer Tickets and Faster Release Cycles: With Robusta’s self-service capabilities, developers gain the tools to diagnose and address application issues independently. This shift reduces the volume of support tickets directed to the platform team, allowing developers to troubleshoot without waiting for intervention. As a result, developers work with greater autonomy, moving their projects along at a faster pace, which translates into shorter development cycles and quicker time-to-market.

Enhanced Cluster Visibility Across Clusters, Reducing Cognitive Load and Downtime: Robusta’s centralized dashboard provides a comprehensive view across all clusters, giving the platform team a single source of truth for monitoring the health and performance of their Kubernetes environment. This cross-cluster visibility helps the team detect and address multi-cluster incidents quickly, lowering cognitive load and minimizing downtime. Instead of toggling between multiple windows, the platform team can focus on rapid decision-making and high-level platform improvements.

Improved Security and Compliance with Limited Cluster Access Requirements: Robusta’s secure, role-based access controls allow the customer success team to troubleshoot without needing full access to Kubernetes control planes. By limiting the need for CLI access, the organization maintains compliance with internal security standards, reducing the risk of unauthorized changes or access issues. This feature also ensures that troubleshooting doesn’t compromise platform security, striking a balance between effective incident response and regulatory adherence.

The Outcome

Using Robusta, the semiconductor company successfully scaled its Kubernetes environment without overburdening the platform team, streamlined incident management, and provided a seamless experience for developers across their OpenShift platform. This case highlights Robusta’s ability to optimize multi cluster observability for enterprise-level deployments.

Download in PDF

Trusted By Platform Engineering and DevOps Teams Around The Globe

I really like the stream of information you get simply by installing Robusta. As an operator, it is a no brainer to add it to my clusters. Gives really good insights without a lot of effort.

Matthias Nguyen, Managing Director Unbasical GmbH

"It's the easiest monitoring solution there is for k8s, an excellent, feature rich product, with a team of people behind it you could have a beer with."

Andrew Riddell, IT Systems Manager UGL

“I start mornings by checking production in Robusta. I love how Robusta is opinionated, highlights problems and significant events. After viewing details, I know enough to resolve issues.”

Keir Robinson, Engineering Manager, Navenio

"By adding Robusta to kube-prometheus-stack and enabling alert grouping, we reduced the number of Slack messages by 90% without missing a single important notification."

Yoni Golob, DevOps Engineer,
Placer.ai

“I use Robusta for governance of my Kubernetes infrastructure. A major strength is the Prometheus integration (kube-prometheus-stack).”

Roberto Iannone, DevOps Engineer, RiAtlas

“We manage kubernetes clusters for multiple clients. With Robusta, it's far easier to compare deployments across our clusters, and notice discrepancies in deployed versions.”

Asbjørn Dyhrberg Thegler, DevOps Consultant, Deranged

“One of the most satisfying features of Robusta is consolidating monitoring data from dozens of clusters across multiple regions into a unified interface.”

Silviu Iaşcu, Director Infra Operations & Cloud,
Jedox

“We adopted Robusta for one of our clients in order to have enriched alerts coming from both in-cluster Kubernetes events and an out-of-cluster Alert Manager installation.”

Diego Ojeda, DevOps Consultant, BinBash

“With Robusta, I don’t need to check my cluster’s health every day. If something needs my attention, I get a message in Teams. I can escalate critical issues immediately.”

Oleg Minaev, Lead Backend Developer, Aureliym GmbH

“We're using Robusta to standardize k8s alerting. Previously, we were using kube-prometheus-stack but the default alerts were too noisy and it was harder to configure”.

James Wu, Space Telescope Science Institute

“I told my devops team to evaluate all the observability tools they want, and to choose the best one for Kubernetes. They chose Robusta."

Yonatan Itai, VP R&D, Cyera

Lorem ipsum dolor sit amet consectetur. Lectus cras mauris egestas vestibulum libero quam aliquet tortor. Platea malesuada quis quam ultrices eu egestas.

Lorem ipsum dolor sit amet consectetur. Lectus cras mauris egestas vestibulum libero quam aliquet tortor. Platea malesuada quis quam ultrices eu egestas.

Lorem ipsum dolor sit amet consectetur. Lectus cras mauris egestas vestibulum libero quam aliquet tortor. Platea malesuada quis quam ultrices eu egestas.

Create your account to get started

Email us, and we'll provide you with a login link to complete your onboarding from your computer, where Robusta performs at its best.

Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.