of our enterprise systems. This role will set the vision for site reliability engineering (SRE) practices, observability frameworks... visibility and actionable insights. Lead performance engineering initiatives including capacity planning, load testing...
transformations. Experience implementing an SRE practice to improve platform and application reliability, performance..., Rally, SonarQube, Jira, Azure DevOps, or AWS Code Pipeline. Proven experience using performance and observability tools...
improvements in system health, performance, and reliability. Perform capacity planning, conduct performance profiling, and execute... system tuning to support scalability, enhance reliability, and optimize system performance across distributed environments...
deployment health. Implement and tune observability tools and monitoring systems to ensure performance and reliability. Lead... direction through cross-functional collaboration. Lead postmortems and share learnings across teams; maintain decision logs...
specializing in VMware, Kubernetes, OpenShift, Storage, and Compute platforms. In this high-impact role, you will lead the design... infrastructure. This role demands a systems thinker who thrives in large-scale hybrid environments, prioritizes platform reliability...