StackPulse has announced a Kubernetes-centric “operations center” initiative as part of its Reliability platform.
With these additions, StackPulse gives organizations running Kubernetes a set of capabilities that augment their existing incident response practices. The goal is to help site reliability engineers (SREs) understand and investigate issues faster and deploy well-tested outage mitigation strategies that help prevent customer-facing downtime.
The 15-month-old company, which exited stealth mode in January with $28 million in funding, automates tasks associated with outage response so that SRE and DevOps teams can recover applications more quickly, limiting lost revenue and degraded customer experiences.
Additionally, StackPulse has released more than a dozen playbooks built by SRE experts that remediate common Kubernetes problems. Using the StackPulse platform to automate these playbooks reduces the time to resolution, helping teams restore services faster and meet SLOs.
All of the Kubernetes tools and automated diagnostics are available in the same platform as StackPulse’s incident response functionality, so teams can communicate during outages, centralize event data, and take remediation action.