2nd International Conference on Sustainable Computing and Intelligent Systems (SCIS 2025)

Mr. Sai Raghavendra Varanasi

Accelerating Claims Resolution: SRE-Led MTTR Optimization in Cigna

Abstract:

Site Reliability Engineering (SRE) has emerged as a transformative force for increasing uptime and operational agility within healthcare organizations. In the context of Cigna’s large-scale claims systems, optimizing Mean Time to Recovery (MTTR) became a business imperative to ensure rapid, reliable claims processing and support the delivery of timely patient care. This session presents real-world strategies and architectural interventions deployed by Cigna’s SRE teams to analyze and systematically reduce MTTR across critical claims workflows. The talk covers the integration of advanced monitoring, automation of incident response, adoption of blameless postmortems, and the deployment of AI-powered root cause analysis for predictive resilience. Key metrics, implementation challenges, and outcomes are discussed, demonstrating how a culture of reliability engineering drives measurable improvements in claims turnaround, reduces customer impact, and elevates healthcare service quality. Attendees will gain actionable insights for leveraging SRE principles and data-driven automation to deliver robust, patient-centric operations in complex healthcare environments.

Profile:

Having 14 years of experience as a Software Change/Release Manager in orchestrating seamless release and change processes across both on-premises and cloud environments, including complex cloud migration projects. By leveraging advanced DevOps automation and AI-driven tools, pipelines have been optimized to support unified build, test, and deployment workflows adaptable to hybrid infrastructures. AI-enabled code analysis, automated validations, and predictive analytics streamline integration and deployment, enabling early risk detection in both legacy and cloud-native stacks. During migrations, intelligent automation assists in dependency mapping, environment replication, validation, minimizing downtime to ensure data integrity. Utilized AIOps solutions to deliver automated incident detection, triage, root cause analysis, risk assessment, low-risk cloud adoption. Site Reliability Engineering (SRE) practices are enhanced with intelligent runbooks, self-healing automation to support environment consistency, resilience for migration activities. Integrated AI in the release process to ensure reliable, secure, and compliant delivery for multiple web applications.