Use data and observability tools to analyze operational metrics, detect anomalies, forecast incidents, and deliver insights to improve application/system performance and reliability. Assist in root cause analysis by providing data-backed insights during and after incidents. Collaborate with IT to define the issue, root cause, and action plan. Recommend improvements to thresholds, alerts, and monitoring strategies. Monitor ticket or list of problems and manage the resolution.