Software Engineer
The Associate Operations Engineer is responsible for identifying, investigating, detecting, and protecting
service availability and data across a wide spectrum of source types and locations. The role involves
validating alerts or reports to determine whether they constitute an incident, ensuring incidents are
properly recorded in the appropriate reporting systems, and assessing their severity and impact. Initial
corrective actions may also be taken when required.
This role focuses on operational activities such as batch scheduling, processing, monitoring, and
remediation for services critical to the back-office functions that drive the business.
Additionally, the Associate Operations Engineer will act as a primary initiator of the Major Incident
process and will work closely with Incident Management and Security Operations teams when incidents
involve potential system integrity, access, or security concerns.
How You Will Make an Impact:
● Provide monitoring, correlation analysis, and incident response for operational events.
● Validate alerts or reports to determine whether they qualify as operational or security-related
incidents.
● Ensure incidents, batch failures, and reports are accurately recorded in the appropriate reporting
systems.
● Take accountability for incidents during assigned 24x7 roster shifts, ensuring timely resolution
and communication.
● Collaborate with support teams to manage event identification and incident resolution,
supporting a seamless Major Incident Management process.
● Assist in identifying event-to-incident correlations with a high degree of certainty and accurate
classification.
● Perform initial triage of alerts or events to determine whether they represent operational
incidents or security investigations.
● Monitor infrastructure, application, and platform logs to identify operational anomalies or
potential security-related events.
● Review system, middleware, and infrastructure logs to identify patterns indicating service
disruptions, abnormal behavior, or security concerns.
● Support the correlation of operational and security-related events during incident investigations
or major incidents.
● Maintain Runbook documentation to ensure it remains accurate, compliant, and up to date.