Support production infrastructure powering critical banking operations across enterprise platforms and systems
Utilize AI-powered operational assistants to troubleshoot incidents, access platform knowledge, and execute operational procedures
Monitor system health, performance, availability, and reliability across both legacy and distributed technology environments
Investigate incidents, perform root cause analysis, and drive timely resolution of production issues
Participate in change management, release management, disaster recovery, and business continuity activities
Collaborate with infrastructure, application, cybersecurity, risk, and business teams to ensure service availability and stability
Identify and implement opportunities for automation, operational improvements, and increased platform efficiency
Contribute to operational excellence initiatives and the evolution of AI-augmented operations practices.

Strong experience in infrastructure operations, systems administration, site reliability engineering (SRE), platform engineering, or related disciplines
Hands-on experience supporting highly available enterprise production environments
Practical expertise in Linux/Unix or Windows Server administration, cloud infrastructure operations, enterprise monitoring, or infrastructure automation
Strong knowledge of incident, change, and problem management processes
Advanced troubleshooting and root cause analysis skills
Familiarity with IT operations in regulated industries, preferably banking or financial services
Knowledge of banking operations such as payments, treasury, lending, wealth or asset management, securities, custody, or operational risk is an advantage
Exposure to IBM Z, z/OS, CICS, DB2, IMS, RACF, JES, SDSF, or related technologies is beneficial
Ability to learn quickly and adapt in complex technology ecosystems
Strong communication and collaboration skills for working across technical and business teams.

VibeOps Engineer

Key skills