Cognizant is one of the world's leading professional services companies, transforming clients' business, operating, and technology models for the digital era. The Principal DevOps Engineer will lead the implementation of DevOps best practices, work with engineering teams to provide technical solutions, and guide other engineers while ensuring high availability and performance of cloud production systems.
Responsibilities:
- This role requires a comprehensive understanding of DevOps best practices, including version control, automation, deployment, and monitoring
- In this role, you will work with various Engineering team members to provide technical solutions as well as guide and support other DevOps Engineers on the implementation
- Design, build, maintain, and troubleshoot tools and pipelines for creating and managing scalable, secure distributed software systems
- Foster a passion for improving product and system availability, performance, scalability, and security
- Operate cloud production systems with a strong focus on monitoring and metrics to assess feature health and usage
- Collaborate closely with product management and application engineers to ensure robust system design, prioritizing high availability
- Lead the evaluation of new technologies and drive their integration into our technology stack
- Provide support for deploying and operating multiple clusters across various client manufacturing sites and the global client business network
- Assist in deploying and operating Dev, QA and Prod clusters at AWS
- Streamline deployment and operational workflows through automation
- Thoroughly document systems, processes, and assets to meet client expectations for business continuity
- Administer CI tools (BitBucket, Jenkins, Artifactory, SonarQube)
- Work closely with the vendor in addressing any major upgrades, plug-ins, and critical fixes
- Propose and implement new technical strategies for DevOps processes
Requirements:
- Demonstrable track record of successful DevOps implementations, with a deep understanding of modern infrastructure and automation tools
- Extensive Hands-On experience with Docker, Docker-Compose, Kubernetes, Helm, and image management
- Five or more years of experience with Terraform, Ansible, or similar system automation/deployment tools
- Five or more years of experience with build systems like Jenkins and Bitbucket Pipelines
- Proficiency in shell scripting and Python
- Familiarity with enterprise logging solutions like Datadog
- Experience with Apache Pulsar and PostgreSQL
- Familiarity with JFrog Artifactory
- Competence in Linux system administration
- Strong teamwork skills and the ability to prioritize tasks in a fast-paced environment
- Availability in Eastern or Central US time zones (EST or CST)
- Java development experience is a bonus
- Experience in migrating existing workloads to Kubernetes is a plus
- Background in logistics, warehousing, 24/7 operational environments and distributed cloud/on prem deployments