CodeRoad Inc provides end-to-end software development services, helping businesses scale with ideal infrastructure solutions. As a DevOps Engineer, you will architect, scale, and maintain the cloud infrastructure for AI platforms, ensuring high-throughput microservices run seamlessly while maintaining security and compliance.
Responsibilities:
- Own and evolve production-grade cloud infrastructure on Azure, anchoring foundations across networking, compute, storage, and security
- Design and maintain robust Infrastructure-as-Code (IaC) architectures utilizing Terraform, establishing reusable modules and strict environment promotion practices
- Lead the orchestration and scaling of containerized services via Docker and Kubernetes, driving cluster hygiene, deployments, and smooth rollout strategies
- Build and optimize end-to-end CI/CD pipelines using GitHub Actions, integrating strict test gates, secure artifact management, and automated secrets handling
- Collaborate with engineering teams to elevate service reliability, implementing advanced observability, tailored alerting, SLOs, and proactive incident response workflows
- Architect and support high-availability data and messaging layers, optimizing operations for PostgreSQL and asynchronous workloads powered by RabbitMQ or Kafka
Requirements:
- 5+ years of professional experience in DevOps, SRE, or Platform Engineering roles
- Advanced Azure Ecosystem Expertise with a proven track record of operating complex production environments
- Deep Terraform Proficiency, including remote state management, module design, drift detection, and zero-downtime change management
- Hands-on Container Mastery spanning Docker and Kubernetes day-2 operations, ingress routing (NGinx/HAProxy), and HTTPS/TLS best practices
- An Ownership Mindset with a strong capability to evaluate infrastructure design options for scalability, flexibility, and security
- Advanced English proficiency (written and spoken) for fluid, asynchronous, and real-time collaboration
- Bachelor's degree in Computer Science, Computer Engineering, or equivalent practical experience
- Production experience operating HashiCorp Vault (HA setups, token rotation workflows, and strict least-privilege policies)
- Solid background in HIPAA and SOC2 audit readiness, compliance tracking, and threat modeling
- Hands-on familiarity with modern observability stacks for metrics, logs, and distributed tracing
- Experience leveraging LLMs and AI tools as a force multiplier within engineering and infrastructure automation workflows