Axway is a leader in enterprise integration, helping organizations with digital transformation through secure software solutions. They are seeking a Principal Cloud Services Architect to design and implement cloud infrastructure solutions and manage mission-critical workloads on AWS.
Responsibilities:
- Design and implement cloud and hybrid (mix of cloud and on-premises) based infrastructure solutions that include both virtualized compute and storage
- Design and implement high availability and disaster recovery solutions that span the cloud and on-premises
- Design and implement VPCs to deploy mission-critical production applications using AWS infrastructure services
- Act as a Technical Lead for customers' onboarding projects and work with Customers to establish G2C connectivity
- Install, configure, implement, and support Windows/Linux servers, including management of user/group accounts & policies, integration to Active Directory, Azure Active Directory, and Active Directory Federation Services
- Manage the patching regime of all systems
- Manage the global hosting asset/inventory and perform lifecycle planning and execution
- Manage monitoring systems such as Prometheus, Nagios, or equivalent
- Manage other hosting-related technologies such as proxy/web filtering, load balancers, WAF, backup and replication solutions, and so forth, such as NetApp ONTAP, GlusterFS, EFS, etc
- Provide L3 support for the cloud infrastructure team on AWS, Storage, Cloud Networking, and Linux platforms
- Create and review monthly operations reports
- Define SOPs and participate in Audits and DR activities
- Ensure all solutions comply with corporate security policies and standards
- Gather design requirements from stakeholders (e.g., Business and application groups) and translate them into functional and technical requirements
- Automate repeatable activities
- Able to perform Capacity Planning on the customer environment on Managed Cloud and provide recommendations for improving availability and cost optimization
- Manage the lifecycle of all requests and incidents that arise, driving for the root cause of problems to prevent incidents from recurring
- Participate in after-hours support (on call) and scheduled implementation activities as required
Requirements:
- 9+ years of Experience with working on AWS Infrastructure services (Architecture/administration/operations)
- Strong working knowledge of AWS services: EC2, EBS, S3, EKS, Route53, IAM, Load balancers, ACM VPN, VPC, Private Link, Transit Gateway, WAF, RDS, Systems Manager, Billing, Trusted Advisor, SSO, Cloud Watch, Cloud Trail, Backup, and Lambda
- Should be proficient in Terraform, Python, Ruby, Bash, and PowerShell
- Proficiency in Linux/Unix and Windows OS is a must
- Good working knowledge of Networking concepts and tools: Routing, Switching, Firewall security, Proxy, Reverse Proxy, HAProxy, Nginx
- Experience in using the tools: GiT, Jenkins, Puppet, Ansible
- Experience with System hardening guidelines, e.g., CIS, NIST
- Experience with Infrastructure Monitoring tools is required
- Strong knowledge on Kubernetes platform
- Require minimal supervision and work well in the Production Operations team
- Ability to work efficiently under pressure on non-routine and highly complex tasks
- Team player with strong interpersonal, written, and verbal communication skills
- Ability to work in a multicultural team spread across the globe (Europe, India, NA)
- Must have a customer-first mindset
- Experience in working with Ticketing tools like Salesforce, Service Now, and Jira is preferred
- AWS Solution Architect certification is a plus
- Knowledge of Microsoft Azure is a plus