Key Accountabilities:
1. Design and implement scalable cloud infrastructure solutions across multiple platforms (AWS, Azure, etc), ensuring high availability, disaster recovery, and optimal performance while following security best practices and compliance requirements.
2. Lead the development and maintenance of Infrastructure as Code (IaC) using tools like Terraform, CloudFormation, or Pulumi, establishing standardised deployment processes and maintaining version control of infrastructure configurations.
3. Architect and implement CI/CD pipelines, automating build, test, and deployment processes while ensuring code quality, security scanning, and automated testing are integrated throughout the delivery pipeline.
4. Monitor and optimize cloud infrastructure costs through resource optimization, implementing cost allocation strategies, and providing regular cost analysis reports with recommendations for improvement.
5. Establish and maintain cloud governance frameworks, including access management policies, security controls, and compliance monitoring, ensuring adherence to industry standards and organizational requirements.
6. Collaborate with cross-functional development & Data teams to drive cloud adoption, providing technical guidance, conducting code reviews, and sharing best practices for cloud infrastructure development.
7. Design and implement container orchestration solutions using Kubernetes or similar technologies, ensuring scalability, reliability, and efficient resource utilization.
8. Develop and maintain comprehensive documentation for cloud infrastructure, including architecture diagrams, runbooks, and standard operating procedures.
9. Lead incident response and troubleshooting efforts for critical production issues, performing root cause analysis and implementing preventive measures to improve system reliability.
10. Research and evaluate new cloud technologies and services, making recommendations for adoption to improve infrastructure capabilities and operational efficiency.
Networking/Key Relationships:
Internal
Infrastructure Platform Team
Security Team
Enterprise Architecture Team
Business Partners
Development Teams
Data Analytics Team
Finance Team
Compliance/Risk Management Team
External
Cloud Service Providers (AWS, Azure) - Managing vendor relationships, staying current with service updates, and escalating support issues when needed.
Minimum Knowledge & Experience Required for the Position:
Relevant cloud certifications preferred (e.g., AWS Solutions Architect Professional, Azure Solutions Architect Expert, etc)
5+ years of hands-on experience in cloud platform engineering, with at least 3 years focused on major cloud providers (AWS, Azure, or GCP)
3+ years of experience with Infrastructure as Code (IaC) and automation tools
3+ years of experience with CI/CD implementation and management
Good understanding of DevOps principles and practices
Experience setting up an Internal Development platform (would be a plus)
Skills & Capabilities:
Technical Skills:
1. Expert-level knowledge of major cloud platforms (AWS/Azure), including compute, PaaS, storage, networking, and security services
2. Strong proficiency in Infrastructure as Code (IaC) using tools like Terraform, CloudFormation, or ARM templates
3. Good knowledge of Serveless technologies (Lambda, Docker, Kubernetes) and container orchestration
4. Good experience with CI/CD tools and practices (Jenkins, GitLab CI, GitHub Actions)
5. Solid scripting and automation skills.
6. Expertise in monitoring, logging, and observability tools (Prometheus, Grafana, ELK Stack)
Soft Skills:
1. Strategic thinking and planning
2. Strong analytical and problem-solving abilities
3. Excellent collaboration and team skills
4. Adaptability and willingness to learn new technologies
5. Good time management and organizational skills
6. Ability to work under pressure and handle critical incidents
Travel Requirements:
Not required