About the Role
Overview:
We are looking for a seasoned Cloud DevOps Engineer who can design, implement, and manage our cloud infrastructure on AWS. The ideal candidate will have extensive experience in DevOps practices, automation tools, cloud computing technologies, and disaster recovery strategies. This role involves ensuring the reliability, scalability, security, and disaster recovery readiness of our cloud-based systems while collaborating with cross-functional teams to drive continuous improvement.
Key Responsibilities:
Cloud Infrastructure Management:Design, deploy, manage, and optimize AWS cloud infrastructure including EC2 instances, S3 buckets, RDS databases, VPCs, subnets, security groups, IAM roles/policies.
Ensure compliance with best practices for security, scalability, and performance.Automation & Scripting:
Develop automation scripts using tools like AWS CloudFormation, Terraform, Ansible or similar technologies.
Automate deployment processes using CI/CD pipelines (AWS CodePipeline/Jenkins/GitLab CI/CD).
Monitoring & Logging:
Implement monitoring solutions using AWS CloudWatch or third-party tools like Prometheus/Grafana.
Set up logging mechanisms to ensure visibility into system performance and issues (Cloud watch).
Security & Compliance:
Implement security best practices such as encryption at rest/in transit (KMS), access controls (IAM), network security (VPCs).
Ensure compliance with industry standards like HIPAA/GDPR if applicable.
Disaster Recovery:
Design and implement robust disaster recovery strategies using AWS services such as Amazon S3 Glacier for data archiving and recovery.
Configure backups for critical data using services like Amazon RDS Automated Backups or AWS Backup.
Develop Business Continuity Plans (BCPs) and Disaster Recovery Plans (DRPs) tailored to the organization’s needs.
Conduct regular disaster recovery drills to ensure readiness.
Collaboration & Communication:
Work closely with development teams to understand application requirements and provide appropriate infrastructure support.
Communicate technical information effectively to both technical and non-technical stakeholders.
Troubleshooting & Optimization:
Troubleshoot issues related to cloud infrastructure performance or availability.
Optimize resource utilization for cost efficiency without compromising on performance or reliability.
Documentation & Knowledge Sharing:
Maintain detailed documentation of cloud architecture designs and configurations.
Share knowledge through training sessions or workshops within the team.
Qualifications:
Education: Bachelor’s degree in Computer Science or related field; advanced degree preferred but not required.
Experience:
Minimum of 5 years of experience in a similar role focusing on cloud computing (AWS preferred).
Proven track record in designing and managing large-scale cloud infrastructures.
Experience with DevOps tools such as Jenkins/GitLab CI/CD pipelines is essential.
Previous experience in implementing disaster recovery solutions is highly desirable.
Technical Skills:I
In-depth knowledge of AWS services including EC2, S3, RDS, VPCs etc.
Proficiency in scripting languages like Python/Bash/PowerShell.
Experience with automation tools like Terraform/CloudFormation/Ansible.
Strong understanding of containerization technologies like Docker/Kubernetes if applicable.
Familiarity with monitoring/logging tools such as CloudWatch/Prometheus/Grafana.
Knowledge of backup/recovery solutions like Amazon S3 Glacier and AWS Backup.
Certifications:AWS Certified Solutions Architect – Associate/Professional level certification is highly desirable but not mandatory.
Soft Skills:
Strong problem-solving skills with attention to detail.
Excellent communication skills both written and verbal.
Ability to work independently as well as part of a team.
Continuous learner who stays updated with latest trends in cloud computing and DevOps practices.
What We Offer:
Competitive salary package
Professional development opportunities
Access to cutting-edge technologies
Collaborative work environment