Sr. Cloud Engineer

6 months ago


Jeddah, Saudi Arabia | Talent Pal | Full-time

We are seeking an experienced Senior Cloud Engineer with expertise in AWS, Kubernetes, and related cloud technologies. In this role, you will maintain and optimize AWS EC2 instances, manage Kubernetes clusters, and ensure the smooth operation of critical infrastructure components such as Teleport, Graylog, Sentry, and ClickHouse. You will also implement and maintain monitoring tools such as PagerDuty, CloudWatch, Grafana, and Nagios to ensure the availability, performance, and security of our cloud-based systems.

**Responsibilities**:
1/ AWS Infrastructure Management:

- Manage and optimize AWS EC2 instances, ensuring their availability, performance, and scalability.
- Implement and maintain infrastructure-as-code (IaC) using tools such as Terraform to automate provisioning and management tasks.
- Collaborate with cross-functional teams to design, architect, and implement scalable and reliable cloud infrastructure solutions.

2/ Kubernetes Cluster Maintenance:

- Manage and maintain Kubernetes clusters, including deployment, scaling, and troubleshooting.
- Implement and maintain cluster monitoring, logging, and alerting systems.

3/ Cloud Technologies Maintenance:

- Maintain and optimize critical cloud technologies such as Teleport, Graylog, Sentry, and ClickHouse.
- Monitor and troubleshoot issues related to these technologies, ensuring their availability and performance.
- Collaborate with relevant teams to address any security vulnerabilities or compliance requirements.

4/ Monitoring and Alerting:

- Implement and maintain monitoring and alerting systems, including PagerDuty, CloudWatch, Grafana, and Nagios.
- Configure monitoring dashboards, alarms, and notifications to provide real-time visibility into system performance and availability.
- Collaborate with cross-functional teams to identify and resolve performance bottlenecks and ensure effective incident response.

5/ Automation and Scripting:

- Develop automation scripts and tools to streamline cloud infrastructure management and monitoring processes.
- Implement continuous integration and delivery (CI/CD) pipelines for infrastructure code and configurations.
- Promote best practices for infrastructure automation and security across the organization.

**Requirements**:

- Minimum 7 years of experience as a Cloud Engineer or in a similar role, with a focus on AWS and Kubernetes.
- Extensive hands-on experience in managing and optimizing AWS EC2 instances.
- Solid understanding of cloud infrastructure best practices and optimization techniques.
- Proficiency in managing and maintaining Kubernetes clusters, including deployment, scaling, and troubleshooting.
- Strong understanding of containerization technologies such as Docker and container orchestration platforms like Kubernetes.
- Experience maintaining and optimizing cloud technologies such as Teleport, Graylog, Sentry, and ClickHouse.
- Familiarity with logging, monitoring, and observability tools in cloud environments.
- Strong experience with monitoring and alerting systems such as PagerDuty, CloudWatch, Grafana, and Nagios.
- Ability to configure and maintain monitoring dashboards, alarms, and notifications.
- Proficiency in scripting and automation using languages like Python, Bash, or PowerShell.
- Experience with infrastructure-as-code (IaC) tools such as Terraform or CloudFormation.
- Excellent teamwork and collaboration skills, with the ability to work effectively across cross-functional teams.
- Strong verbal and written communication skills to articulate technical concepts and solutions effectively.