SRE (Site Reliability Engineer)

الإمارات - Dubai United Arab Emirates

We are seeking a highly motivated and experienced SRE to join our dynamic team. The ideal candidate will be responsible for improving software applications to ensure reliability amidst frequent updates from development teams.


Duties and Key Accountabilities

  •  Ensure all our infrastructure are running at optimal condition.
  • Provide deployment, patches and update on all services that running on public cloud and on premise.
  • Identify and resolve support ticket that are related to our infrastructure and services.
  • Work closely with developer to provide a completed, up to date and readable documentation.
  • Develop SRE task related documentation for future reference and better tracing.
  • Monitor our services using Grafana and identify bottleneck if any. Provide immediate action and troubleshooting when necessary.
  • Maintain, enhance our monitoring system including but not limited to Grafana, Victoria Metrics, Alert Manager.
  • Work closely with cross department to provide update and patch on our services using our CICD tools.
  • Identify on system log to provide better understand on service outage and issues.
  • Perform preventive maintenance to our system and infra.
  • Always willing to learn new technology and tools.

Skills

  • Having 1 years or more in DevOps, Network engineer, SRE related field is required.
  • Familiar with Linux and networking related skills.
  • Able to work and solve problems independently when required.
  • Having hands-on experience with bash script.
  • Brief understanding on how cloud infrastructure (Alicloud, AWS, GCP and more) works.
  • Able to work on call
  • Willing to learn new technology such as Grafana, Terraform, Gitlab CI/CD, ArgoCD and Ansible.


Having the following would be an edge.

 

  • Understand how docker and kubernetes work
  • Programming experience. (python and golang)
  • Brief understanding on Terraform, Ansible, Packer is a plus
  • Having hands-on knowledge in cloud computing, kubernetes, Gitlab etc.
  • Having hands-on knowledge in Terraform and Ansible related skills.


What we Offer?

● Training will be provided if needed

● Visa/EID

● Medical Insurance

● 30 working days Annual paid Vacation

● Flight Ticket allowance every Year

 

Additional Information:

● Work 5 days per week

● 9:00 AM - 6:00 PM

● Office is in Barsha Heights

تاريخ النشر: ١٦ مايو ٢٠٢٤
الناشر: Bayt
تاريخ النشر: ١٦ مايو ٢٠٢٤
الناشر: Bayt