Senior Data Availability Engineers

KSA
May 7, 2024
Apply Now

Job Description

Our client specializes in guiding organizations across both private and public sectors on their transformative path to the cloud, leveraging their cutting-edge Sovereign Multi-Cloud for Business Platform. Renowned as leaders in the design, supervision of construction, and operation of Data Centers throughout the MENA and GCC regions, they actively contribute to fostering innovation and operational efficiency.

With a focus on strategic IT transformation, the company aligns closely with client objectives, providing invaluable support in effective planning, seamless implementation, and proficient management of IT infrastructure.

About the role:

  • Execution of Disaster Recovery Plan (included in the IT Continuity and Resilience Document): Execute the detailed disaster recovery plan by following the steps to be taken in the event of a disaster.
  • Monitoring and Alerting: Continuously monitor the health and performance of the applications and databases, ensuring that they are running smoothly, and that DR is being performed as per the recovery plan and meeting the respective RTOs and RPOs. Notify the relevant stakeholders in case of any issues or failures, and coordinate with CSP to resolve issues.
  • Identify and report any changes in the source customer on prem environment and ensure that the change Request is communicated with the client’s prior to implementing changes in source environment to ensure smooth replication.
  • Testing and Validation: Lead disaster recovery testing (in compliance with scope of work, 2 Full tests yearly, 4 partial tests yearly, Review and Update BIA for in scope applications semi-annually).
  • Identify any weaknesses or gaps and update the recovery plan and any associated documentation accordingly. Involve relevant stakeholders/applications owners from customer end during testing.
  • Incident Management: Ensure ticket is raised on CSP service desk portal and coordinate with the necessary teams to resolve issues and restore services as quickly as possible.
  • Documentation and Reporting: Maintain accurate documentation of the disaster recovery processes, including recovery procedures, configurations, and any changes made. Generate reports on the status of the disaster recovery operations, highlighting any issues, improvements, or recommendations.
  • Communication and Collaboration: Act as a liaison between the CSP and customer, ensuring clear and effective communication. Collaborate and maintain relationships and application owners and IT Team.
  • Continuous Improvement: Continuously evaluate and improve the disaster recovery processes to enhance the overall resilience and reliability of the applications and databases.

Relevant skills and qualifications:

  • Bachelor degree in Computer Science, Engineering, or related field.
  • Proven experience in data engineering, with a focus on data availability and reliability.
  • Desirable proficiency in programming languages such as Python, Java, or Scala.
  • Experience with data backup technologies such as Veeam and DataGuard.
  • Desirable solid understanding of database systems, data warehousing, and cloud platforms.
  • Ability to thrive in a fast-paced, dynamic environment.
Share this post