●Be the guardian of our digital world. You’ll be the architect of our resilience, ensuring Deriv’s systems can weather any storm. It’s not just about backup plans, it’s about building a fortress that keeps us running smoothly, no matter what.
●Strategise and lead. You’ll create comprehensive disaster recovery (DR) plans, mentor your team, and guide us through the ever-changing landscape of IT resilience. You’ll provide leadership and direction for Business Impact Analysis (BIA) and DR planning.
●Build unbreakable systems. You’ll design and deploy cutting-edge DR solutions tailored to our critical cloud applications and services (AWS, GCP), ensuring they’re robust, scalable, and ready for anything.
●Anticipate and mitigate risks. You’ll conduct deep-dive risk assessments, leverage machine learning, and lead exercises that prepare us for the unexpected. Ensure DR strategies meet or exceed RTO and RPO.
●Automate for speed and efficiency. You’ll develop frameworks and orchestration tools (e.g., Chef, AWS Step Functions, Ansible, Terraform, AWS CloudFormation) that streamline recovery processes, minimising downtime and maximising our ability to bounce back quickly. Leverage IaC techniques to automate the deployment, configuration, and testing of disaster recovery environments.
●Test, validate, and improve. You’ll design rigorous testing protocols, including disaster recovery drills and simulations, using observability tools like Grafana and AWS CloudWatch to ensure our plans are battle-tested and effective. Implement logging frameworks to ensure continuous monitoring, validation, and improvement of disaster recovery procedures. Integrate chaos engineering practices with automated testing tools to stress-test systems.
●Collaborate across boundaries. You’ll partner with teams across the organisation, ensuring everyone understands their role in disaster recovery and is ready to act when needed. Work closely with architects, system engineers, and security specialists.
●Lead in the face of crisis. When disruptions occur, you’ll take charge, coordinating recovery efforts and minimising impact with a calm and steady hand.
●Ensure compliance and readiness. You’ll navigate the complex regulatory landscape, ensuring we’re always prepared for audits and inspections, especially within the high-stakes financial sector. Provide detailed performance reports to senior leadership, regulatory bodies, and stakeholders.
●Never stop learning. You’ll stay ahead of the curve, continuously improving our disaster recovery capabilities and sharing your knowledge with the team.
●10+ years in disaster recovery, business continuity, or a related field
●3+ years in a leadership role within a highly technical environment
●In-depth experience with AWS services critical to disaster recovery, such as AWS Backup, Amazon RDS Multi-AZ deployments, AWS Elastic Disaster Recovery, AWS CloudFormation, AWS Global Accelerator, and AWS Fault Injection Simulator (FIS).
●Proficiency in managing DR within cloud environments (AWS, GCP), with experience using tools like Terraform, Chef, Docker, Kubernetes, and Octopus Deploy for orchestrating and managing disaster recovery strategies.
●Extensive knowledge of modern architectures—microservices, serverless computing, containerisation
●Proven track record of leading complex, high-impact DR projects
●Strong familiarity with Agile methodologies and Business Continuity principles
●Exceptional analytical skills
●Ability to craft innovative solutions for complex DR challenges
●Outstanding interpersonal skills
●Ability to communicate complex technical concepts to executive leadership and lead cross-functional teams
●Bachelor’s degree in Computer Science, Information Technology, or related field
●Master’s degree or relevant certifications (e.g., CDRP, CISSP, ISO 22301 LI) are a plus