Job Description
Mozn is a rapidly growing technology firm revolutionising the field of Artificial Intelligence and Data Science. Headquartered in Riyadh, Saudi Arabia, it is working to realise Vision 2030, with a proven track record of excellence in supporting and growing the tech ecosystem in Saudi Arabia and the GCC region. Mozn is the trusted AI technology partner for some of the largest government organizations, as well as many large corporations and startups.
We are in an exciting stage of scaling the company to provide AI-powered products and solutions both locally and globally that ensure the growth and prosperity of our digital humanity. It is an exciting time to work in the field of AI to create a long-lasting impact.
We are looking for a Senior Site Reliability Engineer to join our team. In this role, you’ll help ensure our systems are running and secure, manage our various enterprise applications, and maintain our networks.
Responsibilities and Duties
- Your work will be a mixture of software engineering, system architecture design, and operations.
- You will be part of the Engineering team working on a project. You will attend morning meetings and sprint planning as the SRE member of the team.
- You will help design, build, support, and scale our cloud and on-premise infrastructure, including monitoring, alerting, and debugging infrastructure.
- You will design and implement continuous integration and deployment workflows, with best practices in testing, linting, and dependency management.
- You will maintain our data stores: monitor load, design and implement backup and restore plans, and handle scaling and clustering (sharding/replication).
- You will collaborate and coordinate with other departments (product, data science, etc.) to support their use cases.
- You will explore and learn new technologies that can complement or replace parts of our current stack.
- You will install servers and network equipment and configure them using infrastructure-as-code techniques.
- You will practice sustainable incident response and blameless postmortems.