About Lucidya
Lucidya is a fast-growing SaaS company powered by Machine Learning and Big Data technologies. We help brands unlock actionable insights by analyzing customer data from diverse digital channels. Our team thrives on solving tough problems in a collaborative, fast-paced, and impact-driven environment.
Role Overview
We are looking for a Cloud & DevOps Engineering Manager to lead the development, reliability, and scalability of our cloud infrastructure, deployment pipelines, and production operations. This is a strategic and hands-on leadership role that blends DevOps, SRE, and Cloud responsibilities to ensure our platform is secure, reliable, and ready to scale rapidly.
As we prepare for our next stage of growth, this role is vital in helping shape the future of Lucidya’s technical strategy. While we understand this position encompasses multiple areas—Cloud, DevOps, SRE, Security, and Operations—it reflects our current team structure. The successful candidate will work closely with leadership to rapidly scale the team and evolve the organization toward industry-standard engineering practices.
Key Responsibilities
☁️ Cloud Infrastructure & Operations
- Architect and manage secure, scalable, and cost-efficient cloud environments (AWS, GCP, or Azure).
- Oversee Linux-based systems and ensure system availability, uptime, and performance.
- Maintain and evolve Infrastructure-as-Code practices using tools like Terraform and Ansible.
- Own disaster recovery, backup strategies, and business continuity planning.
- Monitor and respond to incidents, drive incident resolution, and implement preventive measures.
⚙️ DevOps & CI/CD
- Design, build, and continuously improve CI/CD pipelines to support frequent, safe, and automated deployments.
- Maintain consistency across dev, staging, and production environments.
- Work closely with developers to streamline the deployment and delivery process.
- Encourage a DevOps culture focused on automation, speed, and reliability.
🔐 Security & Compliance
- Implement and enforce infrastructure and application security best practices.
- Manage cloud security, IAM policies, network protection, and vulnerability response.
- Ensure compliance with data protection and industry standards (e.g., ISO, SOC, GDPR where applicable).
- Conduct regular security audits and drive improvements proactively.
📈 Site Reliability Engineering (SRE)
- Define and track SLAs, SLOs, and SLIs to ensure reliability and performance standards are met.
- Lead incident postmortems and drive systemic improvements.
- Identify and eliminate operational toil through automation and tooling.
- Implement observability best practices with modern monitoring, alerting, and logging solutions.
🧑‍🤝‍🧑 Leadership & Team Growth
- Lead, mentor, and support a high-performing team of engineers.
- Define roles, set clear goals, conduct performance reviews, and support team development.
- Build a hiring roadmap and work closely with leadership to scale the team rapidly toward an industry-standard structure.
- Collaborate with engineering, product, and executive teams to align infrastructure goals with business priorities.
📊 Project & Stakeholder Management
- Drive project execution from planning to delivery across multiple stakeholders.
- Prioritize initiatives, set realistic timelines, and communicate progress transparently.
- Ensure alignment between product needs and technical capabilities.
- Champion a culture of collaboration, ownership, and continuous improvement.