The Company:
Headquatered in belgium, this company is a global leader in manufacturing of equipment that supports sectors like logistics and construction. Their services include, tehcnical services, electronics repair, and training programs all backed by a commitment to innovation and operational excellence.
About the Role:
I am looking for a Site Reliability Engineering (SRE) Lead to take ownership of the reliability, performance, and scalability of my clients systems. You’ll play a key role in designing and implementing infrastructure solutions that enable their engineering teams to deploy faster and more confidently—without compromising stability or uptime.
As the SRE Lead, you’ll mentor a growing team of SREs, drive best practices in observability, automation, and incident management, and collaborate cross-functionally to ensure a seamless experience for both our internal teams and customers.
What You’ll Be Doing:
Leadership & Strategy
-Lead and grow a high-performing SRE team.
-Define and drive the SRE roadmap aligned with business goals.
-Advocate for a culture of reliability, automation, and continuous improvement.
System Reliability & Performance
-Own SLAs, SLOs, and error budgets for critical systems.
-Monitor system performance, diagnose issues, and implement long-term fixes.
Incident Response & Prevention
-Coordinate high-impact incident response efforts and postmortems.
-Drive root cause analysis and long-term improvements.
Tooling & Automation
-Build and enhance internal tooling to improve deployment, monitoring, and reliability.
-Implement infrastructure as code and CI/CD best practices.
Collaboration
-Work closely with engineering, security, and product teams to ensure reliability is factored into planning and development.
-Promote DevOps principles and empower teams with self-service infrastructure.
What We’re Looking For:
-Proven experience in an SRE or DevOps leadership role.
-Deep understanding of networking, containers (Docker, Kubernetes), and --cloud infrastructure (AWS/GCP/Azure).
-Strong skills in monitoring, observability, and alerting systems (Prometheus, Grafana, Datadog, etc.).
-Proficiency with infrastructure-as-code tools like Terraform or Pulumi.
-Experience with CI/CD pipelines and GitOps practices.
-Excellent communication and incident management skills.
-Passion for automation, documentation, and mentoring others.
-English - Dutch considered a plus
-Visa sponsorship not possible
Nice to Have:
-Experience with high-scale, customer-facing applications.
-Familiarity with service meshes, distributed tracing, or chaos engineering.
-Certifications in cloud or Kubernetes.
Interested to learn more, lets chat!