Job 1 van 26


Report this listing

Solliciteren



Lead Site Reliability Engineer


The Company:

Headquatered in belgium, this company is a global leader in manufacturing of equipment that supports sectors like logistics and construction. Their services include, tehcnical services, electronics repair, and training programs all backed by a commitment to innovation and operational excellence.


About the Role:

I am looking for a Site Reliability Engineering (SRE) Lead to take ownership of the reliability, performance, and scalability of my clients systems. You’ll play a key role in designing and implementing infrastructure solutions that enable their engineering teams to deploy faster and more confidently—without compromising stability or uptime.


As the SRE Lead, you’ll mentor a growing team of SREs, drive best practices in observability, automation, and incident management, and collaborate cross-functionally to ensure a seamless experience for both our internal teams and customers.


What You’ll Be Doing:

Leadership & Strategy

-Lead and grow a high-performing SRE team.

-Define and drive the SRE roadmap aligned with business goals.

-Advocate for a culture of reliability, automation, and continuous improvement.

System Reliability & Performance

-Own SLAs, SLOs, and error budgets for critical systems.

-Monitor system performance, diagnose issues, and implement long-term fixes.

Incident Response & Prevention

-Coordinate high-impact incident response efforts and postmortems.

-Drive root cause analysis and long-term improvements.

Tooling & Automation

-Build and enhance internal tooling to improve deployment, monitoring, and reliability.

-Implement infrastructure as code and CI/CD best practices.

Collaboration

-Work closely with engineering, security, and product teams to ensure reliability is factored into planning and development.

-Promote DevOps principles and empower teams with self-service infrastructure.


What We’re Looking For:

-Proven experience in an SRE or DevOps leadership role.

-Deep understanding of networking, containers (Docker, Kubernetes), and --cloud infrastructure (AWS/GCP/Azure).

-Strong skills in monitoring, observability, and alerting systems (Prometheus, Grafana, Datadog, etc.).

-Proficiency with infrastructure-as-code tools like Terraform or Pulumi.

-Experience with CI/CD pipelines and GitOps practices.

-Excellent communication and incident management skills.

-Passion for automation, documentation, and mentoring others.

-English - Dutch considered a plus

-Visa sponsorship not possible


Nice to Have:

-Experience with high-scale, customer-facing applications.

-Familiarity with service meshes, distributed tracing, or chaos engineering.

-Certifications in cloud or Kubernetes.


Interested to learn more, lets chat!

Solliciteren

Meer banen van je zoekopdracht