Talent.com
Senior Site Reliability Engineer

ICEO - Venture Builder
Madrid, Madrid, Spain
Posted 20 days ago
Job description

Senior Site Reliability Engineer. Remote

Shape the reliability experience of an always‑on crypto platform delivering seamless service across North America and Europe.

As an SRE, you’ll lead the design of active‑active failover, cross‑region routing, and distributed services that are resilient to cloud outages and geopolitical quirks. This is real‑world chaos engineering at a global scale. You’ll also lead the creation of a region‑aware CI/CD pipeline with canary deployments, automated rollbacks, and feature flags tailored per continent.

Join us remotely; you can be located anywhere in Europe within the CET/CEST time zones. The position is 100% remote and full‑time.

About us: ZND is the simple gateway to digital finance, already trusted by thousands of users who have moved more than €30 million through the platform. Fueled by a token raise, we’re rolling out an AI chat assistant for every action and an instant credit line on your digital assets. Our bold vision is to be the place where anyone can trade, earn, ask, and borrow in seconds – crypto made effortless.

What you will be doing:

Set and drive SRE strategy – translate business goals into quarterly reliability targets, track progress, and adjust course as needed.

Own GCP/GKE architecture – design, implement, and maintain secure, low‑latency, highly available clusters across regions.

Automate reliability – build self‑healing, auto‑scaling, and automated incident‑response workflows that minimise manual toil.

Embed high availability – partner with engineers and product to ship fault‑tolerant Node.js/JVM services and predictable releases.

Manage SLIs and error budgets – define, monitor, report, and continuously improve service reliability metrics.

Execute chaos engineering – plan and run automated fault‑injection (e.g., Chaos Mesh) to validate resilience before customers are affected.

Lead incidents – coordinate response, run blameless post‑mortems, and ensure corrective actions are prioritised and implemented.

Capacity and cost planning – forecast growth, right‑size resources, and optimise spend without sacrificing performance.

Document and share knowledge – create clear architecture diagrams, runbooks, and playbooks to keep the organisation unblocked.

Mentor and influence – champion SRE and DevOps best practices.

Engage in team rituals – contribute to daily stand‑ups, sprint planning, and roadmap reviews to keep reliability work aligned with product goals.

What you need:

6+ years in DevOps/SRE with full platform ownership and risk‑based decision making.

Prior experience in fintech or crypto.

Kubernetes and Helm in daily use, Docker containerisation, CI/CD pipelines, and version control.

Linux administration on Debian/Ubuntu; strong networking skills covering HTTP(S), DNS, TCP/IP, SSH, firewalls, proxies, and load balancers.

Observability stack: Prometheus, Grafana.

Production experience with Kafka, Redis, and Nginx.

Hands‑on cloud work in GCP, AWS, or Azure, including HA/DR design with HPA, KEDA, and affinity/anti‑affinity rules.

Proficiency in at least one programming language (Python, Go, C++, or Java) and operational depth with JVM and Node.js services.

English proficiency at B2 level or higher (written and spoken).

Personal traits: high ownership, open‑minded, naturally curious, strong communicator.

What we offer:

Remote‑first company – we enable you to work from anywhere in the world.

Flexible working hours – core hours 11 am–3 pm CET, with flexibility outside those hours.

38 days of paid vacation leave – plus 14 days of paid sick leave per year.

Join a forward‑thinking team where you have the autonomy to make your own choices and explore new ideas.

Automation & IaC – Bash, Python, GoLang, Terraform.

Security – SOPS, Okta, TFsec, Trivy, Istio.

Salary: B2B contract, €75,000–€90,000 per year, plus additional compensation for on‑call responsibilities.

The recruitment process

Stage 1: Screening meeting with a Talent Acquisition Partner – basic information about ICEO, the project, the role, and the offer (≈ 45 min).

Stage 2: Technical interview with 2 developers – focus on system architecture, problem solving, and technology choices (≈ 1 h).

Stage 3: Interview with the Lead of DevOps – practical questions on Docker, Kubernetes, Linux, and case tasks (≈ 1 h).

Stage 4: Final interview with the Head of Technology (30 min).

Background check after an offer is extended (validity of the offer depends on a successful background check).
