Open Assessment Technologies is growing, and we’re looking for a DevOps Engineer to join our Cloud Operations team ensure the reliability, scalability, and performance of the platforms supporting national Computer-Based Assessment (CBA) campaigns across multiple countries. You will bridge the gap between software development and operations by applying engineering principles to infrastructure and deployment processes.
You will operate across both Google Cloud Platform (GCP) and Amazon Web Services (AWS) environments, supporting containerized workloads and legacy infrastructures alike. Your work will focus on automation, observability, CI / CD optimization, incident management, and improving system resilience to guarantee stable exam delivery to hundreds of thousands of students simultaneously.
Duties and responsibilities
Infrastructure & Cloud Management
- Design, implement, and maintain highly available, scalable, and secure cloud-based infrastructure on both GCP and AWS.
- Manage GCP-based Kubernetes clusters and AWS-based infrastructures.
- Apply Infrastructure as Code (IaC) practices using Terraform, Ansible or equivalent tools.
- Manage deployment workflows via ArgoCD, CodeDeploy and GitHub Actions.
Deployment Automation & CI / CD
Build and maintain CI / CD pipelines to automate provisioning, configuration, and delivery processes.Support integration engineers and QA in preparing, validating, and deploying environments for national assessment campaigns.Monitoring, Reliability & Incident Management
Set up and maintain observability systems using Google Cloud Operations Suite, Dynatrace, Grafana, and Datadog.Participate in on-call rotations during national campaigns, ensuring uptime, rapid incident response, and efficient postmortem follow-up.Define and maintain Service Level Objectives (SLOs), Service Level Indicators (SLIs), and Service Level Agreements (SLAs) for critical services.Scalability, Performance & Resilience
Support capacity planning, load testing, and performance optimization to handle peak traffic during nationwide assessments.Implement self-healing, redundancy, and failover strategies across multi-cloud environments.Security & Compliance
Ensure compliance with ISO 27001, SOC 2, and GDPR standards.Implement IAM best practices, network security (firewalls, VPNs, service mesh), and data protection measures.Collaborate with security and compliance teams to maintain high standards across deployments.Collaboration & Continuous Improvement
Work closely with integration engineers, QA, and project managers to align operational reliability with campaign goals.Promote DevOps and SRE best practices.Contribute to documentation, automation scripts, and reusable deployment templates.Qualifications & Skills
5+ years of experience in DevOps, SRE, or Cloud Engineering roles.Strong expertise with GCP (GKE, Cloud Run, Cloud Functions, Cloud SQL, BigQuery, and Pub / Sub) and AWS (EC2, RDS, ElastiCache, S3, Lambda, CodeDeploy, …).Hands-on experience with Kubernetes, Terraform, Ansible, Helm, ArgoCD, and GitHub Actions.Scripting proficiency in Python, Bash, NodeJS, or Go.Write and optimize SQL queries for troubleshooting and performance monitoring.Experience with monitoring and logging systems : Dynatrace, Grafana, Datadog, Google Cloud Operations Suite (Stackdriver).Understanding and usage CI / CD (ArgoCD, GitHub Actions), GitOps, and IaC methodologies.Knowledge of PostgreSQL, Redis, but also ElasticSearch, Firestore and other NoSQL databases.Solid understanding of networking, security, and identity management in cloud environments.Familiarity with ISO 27001 controls and data protection practices.Excellent problem-solving and communication skills; ability to collaborate in cross-functional teams.Knowledge about ElasticSearch is a plus.Google Cloud and / or AWS certifications are a plus.Key Attributes
Strong sense of ownership and accountability for system reliability and performance.Ability to thrive under pressure, and manage incidents during live national campaigns.Curious mindset, eager to automate, optimize, and continuously improve operational processes.Team-oriented, with a collaborative approach and commitment to blameless culture.Working conditions
Permanent position based in Luxembourg or Spain.The employee will work during regular daily working hours, 8 hours a day.Remote work is possible, depending on regulations in the country of the employee.The employee undertakes to work overtime when required by the workload, including weekends and public holidays in cases of extreme urgency (paid accordingly depending on regulations of the country of the employee).