Esta oferta de trabajo no está disponible en tu país.

AI Systems Engineer – LLM Execution

OpenNebula SystemsPozuelo de Alarcón, Comunidad de Madrid, .ES

Hace más de 30 días

Tipo de contrato

Quick Apply

Descripción del trabajo

For over a decade now, OpenNebula Systems has been leading the development of the European open source technology that helps organizations around the world to manage their corporate data centers and build their Enterprise Clouds.

If you want to join an established leader in the cloud infrastructure industry and the global open source community, keep reading, because you can now join a team of exceptionally passionate and talented colleagues whose mission is to help the world's leading enterprises to implement their next-generation edge and cloud strategies. We are hiring!

Since 2019, and thanks to the support from the European Commission, OpenNebula Systems has been leading the edge computing innovation in Europe , investing heavily in research and open source development, and playing a key role in strategic EU initiatives such as the IPCEI-CIS and the “European Alliance for Industrial Data, Edge and Cloud”.

OpenNebula’s new AI Factory product line delivers sovereign, edge-to-cloud AI infrastructure—enabling enterprises and governments to deploy, orchestrate, and optimize next-generation AI workloads with full control. This role is key to building the execution layer powering that vision. We are currently looking for an AI Systems Engineer to come and join us in Europe as part of our new team developing the AI Factory product line.

Job Description

We are looking for a highly skilled AI Systems Engineer with hands-on experience in executing, tuning, and scaling Large Language Models (LLMs) across multi-GPU infrastructures. This role is central to the development of our new AI Factory product line, which enables open, sovereign, and disaggregated AI infrastructure across cloud and edge environments.

You will help design and optimize LLM execution pipelines, working at the intersection of inference engines, orchestration platforms, and LLM model catalogs. Your responsibilities will include communicating with users, addressing their needs, troubleshooting, and providing step by step solutions.

Responsibilities

Design, implement, and optimize LLM inference pipelines for multi-GPU and multi-node environments.
Integrate with cutting-edge inference engines (e.g., vLLM, TensorRT-LLM, DeepSpeed, etc.).
Tune execution parameters for latency, throughput, and memory efficiency across heterogeneous infrastructures.
Work closely with orchestration frameworks such as Ray, NVIDIA NeMo / Dynamo, and others to coordinate LLM serving at scale.
Integrate with LLM catalogs and registries such as HuggingFace, NVIDIA NIM, and internal repositories.
Collaborate with product and platform teams to shape a modular, portable AI Factory execution layer.
Interact with users and use cases, providing systems support, system architecture definition, making recommendations based on user needs, implementation, testing, user training, and deployment of open source solutions.
Troubleshoot incidents, identify root causes, fix and document problems, and implement preventive measures.
Deliver quality performance indicators within the scope of the assigned project, including project journals, status reports, and other standard documentation.
Work with other companies in the cloud-edge ecosystem within international projects and open-source communities. Availability to occasional travel and participation in international events and meetings.
Write and maintain software documentation and project reports.

Experience required

Academic Background and Certifications

Bachelor’s or Master’s degree in Computer Science, Software Engineering, or a related field.

Professional Experience

Strong hands-on experience deploying and optimizing LLMs in production environments

Experience with inference frameworks such as vLLM, TensorRT, Triton Inference Server, DeepSpeed-Inference, etc.

Hands-on experience with orchestration tools like Ray, NVIDIA NeMo / Dynamo, or KServe.

Experience deploying LLM workloads on hybrid or sovereign cloud environments.

Contributions to open-source LLM or inference projects.

Technical Experience

Deep knowledge of multi-GPU systems and GPU memory management.

Solid understanding of distributed systems and networking bottlenecks in model serving.

Programming experience in Python, with knowledge of CUDA and model quantization a plus.

Familiarity with LLM catalogs (e.g., HuggingFace, NGC, NIM).

Familiarity with open-source MLOps or AI workload orchestration platforms.

Language Skills

English fluency at a professional or native-equivalent level, with excellent clarity and expression in both writing and speech.

Soft Skills & Collaboration

Strong customer service mindset, with a focus on responsiveness and user satisfaction.

Clear communication and documentation with strong written and verbal English, async collaboration, and visibility of work.

Excellent problem-solving skills and a proactive approach to identifying and resolving issues.

Self-management and accountability with ability to work independently, manage time, and take ownership of tasks and deadlines

Technical autonomy and tool proficiency with confidence in using Git, CI / CD, remote collaboration tools (Slack, Zoom, GitHub, etc.), and solving problems without direct supervision.

What's in it for me?

Some of our benefits and perks vary depending on location and employment type, but we are proud to provide employees with the following;

Competitive compensation package and Flexible Remuneration Options : Meals, Transport, Nursery / Childcare…

Customized workstation (macOS, Windows, Linux any distro is welcome)

Private Health Insurance

6 hours workday on Fridays and everyday during August

PTO : Holidays, Personal Time, Sick Time, Parental leave.

All Remote company with bright HQ centrally located in Madrid, and offices in Boston (USA) and Brno (Czech Republic)

Healthy Work-Life Balance : We encourage the right for Digital Disconnecting and promote harmony between employees personal and professional lives

Flexible hiring options : Full Time / Part Time, Employee (Spain / Usa) / Contractor (other locations)

We are building an awesome, Engineering First Culture and your opinion matters : Thrive in the high-energy environment of a young company where openness, collaboration, risk-taking, and continuous growth are valued

Be exposed to a broad technology ecosystem. We encourage learning and researching new technologies and methods as part of your everyday duties

Crear una alerta de empleo para esta búsqueda

Ai Engineer • Pozuelo de Alarcón, Comunidad de Madrid, .ES

Ofertas relacionadas

Oferta promocionada

LLM Engineer

European Tech Recruitmadrid, España

Our client is Europe’s largest software companys With over 200+ team members worldwide, they deliver hyper-efficient AI that powers industries from finance to energy to manufacturing.They're lookin...Mostrar másÚltima actualización: hace 19 días

Oferta promocionada

Software Engineer - AI / ML Ops

HuspyMadrid, Comunidad de Madrid, España

Software Engineer - AI / ML Ops.Get AI-powered advice on this job and more exclusive features.The Story So Far : We’re Building a Global Brand in Real Estate. Huspy is one of the leading property tec...Mostrar másÚltima actualización: hace 1 día

Oferta promocionada

AI / ML Team Lead – Generative AI (LLMs, AWS)

Provectus IT, Inc., , Spain, España

Belgrade , Poland , Croatia , Spain , Greece , Italy , Bulgaria.Provectus helps companies adopt ML / AI to transform the ways they operate, compete, and drive value. The focus of the company is on bui...Mostrar másÚltima actualización: hace 19 días

Oferta promocionada

AI / ML Team Lead – Generative AI (LLMs, AWS)

Provectus, , Spain, España

Provectus helps companies adopt ML / AI to transform the ways they operate, compete, and drive value.The focus is on building ML infrastructure to drive end-to-end AI transformations, assisting busin...Mostrar másÚltima actualización: hace 15 días

Oferta promocionada

AI Machine Learning Engineer : AI Shopping Agents (Remote)

Constructor, , Spain, España

AI Machine Learning Engineer : AI Shopping Agents (Remote).AI Machine Learning Engineer : AI Shopping Agents (Remote).Get AI-powered advice on this job and more exclusive features.Constructor is the ...Mostrar másÚltima actualización: hace más de 30 días

Oferta promocionada

AI / ML TechLead (LLMs, AWS), $8000 Sign-On Bonus

ProvectusMadrid, Comunidad de Madrid, España

Provectus helps companies adopt ML / AI to transform the ways they operate, compete, and drive value.The focus of the company is on building ML Infrastructure to drive end-to-end AI transformations, ...Mostrar másÚltima actualización: hace 21 días

Oferta promocionada

MLOps Engineer (AI Platform)

Omilia, , Spain, España

Are you ready to move beyond maintaining legacy systems and build something truly new? What if your next role gave you the keys to architect an entire AI platform from the ground up, powering syste...Mostrar másÚltima actualización: hace 1 día

Oferta promocionada

AI Machine Learning & LLM Software Engineer - Madrid or Barcelona - Fixed Term Contract - Hybri[...]

EnvirorecMadrid, Comunidad de Madrid, España

AI Machine Learning & LLM Software Engineer - Madrid or Barcelona - Fixed Term Contract - Hybrid remote working - up to €60k-80k p / a - English language only. We are a specialist recruitment agency s...Mostrar másÚltima actualización: hace 1 día

Oferta promocionada

Lead Machine Learning Engineer – LLMs - Ramboll Tech

Ramboll Group A / SMadrid, Comunidad de Madrid, España

Lead Machine Learning Engineer – LLMs - Ramboll Tech.At Ramboll Tech, we believe innovation thrives in diverse, supportive environments where everyone can contribute their best ideas.As a Lead Mach...Mostrar másÚltima actualización: hace más de 30 días

Oferta promocionada

LLM Engineer

UltralyticsMadrid, Community of Madrid, Spain

AI, building the world's leading.We're looking for passionate individuals obsessed with AI, eager to make a global impact, and ready to excel in a dynamic, high-energy environment.This full-time LL...Mostrar másÚltima actualización: hace más de 30 días

Oferta promocionada

Middle / Senior AI, ML Engineer

Provectus, , Spain, España

Join Provectus to be part of a team building cutting-edge technology solutions with a positive impact on society.Our company specializes in AI and ML technologies, cloud services, and data engineer...Mostrar másÚltima actualización: hace 11 días

Oferta promocionada

Senior ML / AI Engineer

Astrafy SAMadrid, Comunidad de Madrid, España

ML / AI engineering or data science roles.Google Cloud Platform (GCP), including hands-on use of Vertex AI and BigQuery.Proven experience delivering production-grade models and scalable ML systems.St...Mostrar másÚltima actualización: hace 11 días

Oferta promocionada

AI / ML Engineer (Remote in Spain)

M47 - AI Company, , Spain, España

AI / ML Engineer (Remote in Spain) at.We spark AI and help tech companies understand how AI can drive strategic objectives and plan implementation roadmaps. Join us in making the future more intellige...Mostrar másÚltima actualización: hace 4 días

Oferta promocionada

Senior IA / ML Engineer (Eng / Esp)

Plain Concepts, , Spain, España

We're growing our AI Dream Team! We are looking for a.Your role will involve crafting tailored solutions, training, deploying, and implementing groundbreaking developments in AI and machine learnin...Mostrar másÚltima actualización: hace más de 30 días

Oferta promocionada

Senior IA / ML Engineer

Plain Concepts, , Spain, España

Titles? Meh, we’re not big on them, but let’s call this one.As part of our international AI / ML squad, you’ll craft tailor-made solutions that wow our clients. We’re hunting for a passionate AI Engin...Mostrar másÚltima actualización: hace más de 30 días

Oferta promocionada

Machine Learning Engineer

Alinia AImadrid, Madrid, SPAIN

Machine Learning Engineer (Infra & Deployment).Remote (EU / US friendly time zones) | Full-time.AI safe and reliable for regulated industries such as finance. ML infra and take research to production : ...Mostrar másÚltima actualización: hace 16 días

Oferta promocionada

AI-Driven Systems Architect

beBeeDataMadrid, Comunidad de Madrid, España

We're building a pioneering AI platform that delivers hyper-personalized customer experiences at scale.Processing vast amounts of data daily, we utilize cutting-edge machine learning and large lang...Mostrar másÚltima actualización: hace 1 día

Oferta promocionada

Shelpuk AI Technology Consulting - Machine Learning Engineer

Dataphoenix, , Spain, España

We are a team of AI technology consultants and engineers with decades of experience in helping technology companies build sustainable competitive advantages through AI technology.From algorithm dev...Mostrar másÚltima actualización: hace 23 días