AI Systems Engineer – LLM Execution

OpenNebula Systems, Pozuelo de Alarcón, Comunidad de Madrid, Spain
Posted more than 30 days ago
About the company

For over a decade now, OpenNebula Systems has been leading the development of the European open source technology that helps organizations around the world to manage their corporate data centers and build their Enterprise Clouds.

If you want to join an established leader in the cloud infrastructure industry and the global open source community, keep reading, because you can now join a team of exceptionally passionate and talented colleagues whose mission is to help the world's leading enterprises to implement their next-generation edge and cloud strategies. We are hiring!

Since 2019, with support from the European Commission, OpenNebula Systems has been leading edge computing innovation in Europe, investing heavily in research and open source development, and playing a key role in strategic EU initiatives such as the IPCEI-CIS and the “European Alliance for Industrial Data, Edge and Cloud”.

OpenNebula’s new AI Factory product line delivers sovereign, edge-to-cloud AI infrastructure, enabling enterprises and governments to deploy, orchestrate, and optimize next-generation AI workloads with full control. This role is key to building the execution layer powering that vision. We are currently looking for an AI Systems Engineer to join us in Europe as part of our new team developing the AI Factory product line.

Job Description

We are looking for a highly skilled AI Systems Engineer with hands-on experience in executing, tuning, and scaling Large Language Models (LLMs) across multi-GPU infrastructures. This role is central to the development of our new AI Factory product line, which enables open, sovereign, and disaggregated AI infrastructure across cloud and edge environments.

You will help design and optimize LLM execution pipelines, working at the intersection of inference engines, orchestration platforms, and LLM catalogs. Your responsibilities will also include communicating with users, addressing their needs, troubleshooting, and providing step-by-step solutions.

Responsibilities

  • Design, implement, and optimize LLM inference pipelines for multi-GPU and multi-node environments.
  • Integrate with cutting-edge inference engines (e.g., vLLM, TensorRT-LLM, DeepSpeed).
  • Tune execution parameters for latency, throughput, and memory efficiency across heterogeneous infrastructures.
  • Work closely with orchestration frameworks such as Ray, NVIDIA NeMo / Dynamo, and others to coordinate LLM serving at scale.
  • Integrate with LLM catalogs and registries such as HuggingFace, NVIDIA NIM, and internal repositories.
  • Collaborate with product and platform teams to shape a modular, portable AI Factory execution layer.
  • Interact with users and their use cases: provide systems support, define system architecture, make recommendations based on user needs, and handle implementation, testing, user training, and deployment of open source solutions.
  • Troubleshoot incidents, identify root causes, fix and document problems, and implement preventive measures.
  • Deliver quality performance indicators within the scope of the assigned project, including project journals, status reports, and other standard documentation.
  • Work with other companies in the cloud-edge ecosystem within international projects and open-source communities. Availability for occasional travel and participation in international events and meetings is required.
  • Write and maintain software documentation and project reports.

Experience required

Academic Background and Certifications

  • Bachelor’s or Master’s degree in Computer Science, Software Engineering, or a related field.

Professional Experience

  • Strong hands-on experience deploying and optimizing LLMs in production environments.
  • Experience with inference frameworks such as vLLM, TensorRT, Triton Inference Server, or DeepSpeed-Inference.
  • Hands-on experience with orchestration tools like Ray, NVIDIA NeMo / Dynamo, or KServe.
  • Experience deploying LLM workloads on hybrid or sovereign cloud environments.
  • Contributions to open-source LLM or inference projects.

Technical Experience

  • Deep knowledge of multi-GPU systems and GPU memory management.
  • Solid understanding of distributed systems and networking bottlenecks in model serving.
  • Programming experience in Python, with knowledge of CUDA and model quantization a plus.
  • Familiarity with LLM catalogs (e.g., HuggingFace, NGC, NIM).
  • Familiarity with open-source MLOps or AI workload orchestration platforms.

Language Skills

  • English fluency at a professional or native-equivalent level, with excellent clarity and expression in both writing and speech.

Soft Skills & Collaboration

  • Strong customer service mindset, with a focus on responsiveness and user satisfaction.
  • Clear communication and documentation: strong written and verbal English, asynchronous collaboration, and visibility of work.
  • Excellent problem-solving skills and a proactive approach to identifying and resolving issues.
  • Self-management and accountability: ability to work independently, manage time, and take ownership of tasks and deadlines.
  • Technical autonomy and tool proficiency: confidence in using Git, CI/CD, and remote collaboration tools (Slack, Zoom, GitHub, etc.), and in solving problems without direct supervision.

What's in it for me?

Some of our benefits and perks vary depending on location and employment type, but we are proud to provide employees with the following:

  • Competitive compensation package and flexible remuneration options: Meals, Transport, Nursery / Childcare…
  • Customized workstation (macOS, Windows, or Linux; any distro is welcome)
  • Private Health Insurance
  • 6-hour workday on Fridays and every day during August
  • PTO: holidays, personal time, sick time, parental leave
  • All-remote company with a bright HQ centrally located in Madrid and offices in Boston (USA) and Brno (Czech Republic)
  • Healthy work-life balance: we support the right to digital disconnection and promote harmony between employees' personal and professional lives
  • Flexible hiring options: Full Time / Part Time, Employee (Spain / USA) / Contractor (other locations)
  • We are building an awesome, engineering-first culture, and your opinion matters: thrive in the high-energy environment of a young company where openness, collaboration, risk-taking, and continuous growth are valued
  • Exposure to a broad technology ecosystem: we encourage learning and researching new technologies and methods as part of your everyday duties