Talent.com
Evaluation Engineer
European Tech Recruit • Madrid, Spain
Posted 13 days ago
Job description

We are seeking a highly skilled Evaluation Engineer to design and lead the evaluation strategy for our Agentic AI and Retrieval-Augmented Generation (RAG) systems.

In this role, you’ll translate complex customer workflows into measurable success metrics, ensuring our systems deliver reliable, explainable, and high-performing results across real-world applications.

Responsibilities

  • You will design and execute rigorous evaluation frameworks that measure reasoning, factual accuracy, reliability, and user success across diverse problem domains. This includes building reproducible evaluation pipelines with datasets, test suites, and automated tracking of regressions and improvements.
  • You’ll work closely with ML specialists and engineers to develop task-based, multi-step evaluations that reflect real-world system behavior—spanning retrieval, planning, memory, and tool usage—and inform continuous improvement.
  • Your work will also involve curating and generating high-quality datasets, implementing LLM-as-a-judge methods calibrated with human feedback, and conducting deep error analyses to identify and classify failure modes.
  • You’ll partner across teams to ensure evaluations align with production metrics such as latency, cost, and reliability, and you’ll contribute to high engineering standards through clear documentation, code reviews, and mentorship.

Qualifications

  • Master’s or Ph.D. in Computer Science, Machine Learning, Data Science, or a related technical field.
  • 3+ years (mid-level) or 5+ years (senior) of experience in applied AI/ML, ideally with production-deployed systems.
  • Proven expertise in designing evaluation methodologies for ML systems, especially LLMs, RAG, or multi-agent architectures.
  • Experience creating and curating datasets, including synthetic and adversarial data, for evaluation and training.
  • Strong proficiency in Python, with hands-on experience using frameworks such as PyTorch, HuggingFace, LangGraph, LlamaIndex, and related ML/agentic toolkits.
  • Familiarity with cloud environments (preferably AWS) and good software engineering practices (Git, Docker, reproducible ML pipelines).
  • Excellent analytical, problem-solving, and communication skills, with the ability to turn ambiguity into structured, data-driven evaluation approaches.
  • Fluent in English.

If you are motivated by advancing the frontiers of intelligent systems and have the experience to design rigorous, real-world evaluations for cutting-edge AI technologies, we invite you to apply now or email your CV to

By applying to this role you understand that we may collect your personal data and store and process it on our systems. For more information, please see our Privacy Notice.

In accordance with local employment laws, applicants must have current, valid authorisation to work in Spain at the time of application. We are unable to sponsor employment visas for this role. Applications from individuals without existing work authorisation for Spain cannot be considered.
