Talent.com
Esta oferta de trabajo no está disponible en tu país.
AI Evaluation Data Scientist - AI / ML / LLM - (Hybrid (Hybrid) - Barcelona

AI Evaluation Data Scientist - AI / ML / LLM - (Hybrid (Hybrid) - Barcelona

European Tech RecruitBarcelona, Cataluña, España
Hace 1 día
Descripción del trabajo

Overview

AI Evaluation Data Scientist - AI / ML / LLM - Hybrid (Hybrid) - Barcelona

Principal Consultant | Semiconductor, Automotive, Software Engineering

AI Evaluation Data Scientist

A fantastic opportunity for a driven AI Data Scientist to join a leading Quantum AI company, who work on cutting-edge solutions that make AI faster, greener, and more accessible. You’ll be working alongside world-leading experts in quantum computing and AI, with the opportunity to work on challenging projects and shape the future of Generative AI systems.

This is initially a 9 Month Fixed Term Contract, with scope to extend - Hybrid working from sites in Madrid or Barcelona.

Responsibilities

  • Design and lead the evaluation strategy for our Agentic AI and RAG systems, turning customer workflows and business needs into measurable metrics and clear success criteria.
  • Contribute to the end-to-end design of Agentic AI and RAG systems, injecting a data-and-evaluation perspective into retrieval strategies, orchestration policies, tool usage, and memory to solve complex, real-world problems across industries.
  • Develop task-based, multi-step evaluations that reflect how the different components of our systems (retrieval, planning, tool use, memory) perform in real-world scenarios across cloud and edge deployments.
  • Develop and refine rigorous evaluation frameworks that reflect real-world performance, going beyond model benchmarks to assess task success, reasoning capabilities, factual consistency, reliability, and user success metrics across diverse problem domains.
  • Build and maintain a reproducible evaluation pipeline, including datasets, scenarios, configs, test suites, versioned assets, and automated runs to track regressions and improvements over time.
  • Curate and generate high-quality datasets for evaluation, including synthetic and adversarial data, to strengthen coverage and robustness.
  • Implement and calibrate LLM-as-a-judge evaluations, aligning automated scoring with human feedback and ensuring fairness, robustness, and representativeness.
  • Perform deep error analyses and ablations to uncover failure patterns, maintain a taxonomy of failure modes (reasoning, grounding, hallucinations, tool failures), and provide actionable insights to engineers to improve model and system performance.
  • Partner with ML specialists to create a data flywheel, where evaluation continuously informs new dataset creation, improvements on prompts, tool usage, model training, and system refinements, quantifying improvements over time.
  • Define and monitor operational metrics (latency, cost, reliability) to ensure evaluations align with production and customer expectations.
  • Maintain high engineering standards, including clear documentation, reproducible experiments, robust version control, and well-structured ML pipelines.
  • Contribute to team learning and mentorship, guiding junior engineers and sharing expertise in LLM development, evaluation, and deployment best practices.
  • Participate in code reviews, offering thoughtful, constructive feedback to maintain code quality, readability, and consistency.

Required minimum Qualifications

  • Master's or Ph.D. in Computer Science, Machine Learning, Data Science, Physics, Engineering, or related technical fields, with relevant industry experience.
  • Solid hands-on experience (3+ years for mid-level, 5+ years for senior) working as a Data Scientist, ML Engineer, or Research Scientist in applied AI / ML projects deployed in production environments.
  • Strong background in evaluation of machine learning systems, ideally with experience in LLMs, RAG pipelines, or multi-agent systems.
  • Proven ability to design and implement evaluation methodologies that go beyond static benchmarks, capturing real-world task success, reasoning, and robustness.
  • Hands-on experience with dataset creation and curation (including synthetic data generation) for training and evaluation.
  • Proven experience with agent-based architectures (task decomposition, tool use, reasoning workflows), RAG architectures (retrievers, vector databases, rerankers), and orchestration frameworks (LangGraph, LlamaIndex).
  • Strong problem-solving skills, with the ability to navigate ambiguity and design practical solutions to open-ended user or business needs.
  • Strong software engineering skills, with proficiency in Python, Docker, Git, and experience building robust, modular, and scalable ML codebases.
  • Familiarity with common ML and data libraries and frameworks (e.g., PyTorch, HuggingFace, LangGraph, LlamaIndex, Pandas, etc.).
  • Experience with cloud platforms (ideally AWS).
  • By applying to this role, you understand that we may collect your personal data & store & process it on our systems. For more information please see our Privacy Notice (

    Note : This posting includes an HTML-only description refined for formatting compliance. No additional roles or unrelated listings are included.

    #J-18808-Ljbffr

    Crear una alerta de empleo para esta búsqueda

    Data Scientist • Barcelona, Cataluña, España

    Ofertas relacionadas
    • Oferta promocionada
    • Nueva oferta
    Trustworthy AI Data Scientist

    Trustworthy AI Data Scientist

    EurecatBarcelona, Cataluña, España
    The Opportunity : You will join the Trustworthy AI team, a tight group of researchers and developers working on cutting-edge projects in explainable AI, AI compliance, data quality and data value fo...Mostrar másÚltima actualización: hace 8 horas
    Data & AI Strategy Consultant, Barcelona

    Data & AI Strategy Consultant, Barcelona

    AccentureBarcelona, Spain
    Data & AI Strategy Consultant We are looking for a Data AI Strategy Consultant / Manager to join our Data AI Strategy Consulting team, specializing in Supply Chain Operations (SC O).This role...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    Business Intelligence Consultant

    Business Intelligence Consultant

    SlimstockViladecans, Catalonia, Spain
    Descripción en Ingles & Espanol.Slimstock es una consultoría internacional en el ámbito de la logística, especializada en la previsión de la demanda y la gestión de aprovisionamiento, que implanta ...Mostrar másÚltima actualización: hace más de 30 días
    Applied Scientist, ATS Machine Learning, Barcelona

    Applied Scientist, ATS Machine Learning, Barcelona

    AmazonBarcelona, España
    Applied Scientist, ATS Machine Learning Are you interested in building state-of-the-art machine learning systems for the most complex, and fastest growing, transportation network in the world? If s...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    AI Engineer

    AI Engineer

    NeuralTrustBarcelona, Catalonia, Spain
    At NeuralTrust, we’re looking for an.We’re a Barcelona-based startup.Our SaaS platform provides the essential tools for large companies to integrate generative AI into their products and services i...Mostrar másÚltima actualización: hace 29 días
    • Oferta promocionada
    Data Scientist

    Data Scientist

    Amaris ConsultingPalau-solità i Plegamans, Catalonia, Spain
    Take your career to the next level with Amaris Consulting as a.Become part of an international team, thrive in a global group with a turnover of €800 million and over 1,000 clients worldwide, and w...Mostrar másÚltima actualización: hace 11 días
    • Oferta promocionada
    AI Data Scientist (Fixed-term contract)

    AI Data Scientist (Fixed-term contract)

    MULTIVERSE COMPUTINGBarcelona, Cataluña, España
    Come and join our multicultural team!.We are looking to fill this role immediately and are reviewing applications daily.Expect a fast, transparent process with quick feedback.We are a European deep...Mostrar másÚltima actualización: hace 1 día
    • Oferta promocionada
    Design Lead

    Design Lead

    SenseisTalentMontornès del Vallès, Catalonia, Spain
    Somos SenseisTalent, el partner de recruitment de grandes empresas en España y Europa.Fusionamos la experiencia de RRHH, Hiring Managers y especialistas de diversos sectores para acabar con proceso...Mostrar másÚltima actualización: hace 10 días
    • Oferta promocionada
    Data Scientist / ML Engineer

    Data Scientist / ML Engineer

    AppodealBarcelona, Cataluña, España
    We are looking for an experienced.Data Scientist / Machine Learning Engineer.Data Science team in Barcelona.Analyze data to identify trends, patterns, and actionable insights, independently solving...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    • Nueva oferta
    Data Scientist in AI for Cardiology – Digital Health Unit (RE1)

    Data Scientist in AI for Cardiology – Digital Health Unit (RE1)

    Barcelona Supercomputing CenterBarcelona, Cataluña, España
    The Barcelona Supercomputing Center – Centro Nacional de Supercomputación (BSC-CNS) is Europe’s leading center for supercomputing research. The successful candidate will work in a highly sophisticat...Mostrar másÚltima actualización: hace 8 horas
    AI - Data Scientist

    AI - Data Scientist

    team.blue GlobalBarcelona, Catalunya, .ES
    Quick Apply
    The most trusted digital enabler.Europe and has more than 3,000 experts to support them.Its goal is to shape technology and to empower businesses with innovative digital services.Click here to read...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    Senior Data Scientist -NLP

    Senior Data Scientist -NLP

    MANGOPalau-solità i Plegamans, Catalonia, Spain
    Estamos buscando un / a Senior Data Scientist especializado en Natural Language Processing y Large Language Models para unirse a nuestro equipo de Data&Advanced Analytics. Formarás parte del equipo qu...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    • Nueva oferta
    AI Developer

    AI Developer

    SET EuropaTerrassa, SPAIN
    SET Europa ofrece servicios near shore, de contratación y externalización, así como soluciones para las necesidades inmediatas de los proyectos. La Empresa Nuestro cliente ofrece soluciones de inte...Mostrar másÚltima actualización: hace 8 horas
    AI Evaluation Data Scientist (Fixed-term contract)

    AI Evaluation Data Scientist (Fixed-term contract)

    MULTIVERSE COMPUTINGBarcelona, Catalonia, .ES
    Quick Apply
    Come and join our multicultural team!.We are looking to fill this role.Expect a fast, transparent process with quick feedback. We are a European deep-tech leader in quantum and AI, backed by major g...Mostrar másÚltima actualización: hace 7 días
    • Oferta promocionada
    Senior Applied ML Scientist

    Senior Applied ML Scientist

    NoryBarcelona, Cataluña, España
    We’re fixing hospitality and building an all-knowing restaurant management system that blends real-time data with AI predictive analytics to help restaurants run with consistency, certainty, and pr...Mostrar másÚltima actualización: hace 1 día
    AI / ML Engineer, hibrido

    AI / ML Engineer, hibrido

    Axiom Software SolutionsBarcelona, España
    AI / ML Engineer Job Title : ML Eng.Data Science Python Developer Employment type Full time / Permanent.Location : Sant Cugat, -Barcelona, Spain Work Mode Hybrid work 2 days in a week at office.Job...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    Business Analyst - AI Products

    Business Analyst - AI Products

    agap2 EspañaViladecans, Catalonia, Spain
    AI-focused team within the aviation industry.The role involves working at the intersection of.Collaborate with cross-functional teams (product leads, data scientists, engineers, business stakeholde...Mostrar másÚltima actualización: hace 11 días
    • Oferta promocionada
    Principal Data Scientist - Agentic AI - BCN / MAD

    Principal Data Scientist - Agentic AI - BCN / MAD

    AILY LABSBarcelona, Cataluña, España
    Principal Data Scientist - Agentic AI - BCN / MAD.Join to apply for this role at AILY LABS.Are you ready to take part in the upcoming AI revolution of the business world? We have an extraordinary opp...Mostrar másÚltima actualización: hace 25 días