AI Evaluation Data Scientist - AI / ML / LLM - Hybrid - Barcelona

European Tech Recruit | Barcelona, Catalonia, Spain

Posted 1 day ago

Job description

Overview

A fantastic opportunity for a driven AI Data Scientist to join a leading Quantum AI company that builds cutting-edge solutions to make AI faster, greener, and more accessible. You’ll work alongside world-leading experts in quantum computing and AI, take on challenging projects, and help shape the future of Generative AI systems.

This is initially a 9-month fixed-term contract, with scope to extend. Hybrid working from sites in Madrid or Barcelona.

Responsibilities

Design and lead the evaluation strategy for our Agentic AI and RAG systems, turning customer workflows and business needs into measurable metrics and clear success criteria.
Contribute to the end-to-end design of Agentic AI and RAG systems, injecting a data-and-evaluation perspective into retrieval strategies, orchestration policies, tool usage, and memory to solve complex, real-world problems across industries.
Develop task-based, multi-step evaluations that reflect how the different components of our systems (retrieval, planning, tool use, memory) perform in real-world scenarios across cloud and edge deployments.
Develop and refine rigorous evaluation frameworks that reflect real-world performance, going beyond model benchmarks to assess task success, reasoning capabilities, factual consistency, reliability, and user success metrics across diverse problem domains.
Build and maintain a reproducible evaluation pipeline, including datasets, scenarios, configs, test suites, versioned assets, and automated runs to track regressions and improvements over time.
Curate and generate high-quality datasets for evaluation, including synthetic and adversarial data, to strengthen coverage and robustness.
Implement and calibrate LLM-as-a-judge evaluations, aligning automated scoring with human feedback and ensuring fairness, robustness, and representativeness.
Perform deep error analyses and ablations to uncover failure patterns, maintain a taxonomy of failure modes (reasoning, grounding, hallucinations, tool failures), and provide actionable insights to engineers to improve model and system performance.
Partner with ML specialists to create a data flywheel in which evaluation continuously informs new dataset creation and improvements to prompts, tool usage, model training, and system design, and quantify those improvements over time.
Define and monitor operational metrics (latency, cost, reliability) to ensure evaluations align with production and customer expectations.
Maintain high engineering standards, including clear documentation, reproducible experiments, robust version control, and well-structured ML pipelines.
Contribute to team learning and mentorship, guiding junior engineers and sharing expertise in LLM development, evaluation, and deployment best practices.
Participate in code reviews, offering thoughtful, constructive feedback to maintain code quality, readability, and consistency.

Minimum Required Qualifications

Master's or Ph.D. in Computer Science, Machine Learning, Data Science, Physics, Engineering, or related technical fields, with relevant industry experience.

Solid hands-on experience (3+ years for mid-level, 5+ years for senior) working as a Data Scientist, ML Engineer, or Research Scientist in applied AI / ML projects deployed in production environments.

Strong background in evaluation of machine learning systems, ideally with experience in LLMs, RAG pipelines, or multi-agent systems.

Proven ability to design and implement evaluation methodologies that go beyond static benchmarks, capturing real-world task success, reasoning, and robustness.

Hands-on experience with dataset creation and curation (including synthetic data generation) for training and evaluation.

Proven experience with agent-based architectures (task decomposition, tool use, reasoning workflows), RAG architectures (retrievers, vector databases, rerankers), and orchestration frameworks (LangGraph, LlamaIndex).

Strong problem-solving skills, with the ability to navigate ambiguity and design practical solutions to open-ended user or business needs.

Strong software engineering skills, with proficiency in Python, Docker, Git, and experience building robust, modular, and scalable ML codebases.

Familiarity with common ML and data libraries and frameworks (e.g., PyTorch, HuggingFace, LangGraph, LlamaIndex, Pandas).

Experience with cloud platforms (ideally AWS).

By applying to this role, you understand that we may collect your personal data and store and process it on our systems. For more information, please see our Privacy Notice.
