Friday Systems builds AI that allows industrial robots to adapt to dynamic warehouse environments.
Own the DRL stack end-to-end: formulation → algorithm design → large-scale training → evaluation → deployment.
~ Design & ship DRL algorithms (PPO / SAC / DDQN and variants, based on encoders / cross-attention / pointer networks) for complex control & combinatorial optimization.
~ GAE, normalization, entropy / KL control, distributional / value-loss tuning, curriculum learning and reward shaping, …
~ Launch multi-GPU training, parallel rollouts, efficient replay / storage, and reproducible experiment tooling.
~ Productionize: clean PyTorch code, profiling, Dockerized services (FastAPI), AWS deployments, experiment tracking, dashboards.
~ Provide mentorship and leadership to foster a culture of quality and innovation.
Extensive Deep Learning, Reinforcement Learning & PyTorch expertise: you can implement several DRL algorithms from scratch, diagnose the root causes of performance drops, and make informed decisions about next steps.
~ Python, Linux, Docker, Multi-GPU, Cloud (AWS).
~ Ownership: you’re comfortable being the primary owner for experiments, code quality, and results in a small team.
~ We are not considering entry-level or coursework-only profiles for this role.
Deep technical session with the CTO on your past RL work (no LeetCode, no homework).
AWS Engineer • Madrid, Spain