Platform Engineer - AI at NTT DATA, Inc.
Overview
Make an impact with NTT DATA. Join a company that is driving innovation and delivering value to clients and society. We embrace diversity and inclusion and provide opportunities to grow, belong, and thrive.
Your day at NTT DATA
As a Platform Engineer at NTT DATA, you will lead the design of complex managed service solutions for our largest enterprise clients. You will drive the strategic vision and direction for these solutions, combining technical expertise and business acumen to create IT strategies and roadmaps aligned with clients’ objectives, KPIs, and SLAs.
Responsibilities
- Design and build internal developer platforms (IDPs) that provide self-service infrastructure provisioning, deployment pipelines, and operational tooling through intuitive interfaces and APIs
- Develop comprehensive platform architecture spanning on-premises, cloud, and hybrid environments with focus on scalability and reliability
- Create developer-friendly abstractions for complex infrastructure concepts, including deployment workflows, environment management, and service discovery mechanisms
- Design, implement, and maintain enterprise-grade Linux and Windows server infrastructures, including system installation, configuration, patching, and optimization
- Perform advanced system administration tasks including user management, security hardening, performance tuning, and troubleshooting across diverse OS environments
- Implement automated OS provisioning and configuration management using infrastructure-as-code principles
- Design, deploy, and manage virtualized infrastructure using VMware vSphere / ESXi, Microsoft Hyper-V, and KVM
- Conduct capacity planning and performance analysis of virtual infrastructures to optimize resource utilization
- Implement backup and disaster recovery solutions for virtual machines including technologies like Veeam and SRM
- Integrate virtualization platforms with storage area networks (SAN) and network-attached storage (NAS)
- Design, implement, and maintain Kubernetes clusters across on-premises, cloud, and hybrid environments with focus on scalability and high availability
- Optimize container orchestration platforms for performance, cost-efficiency, and resource management
- Develop and maintain container deployment strategies, including blue-green deployments, canary releases, and rolling updates
- Implement service mesh technologies and networking solutions for secure, scalable service-to-service communication
- Implement advanced scheduling algorithms and resource allocation strategies for distributed workloads across multi-cluster and multi-tenant environments
- Design and optimize job scheduling systems with backfill, fair share, and advanced reservations
- Manage cluster resource allocation including CPU, memory, storage, and GPUs with focus on maximizing utilization and minimizing latency
- Implement automated scaling policies and resource optimization techniques for dynamic workload management
- Build and maintain CI/CD pipelines with automated testing, security scanning, and progressive deployment strategies
- Integrate CI/CD systems with Kubernetes and container orchestration platforms for streamlined application delivery
- Implement GitOps workflows and Infrastructure-as-Code practices using tools like Terraform, Pulumi, and Ansible
- Design and implement monitoring, logging, and alerting systems providing visibility into platform health and application performance
- Deploy observability solutions using Prometheus, Grafana, Jaeger, and other distributed tracing tooling
- Implement automated anomaly detection and performance optimization based on metrics, logs, and traces
Required Qualifications
Education & Experience
- Bachelor’s degree in Computer Science, Information Technology, or related field, or equivalent practical experience
- 5+ years of experience in platform engineering, DevOps, site reliability engineering, or similar infrastructure-focused roles
Technical Skills
- Expert-level knowledge of Linux system administration (RHEL, CentOS, Ubuntu, Debian) including kernel tuning, process management, and security hardening
- Proficiency in Windows Server administration including Active Directory, Group Policy, and PowerShell scripting
- Experience with VMware vSphere / ESXi, Microsoft Hyper-V, and open-source hypervisors like KVM
- Knowledge of virtualization management tools including vCenter Server and System Center Virtual Machine Manager
Containerization & Orchestration
- Expert-level Kubernetes administration including cluster setup, networking, storage, and security
- Proficiency with Docker containerization and container image management; familiarity with Rafay and Rancher platforms
- Experience with container orchestration patterns and service mesh technologies
Cloud Platforms
- Hands-on experience with AWS, Azure, and Google Cloud Platform including compute, networking, and storage services
- Knowledge of cloud-native technologies and hybrid cloud architecture
Programming & Scripting
- Proficiency in Python, Bash, Go, and PowerShell for automation and infrastructure management
- Experience with Infrastructure-as-Code tools like Terraform, Pulumi, CloudFormation, or Ansible
Monitoring & Observability
- Experience with Prometheus, Grafana, Datadog, ELK Stack, and distributed tracing tools
- Knowledge of observability best practices including correlation of metrics, logs, and traces
Cluster & Resource Management
- Experience with job schedulers and resource management systems like Slurm, PBS, or Kubernetes scheduling frameworks
- Understanding of distributed systems architecture and resource optimization techniques
Soft Skills
- Strong analytical and problem-solving abilities with experience in complex system troubleshooting
- Excellent communication skills and ability to work effectively with cross-functional virtual teams
- Product mindset with focus on developer experience and platform usability
- Ability to work effectively in a remote, virtual setting across Europe and internationally
Preferred Qualifications
- Kubernetes certifications (CKA, CKAD, CKS) or equivalent cloud platform certifications
- Experience with service mesh technologies (Istio, Linkerd) and API gateway solutions
- Knowledge of security frameworks and compliance standards (SOC 2, ISO 27001, HIPAA)
- Experience with GitOps practices and advanced CI/CD patterns
- Background in high-performance computing or large-scale distributed systems
Workplace type
Remote Working
About NTT DATA
NTT DATA is a $30+ billion trusted global innovator of business and technology services. We serve 75% of the Fortune Global 100 and invest billions in R&D to help organizations and society move confidently into the digital future. We are committed to diversity and inclusion and have a global presence across 50+ countries. Our services include consulting, data and AI, industry solutions, and the development and management of applications, infrastructure, and connectivity. NTT DATA is part of NTT Group and headquartered in Tokyo.
Equal Opportunity Employer
NTT DATA is proud to be an Equal Opportunity Employer with a global culture that embraces diversity. We do not discriminate based on age, race, color, gender, sexual orientation, religion, nationality, disability, pregnancy, marital status, veteran status, or any other protected category.
Third parties fraudulently posing as NTT DATA recruiters can be harmful. NTT DATA recruiters will never ask candidates for payment or banking information during recruitment. All legitimate emails will come from an @nttdata.com address. If you suspect fraud, please contact us.
Seniority level
Mid-Senior level
Employment type
Full-time
Job function
Engineering and Information Technology
Industries
IT Services and IT Consulting