for Machine Learning. THE PERSON: We are seeking a DevOps Engineer / HPC Platform Engineer to build and operate our Slurm... and automating HPC or Slurm clusters in production environments. Deep understanding of Linux systems, job schedulers (Slurm...
that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded... your career. THE ROLE: AMD is looking for an AI solutions validation Engineer who is passionate about complex AI solutions...
or HPC clusters. Enjoy debugging complex distributed systems and measuring efficiency rigorously. Have exposure... become experiments and products). About the Role: As a Training Performance Engineer, you'll drive efficiency improvements...
NVIDIA is the world leader in GPU Computing. We are passionate about markets include gaming, automotive, vision, HPC.... Installing and testing various systems OS, server firmware and SW stack. Drive support for root cause analysis on reliability...
OR equivalent experience. Apply strong software engineering fundamentals in distributed systems, networking, and storage... like Airflow or Argo, manage streaming systems (Kafka/Event Hubs), and handle object storage (Azure Blob/S3-compatible). Develop...
that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded...: You bring exceptional technical depth in datacenter virtualization, distributed systems, and ideally GPU-accelerated compute...
and cooling systems, including chillers, cooling towers, dry coolers, thermal storage, pumps, hydronic loops, and air/liquid... with AI/HPC data centers and advanced cooling technologies, including two-phase and high-density liquid-cooling systems...
, monitoring systems, object storage like MinIO, and High-Performance Computing (HPC)). Design, implement, and support robust... systems, microservices, and HPC architectures. Familiarity with object storage systems such as MinIO or AWS S3...