Find your dream job now!

Click on Location links to filter by Job Title & Location.
Click on Company links to filter by Company & Location.
For exact match, enclose search terms in "double quotes".

Keywords: Senior Engineer - AI and HPC Observability, Location: Santa Clara, CA

Page: 1

Senior Engineer - AI and HPC Observability

. We are looking for a Senior AI & HPC Observability Engineer to design and build the next-generation observability platform for large-scale... infrastructure teams to optimize observability for model training, inference workloads, and HPC performance. Leverage machine...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 22 Oct 2025

Senior HPC Cluster Engineer - EDA

and experienced HPC Cluster Engineer to design, deploy, and operate GPU Compute Clusters for EDA and high-performance computing... leadership and strategic mentorship on the management of large-scale HPC systems including the deployment of compute, networking...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 17 Sep 2025

Senior AI and ML Storage Engineer

some of the world’s most advanced computing workloads. We are seeking a Software Engineer to join our MARS team at NVIDIA... improvements in system reliability, performance, and observability to meet exascale standards. Partner with security, networking...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 21 Oct 2025

Senior Storage Production Engineer - DGX Cloud

, and ensuring low-latency data access for high-performance computing (HPC) and AI/ML workloads. Storage Production Engineers..., Puppet, and Terraform for automating storage deployments. Experience with observability and tracing tools like InfluxDB...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 15 Aug 2025