your career. THE ROLE: We are looking for a dynamic, energetic Lead AI Cluster Models Architect to join our growing team... PERSON: The AI Cluster Models Architect plays a critical role in shaping the future of AI/ML training and inferencing...
your career. THE ROLE: We are looking for a dynamic, energetic Lead HPC Cluster Network Architect to join our growing team... PERSON: The Cluster Network Architect plays a critical role in shaping the future of AI/ML training and inferencing...
and innovation. Finding bottlenecks and optimizing cluster infrastructure for the latest AI systems. Are you ready to take on the... and cluster-level. Support validation of servers with AMD CPU/GPU/NICs and AMD’s libraries such as RCCL Design, implement...
Evaluate and select CPUs, GPUs, accelerators, interconnects, and memory configurations for optimal cluster performance. Design..., and fault tolerance mechanisms. Network Design network topologies to maximize overall cluster performance Understand the...
your career. THE ROLE: We are seeking a highly motivated and skilled GPU Cluster Performance Attainment Engineer... focus of this role is the RDMA networks used in AI Clusters, understanding data flows between GPU, NIC and cluster network...
cluster management, Docker containerization, and Helm chart deployments. Implement and maintain robust CI/CD pipelines.... Extensive experience with AWS deployments, EKS cluster management, Kubernetes, Docker, and Helm charts. Proficiency in CI/CD...
/partners across the world Hands-on experience with setting up cluster or multi –node inter-connected systems Representing...
, responsible for the execution of data center cluster projects at AMD CSP partners and enterprise commercial end-customers. The... during large scale cluster bringup and validation. The candidate should be a data center systems engineer, site reliability...
for treatment to Certified Physical Therapy Assistants (PTA) throughout the cluster. Provide consultation and clinical supervision... to PTAs throughout the cluster. Maintain availability to answer questions from PTAs. Directly supervise PTA and Rehab Tech...
, AI infrastructure, building cluster scale automation for distributed training and inference workloads, MLOps. You will be a member... for distributed training and inference workloads with AMD's ROCM software Build cluster scale automation for distributed training...
performance tuning, debugging, and cluster optimization across distributed processing workloads. Managing Delta Lake governance..., and distributed processing, with proven performance tuning and cluster optimization. Demonstrated expertise in DBT (Core or Cloud...
across service orchestration, job scheduling, cluster management, big-data processing, and other core services that business teams...
benchmarking studies, ISO NE transitional cluster studies, load interconnection studies replicating ISO/Utility practices...
using container-native Hadoop services to work in a Kubernetes cluster. As a Sr. Staff Software Engineer...
, and cluster deployments and lead discussions about network topologies, compute, management, telemetry, and storage fabrics...
. AI Consulting and Solutions: Proven experience building and deploying AI consulting and professional services. Large-Scale Cluster...
. Experiences to run workloads on large scale heterogeneous cluster is a plus Experiences to optimize GPU kernels...
. SpringBoot, Redis, MongoDB, Kafka, and MicroServices architecture. 3. AWS deployments, scaling and EKS cluster management 4...
, and negotiate staffing plans. Manage the end-to-end cluster development cycle, including forecasting, sourcing, procurement...
personnel to create costed bills of material (BOMs) for rack and cluster level solutions Partner with business development...