a comprehensive SRE strategy aligned with the company's goals and objectives. Lead junior members of the team to drive the reliability... reliability and reduce manual intervention. Production Support Optimization: Lead all aspects of end-to-end production support...
About the role: We're hiring a Senior AI Engineer to lead the development and deployment of cutting-edge AI systems... Deployment: Deploy scalable, secure ML systems on AWS, Azure, and Google Cloud. Translate prototypes into reliable, cost...
systems; ML Platforms: MLflow, Kubeflow, JupyterHub; Infrastructure: AWS/GCP/Azure, Terraform; Languages: Python, Go... millions of predictions per second. Build robust, scalable systems for model training, deployment, and monitoring. Lead...