for global societal impact. The Opportunity at Altana At Altana, we believe that software that ships must be reliable... and efficient. As a Staff Site Reliability Engineer, you will be instrumental in ensuring the availability, performance...
worldwide. Our software empowers legal teams to review and negotiate contracts faster, with greater accuracy and confidence... for the future of legal work Role Overview: We are looking for a Senior AI Engineer to join our team. You will be at the...
Senior Lead AI Engineer Overview: At Capital One, we are creating responsible and reliable AI systems, changing..., develop, test, deploy, and support AI software components including foundation model training, large language model inference...
, alerting, and observability workflows for production systems. Requirements: 5+ years’ experience building large-scale... language. Strong understanding of GPU software stacks (CUDA, Triton, NCCL) and Kubernetes orchestration. Practical experience...
to integrate model-serving pipelines, memory systems, and reasoning components. Implement monitoring, observability, and failover... on design improvements. Requirements Strong background in Computer Science, Software Engineering, or Systems Design...
how we serve our clients. As a Senior Engineer on AI.x, you will play a key role in bringing these priorities to life by designing... best practices for observability and operational excellence to maintain high performance and uptime for mission-critical...
management experience with an engineering team Minimum of 10 years as a software engineer or equivalent technical experience... of work include Machine Learning Engineers, Infrastructure Engineer, Product SWE Frontend and Backend, Mobile Software...
Datadog is the security and observability platform for cloud infrastructure, applications, and AI. Unlike traditional... software, LLMs run autonomously and nondeterministically, which makes them highly flexible but also inherently unpredictable...
planning, reasoning, tool-use, and long-term task decomposition. Engineer context- and memory-rich agents that integrate... microservice-based, cloud-native infrastructure for autonomous agent deployment with Docker/Kubernetes, observability tooling...