with expertise in optimizing AI model training and inference, including distributed training/inference and acceleration. The ideal... generative AI models. Responsibilities - Design and optimize large model inference pipelines for low-latency, high-throughput...
with expertise in optimizing AI model training and inference, including distributed training/inference and acceleration. The ideal... creation and consumption on TikTok and serve billions of users. We are seeking an experienced AI model optimization engineer...
to date with and apply cutting-edge techniques in large model optimization and inference acceleration. Qualifications: Minimum Qualifications... our hybrid work model, and the specific requirements may change at any time. We are seeking a Machine Learning Engineer...
benchmark tools and performance optimization of AI workloads specifically tailored for large-scale LLM training and inference... Hardware Acceleration (e.g., GPU/TPU/RDMA) or ML for Systems, and Distributed Storage. - Experience in AI model development...
as large model inference and multi-machine multi-card deployment. Our work enhances user experience by powering diverse..., and deployment of the GenAI features. This also encompasses large-scale training stability and optimization for acceleration, as well...
performance and optimization team across various frameworks and model architectures. This is a highly visible role with large... trend in inference and training. Experience in mapping model architecture to low level software, hardware and understanding...