Remote
Today
Position Summary:
• Design, implement, and maintain distributed systems to build world-class ML platforms and products at scale
• Experiment with, deploy, and manage LLMs in a production context
• Benchmark and optimize inference deployments for different workloads, e.g. online, batch, and streaming
• Diagnose, fix, and automate the resolution of complex issues across the entire stack to ensure maximum uptime and performance
• Design and extend services to improve functionality and reliability of the
Contract
$80 - $90