All roles

Member of Technical Staff, Training (Bay Area, Remote)

Remote · USA Full-time New today

What You’ll Do Drive down wall-clock time to convergence by profiling and eliminating bottlenecks across the foundation model training stack stack, from data pipelines to GPU kernels Design, build, and optimize distributed training systems (PyTorch) for multi-node GPU clusters, ensuring scalability, robustness, and high utilization Implement efficient low-level code (CUDA, cuDNN, Triton, custom kernels) and integrate it seamlessly into high-level training frameworks Optimize workloads for hardware efficiency: CPU/GPU compute balance, memory management, data throughput, and networking Develop monitoring and debugging tools for large-scale runs, enabling rapid diagnosis of performance regressions and failures What You’ll Bring Deep experience in distributed systems, ML infrastructure, or high-performance computing (8+ years) Production-grade expertise in Python Low-level performance mastery: CUDA/cuDNN/Triton, CPU–GPU interactions, data movement, and kernel optimization Scaling at the frontier: experience with PyTorch and training jobs using data, context, pipeline, and model parallelism System-level mindset with a track record of tuning hardware–software interactions for maximum utilization Apply To This Job

Related roles

Marketing Analyst (Attribution Focus) (Promova)

Remote · USA Full-time

Student and Family Experience Manager (Immediate Opening)

Remote · USA Full-time

Customer Sales Representative (remote work)

Remote · USA Full-time

Account Manager Industrial Markets Region: France - Africa

Remote · USA Full-time

VP of Engineering

Remote · USA Full-time

Member of Technical Staff, Foundation Models (Bay Area)

Remote · USA Full-time

Member of Technical Staff, Data Agent (Bay Area, Remote)

Remote · USA Full-time

Member of Technical Staff, Platform (Bay Area, Remote)

Remote · USA Full-time

Account Manager Industrial Markets Region: Europe - Middle Eas

Remote · USA Full-time

Sr FP&A Analyst

Remote · USA Full-time

[Remote] Animator/Illustrator

Remote · USA Full-time

Lead Power Systems Engineer - Grid Integration and Stability, Consulting Services

Remote · USA Full-time

Experienced Customer Service Associate – Remote Opportunity at arenaflex

Remote · USA Full-time

Remote Part‑Time Customer Service Representative – arenaflex Home‑Based Support Specialist

Remote · USA Full-time

Logistics Associate - Relief Evenings - 2:00pm-10:30pm (08-HR) Non-Benefited

Remote · USA Full-time

Solar O&M Project Manager

Remote · USA Full-time

Experienced Part-Time Customer Service Representative – Remote Customer Support at arenaflex

Remote · USA Full-time

Experienced Data Entry Specialist – Unlock the Magic of Flexible Schedules at arenaflex

Remote · USA Full-time

Experienced Full Stack Java Software Engineer – Customer Systems Development

Remote · USA Full-time

Experienced Live Chat Support Specialist – Customer Service Representative at arenaflex

Remote · USA Full-time