Member of Technical Staff, Inference (Bay Area, Remote)

Remote · USA · Full-time

What You’ll Do

Build low-latency inference pipelines for on-device deployment, enabling real-time next-token and diffusion-based control loops in robotics

Design and optimize distributed inference systems on GPU clusters, pushing throughput with large-batch serving and efficient resource utilization

Implement efficient low-level code (CUDA, Triton, custom kernels) and integrate it seamlessly into high-level frameworks

Optimize workloads for both throughput (batching, scheduling, quantization) and latency (caching, memory management, graph compilation)

Develop monitoring and debugging tools to guarantee reliability, determinism, and rapid diagnosis of regressions across both stacks

What You’ll Bring

Deep experience in distributed systems, ML infrastructure, or high-performance serving (8+ years)

Production-grade expertise in Python, with strong background in systems languages (C++/Rust/Go)

Low-level performance mastery: CUDA, Triton, kernel optimization, quantization, memory and compute scheduling

Proven track record scaling inference workloads in both throughput-oriented cluster environments and latency-critical on-device deployments

System-level mindset with a history of tuning hardware–software interactions for maximum efficiency, throughput, and responsiveness
