All roles

Staff Site Reliability Engineer — Project Volcano

Remote · USA Full-time New today

Are you ready to unlock intelligence? If you don’t think you meet all of the criteria below but are still interested in the job, please apply. Nobody checks every box - we’re looking for candidates that are particularly strong in a few areas, and have some interest and capabilities in others. ## About the Role Kong is building Project Volcano, an internal developer platform purpose-built for Kong's engineering ecosystem. Volcano will provide teams with on-demand preview environments, edge deployments, managed PostgreSQL, auth, realtime, and storage APIs all deeply integrated with Kong products. As the Staff SRE for Volcano, you will be the founding reliability voice for this platform. This role is a strategic initiative driven by the Office of the CTO (OCTO). You will partner directly with engineering leadership to define the platform's reliability posture, build its SRE practice from the ground up, and ensure Volcano can scale to serve all of Kong's customers. This is a high-visibility, high-impact role with direct influence on Kong's next generation developer platform. WHAT YOU'LL DO - Own reliability for Volcano end-to-end: Define and drive SLOs, error budgets, and incident response practices for all Volcano services — edge deployments, managed Postgres, auth, realtime, storage, and the control plane. - Architect the platform's infrastructure: Design and build the multi-region Kubernetes infrastructure, networking, and data plane that powers Volcano's edge deployment pipeline and backend-as-a-service capabilities. - Build the GitOps and CI/CD backbone: Establish deployment automation, canary pipelines, and preview environment provisioning using ArgoCD, Helm, and Terraform/Terragrunt — setting patterns the broader team will follow. - Scale managed data services: Design, operate, and harden multi-tenant PostgreSQL clusters, Redis caching layers, and object storage — with a focus on data isolation, performance, and disaster recovery. - Drive observability from day one: Instrument every Volcano service with meaningful SLIs; build dashboards, alerts, and runbooks using Datadog, Prometheus, and Grafana before services go live, not after incidents. - Lead cross-functional reliability work: Collaborate with the OCTO team, product engineering, and security to bake reliability and compliance into Volcano's architecture — not bolt it on later. - Set SRE culture and standards: Mentor engineers across Volcano's contributing teams on reliability principles; lead postmortems, define on-call practices, and build a blameless engineering culture. - Evaluate and adopt emerging technologies: Given Volcano's greenfield nature, evaluate and make architectural decisions on edge runtimes, serverless compute, vector databases, and AI-native infrastructure components. WHAT YOU'LL BRING - BS in Computer Science or equivalent; substantial experience at Staff or Principal IC level in SRE/Platform Engineering. - Proven track record building SRE or platform engineering practices for developer-facing platforms or PaaS/SaaS products — ideally at greenfield stage. - Deep Kubernetes expertise: multi-tenant cluster design, networking (CNI, service mesh, ingress), autoscaling, and security hardening. About Kong: Kong Inc., a leading developer of API and AI connectivity technologies, is building the infrastructure that powers the agentic era. Trusted by the Fortune 500 and startups alike, Kong's unified API and AI platform, Kong Konnect, enables organizations to secure, manage, accelerate, govern, and monetize the flow of intelligence across APIs and AI models. For more information, visit www.konghq.com.

Compensation

Range: $140K - $197K Apply To This Job

Related roles

Site Reliability Engineer II ( Remote )

Remote · USA Full-time

Site Reliability Engineer lll

Remote · USA Full-time

Senior Site Reliability Engineer - Remote EST

Remote · USA Full-time

Senior DevOps Engineer/Site Reliability Engineer-East Coast

Remote · USA Full-time

Distinguished Site Reliability Engineer - Cloud

Remote · USA Full-time

Senior Site Reliability Engineer, APAC

Remote · USA Full-time

(Senior) Site Reliability Engineer (m/f/d) – Platform & Agentic Operations

Remote · USA Full-time

Senior Site Reliability Engineer (SRE) - (GCP)

Remote · USA Full-time

Kubernetes Engineer - Remote

Remote · USA Full-time

Senior Kubernetes Engineer – Secret Eligible

Remote · USA Full-time

Housing Case Manager

Remote · USA Full-time

Online Search Coordinator

Remote · USA Full-time

Huntington Practice Finance BDO

Remote · USA Full-time

Advisor, Business

Remote · USA Full-time

Asset Reliability Manager

Remote · USA Full-time

Experienced Live Chat Agent – Remote Customer Support Representative

Remote · USA Full-time

Experienced Junior Geographic Information Systems Analyst – Web & Cloud Application Development

Remote · USA Full-time

Manager, Centralized Data & Sample Management - UK , Poland or South Africa (Home-based) - FSP

Remote · USA Full-time

Experienced Healthcare Customer Service Representative – Remote Opportunity with arenaflex

Remote · USA Full-time

Experienced Customer Service Advocate – Remote Healthcare Support Specialist

Remote · USA Full-time