Senior Site Reliability Engineer, Fastly

$167.8-209.7k

AWS
Kubernetes
GCP
Python
Linux
Go
Terraform
Prometheus
Grafana
flux
Datadog
Senior level
Denver
Los Angeles
New York
San Francisco Bay Area
Fastly

Edge cloud platform

Job no longer available

Fastly

Edge cloud platform

1001+ employees

B2BSecurityEnterpriseCyber SecurityCloud Computing

Job no longer available

$167.8-209.7k

AWS
Kubernetes
GCP
Python
Linux
Go
Terraform
Prometheus
Grafana
flux
Datadog
Senior level
Denver
Los Angeles
New York
San Francisco Bay Area

1001+ employees

B2BSecurityEnterpriseCyber SecurityCloud Computing

Company mission

To build a better internet — a safe place where good can thrive.

Role

Who you are

  • Experience running high availability systems and supporting distributed infrastructure. You have designed services with fault tolerance and across geographies. You have deployed and managed multi-tiered services
  • Understanding of Linux systems, high and low level. You have used tcpdump and tracing tools
  • Experience building and operating production-grade kubernetes clusters in multiple regions, clouds or data centers. You have experience with tooling in the CNCF space, such as: prometheus, flux, helm, etc
  • Experience with programming languages such as Go and Python. You can read code and reason about what it does. You can write code within an existing large code base such as adding features. You can create medium-sized programs from scratch such as custom kubernetes controllers, custom prometheus exporters, and building tooling to help manage infrastructure
  • Experience with infrastructure and configuration management tooling. You have used Terraform to manage infrastructure
  • Experience provisioning and managing users and resources with cloud providers such as AWS and GCP. You have provisioned users and accounts using both graphical user interfaces and infrastructure as code frameworks. You have experience provisioning components on public cloud and understand how they work together in creating a multi-tiered service. You have deployed and managed services built on top of public cloud components such as EC2, S3, and GKE
  • Experience with CI/CD and GitOps tooling. You can iterate infrastructure via pipelines through code changes. You have experience using Github including creating and reviewing pull requests
  • Experience with monitoring tools such as Prometheus, Datadog and Grafana
  • Experience working on a distributed team. You have experience collaborating across timezones. You can articulate challenges of distributed teams and how you mitigate them

What the job involves

  • Foundation Engineering at Fastly is looking for a Site Reliability Engineer to join our Cloud and Container Services team. This role is focused on helping to scale and manage Fastly’s Kubernetes based platform for control plane services
  • This platform is built on top of multiple public cloud services and contains many Kubernetes ecosystem components. We’re working to scale out our platform to support growth and at the same time evolve to address new business priorities
  • A successful candidate will help expand our platform feature set, support existing users and onboard new services, and drive efficiency while maintaining a secure platform
  • Design, build and operate infrastructure (cloud, Fastly datacenter) to enable reliable and rapid deployment, effective monitoring, and resilient operation in a large-scale Linux environment. The majority workloads are containerized but some are using native cloud services such as compute and storage
  • Diagnose and resolve performance and reliability issues across the stack: application, operating system, network, 3rd party services and APIs, including cross-application dependencies
  • Deploy and support complex 3rd party and internally developed applications
  • Write tools to automate maintenance and deployment of servers, services, and applications
  • Collaborate with internal users and continually evolve the platform and its operations using solid engineering practices
  • Drive projects sometimes independently and sometimes collaboratively across time zones
  • Configure access and manage operations within a multi-cloud environment

Otta's take

Xav Kearney headshot

Xav Kearney

CTO of Otta

Internet users around the world have come to expect personalized, real-time digital experiences, but delivering these quickly and securely, while maintaining quality, can be a difficult task for businesses.

Fastly enables the companies to deliver fast, secure, and scalable online experiences. Its edge cloud platform moves data and applications closer to end-users — improving the user experience, putting the power back in developers’ hands, and enabling clients to focus on growing their businesses.

The company has helped several high-profile businesses, including Reddit, Pinterest, Stripe, Epic Games, and more. Its offering is well-regarded in the space, demonstrated by it being named a 2022 Gartner® Peer Insights™ Customers’ Choice for Global CDN, as well as a Leader in The Forrester Wave™: Edge Development Platforms, Q4 2023 report.

Insights

Top investors

Few candidates hear
back within 2 weeks

9% employee growth in 12 months

Company

Funding (last 2 of 7 rounds)

Jan 2019

$40m

SERIES F

Jul 2018

$40m

SERIES F

Total funding: $259m

Company benefits

  • Competitive PTO Policies
  • Remote or Hybrid Work
  • Generous time off for parental leave
  • Full medical, dental, and vision coverage
  • Short- and long-term disability insurance
  • Mental health resources
  • 401(k)/retirement plans
  • Employee stock purchasing plans (ESPP)
  • Reimbursements for learning and development programs

Company values

  • We have a curious spirit
  • We focus on our customer
  • We are trustworthy
  • We act with passion
  • We operate with integrity
  • We are competitive
  • We embrace transparency
  • We are good people

Company HQ

China Basin, San Francisco, CA

Founders

Artur Bergman

(Chief Architect and Executive Chairperson)

Served as the company's CEO until changing roles in 2020. Previously served as CTO at Wikia, and was a Board Member at OpenID Foundation


People progressing

Joined as Sales Engineer (Media and Entertainment). Was promoted to Senior Sales Engineer (EMEA) after 2 years, then promoted again to Senior Principal Sales Engineer.

Salary benchmarks

We don't have enough data yet to provide salary benchmarks for this role.

Submit your salary to help other candidates with crowdsourced salary estimates.

Share this job

View 27 more jobs at Fastly