Staff Site Reliability Engineer, Moveworks

Salary not provided

+ Equity

AWS
Kubernetes
GCP
Python
Java
Linux
C++
Azure
Unix
Golang
Senior and Expert level
San Francisco Bay Area

Office located in Mountain View, CA

Moveworks

Enterprise-built conversational AI platform

Be an early applicant

Moveworks

Enterprise-built conversational AI platform

501-1000 employees

B2BArtificial IntelligenceEnterpriseCommunicationAutomation

Be an early applicant

Salary not provided

+ Equity

AWS
Kubernetes
GCP
Python
Java
Linux
C++
Azure
Unix
Golang
Senior and Expert level
San Francisco Bay Area

Office located in Mountain View, CA

501-1000 employees

B2BArtificial IntelligenceEnterpriseCommunicationAutomation

Company mission

To make getting instant help at work effortless for employees.

Role

Who you are

  • 7+ years of experience in authoring and operating complex distributed infrastructure and applications
  • Strong experience with container orchestration platform like Kubernetes and cloud infrastructure like AWS / GCP / Azure
  • Very high proficiency with Unix/Linux, TCP/IP, DNS, load balancers, autoscaling, file systems and different types of data stores
  • Software development proficiency with Python, Golang, Java, or C++
  • Experience working across teams and implementing solutions, tools, and practices to improve observability, reliability, and scalability
  • Desire to work at a startup pace in a small company with a high degree of ownership
  • Strong motivation, gumption, and an appetite for continuous, incremental changes and completing challenging projects fast
  • High level of curiosity about engineering outside of your immediate discipline and an incessant desire to learn
  • BS+ in computer science or a related field

What the job involves

  • As a site reliability engineer, you will be an owner of and be responsible for overall health, performance, and capacity of the Moveworks AI infrastructure and services
  • In addition to helping engineering teams with resolving operational issues, you will also design and implement solutions, tools and practices that help us improve operational efficiency and product SLA
  • This role is a blend of SRE, infrastructure, and software development
  • We’re building a team that indexes on moving fast, solving challenging product/engineering problems and providing value to our customers
  • To be successful, you'll be partnering with and enabling machine learning, search, product, data, and full stack teams to design and build fault tolerant and scalable infrastructure, services and features
  • This is an opportunity to play an integral role at the fastest-growing AI startup in its space
  • Design, develop, and evolve site reliability and chaos engineering for Moveworks infrastructure and services
  • Closely work with machine learning, search, product, infrastructure, data, and frontend teams to understand their infrastructure and operational needs and build solutions that are optimal, fault tolerant, and scalable
  • Author and advocate for reliability through best distributed system design patterns (error handling, retries, rate limiting, circuit breaking, etc.). Participate in design discussions and ensure operational readiness of infrastructure, services, and features
  • Design and build tools, libraries, and frameworks that allow engineering teams to rapidly deploy and scale Moveworks infrastructure and applications
  • Review and participate in application performance analysis / tuning and capacity planning
  • Setup and maintain monitoring, metrics, and reporting systems for observability and actionable alerting
  • Define internal and customer-facing key SLA metrics, implement solutions and practices with different teams to improve those metrics
  • Own the engineering on-call process and setup. Drive discussions for outages, root cause analysis, and action items
  • Participate in on-call rotation for second-tier escalation (at Moveworks, each engineer participates in the team specific first-tier on-call rotation). Help diagnose and resolve complex operational issues

Our take

Moveworks provides IT service and helpdesk support through an AI-driven chatbot. Its system utilizes information about an employee, such as their role and seniority level, to deliver the correct information or carry out processes. Moveworks claims to solve 30% of IT support tickets within weeks of implementation.

In 2024, Moveworks opened a data center in Australia and its service has seen strong uptake from large enterprise-level companies, where IT teams deal with hundreds of requests at a time, helping to minimize backlog and gain back employee time. As more and more workers move from office to home or hybrid, IT support requests have boomed, opening up the opportunity for Moveworks to expand its client pool and provide solutions to help with this change.

Competitors such as Electric offer similar automated IT support services, but Moveworks has invested heavily into creating an AI system that is conversational and easy to use. Automated solutions with machine learning capabilities can also identify cost-saving opportunities. When unused software licenses are estimated to cost companies billions of dollars a year, there is money to be made by identifying and removing these avoidable costs through AI.

Steph headshot

Steph

Company Specialist

Insights

Top investors

Few candidates hear
back within 2 weeks

97% employee growth in 12 months

Company

Funding (last 2 of 3 rounds)

Jun 2021

$200m

SERIES C

Nov 2019

$75m

SERIES B

Total funding: $305m

Company benefits

  • Fully paid medical, dental, and vision coverage with no premiums
  • Fully paid short-term disability, long-term disability, and life insurance
  • Free daily meals
  • Unlimited PTO
  • Unlimited paid sick days
  • 16 weeks of 100% paid parental leave
  • Medical, family care, and military leave
  • Competitive salary — we’re one of the best-paying companies in the Bay Area
  • 401(k) with matching
  • Equity and stock options
  • Commuter and parking benefits

Company HQ

Mountain View, CA

Leadership

After an MA at Stanford, Bhavin worked as a Director at LeapFrog Enterprises before co-founding Gazillion after 5 years in 2005. They left their CEO role at Refresh to co-found Moveworks in June 2016.

Vaibhav started their career as a Stanford University Research Scientist before working for Aster Data as a Software Engineer. Having started ClearStory Data in 2011, they co-founded Moveworks in 2016.

Varun Singh

(President)

Varun worked for half a decade as SVP of Product at Sefaira before spending 2 years as Lead Product Manager at Facebook. They left in 2016 to co-found Moveworks.

Jiang Chen

(CTO of AI)

Jiang took a PhD at Yale in Computer Science before working as a Columbia Research Scientist and spending 7 years at Yahoo! and Google. After a year at Airbnb, they co-founded Moveworks.

Salary benchmarks

We don't have enough data yet to provide salary benchmarks for this role.

Submit your salary to help other candidates with crowdsourced salary estimates.

Share this job

View 37 more jobs at Moveworks