Site Reliability Engineer, Tyk

APAC/EMEA

Salary not provided
MongoDB
AWS
Kubernetes
Python
Redis
Linux
Go
Terraform
Prometheus
Grafana
Junior, Mid and Senior level
Remote from Canada, Europe, UK, US

More information about location

Tyk

Open-source API gateway & management platform

Job no longer available

Tyk

Open-source API gateway & management platform

101-200 employees

B2BEnterpriseInternal toolsAPI

Job no longer available

Salary not provided
MongoDB
AWS
Kubernetes
Python
Redis
Linux
Go
Terraform
Prometheus
Grafana
Junior, Mid and Senior level
Remote from Canada, Europe, UK, US

More information about location

101-200 employees

B2BEnterpriseInternal toolsAPI

Company mission

To connect every system in the world.

Role

Who you are

  • Strong collaboration skills
  • Launching and operating production Kubernetes clusters
  • Designing and operating infrastructure on AWS and other providers
  • Operating MongoDB (or other document database) clusters
  • Operating Redis (or other key-value storage) clusters
  • Administering Linux servers
  • Maintaining distributed software
  • Operating Prometheus and Grafana
  • Operating logging collection and analysis system
  • Strong working knowledge of Kubernetes and Containers
  • Go and/or Python (advanced)
  • AWS (proficient)
  • Linux (proficient)
  • Terraform and IaC in general (proficient)
  • Helm (familiar)
  • MongoDB (or similar)
  • Redis (or similar)
  • Monitoring & logging
  • Grasp of networking concepts (subnets, routing, peering, load balancing, NAT, etc.)
  • Common networking protocols (DNS, TCP/IP, HTTP, TLS, UDP)

What the job involves

  • At Tyk, we’re obsessed with building software that solves problems. We count on our Site Reliability Engineers (SREs) to empower users with a rich feature set, high availability, and stellar performance level to pursue their missions
  • Our customer base is growing, so we’re seeking an experienced SRE to optimise, automate, and improve our performance, using insights from massive-scale data in real time
  • We want an original thinker, a challenger, a technical legend, an opinionated collaborator who wants to make things better
  • Proactive Monitoring: Ensure our production Cloud environment operates within defined SLAs through vigilant monitoring and proactive issue resolution
  • Alerting and Monitoring: Collaborate with Senior SRE to identify opportunities for building proactive alerting and monitoring systems; implement solutions to enhance system reliability
  • Performance Metrics: Contribute to defining key performance metrics for Cloud services, enabling performance improvements and success measurement
  • Solutions Development: Propose and develop solutions to maintain and enhance key performance indicators (KPIs) across our Cloud infrastructure
  • Data Analysis: Gather and analyse metrics from operating systems and applications to optimise system performance and expedite fault resolution
  • Innovation: Drive innovation by optimising system and infrastructure performance, anticipating customer needs, and proactively addressing scaling demands
  • Scalability: Work closely with commercial functions to optimise our platform for scalability and meet growing customer demands
  • Cloud Infrastructure: Analyse and ensure the automation, scalability, and efficient management of our Cloud infrastructure
  • Automation: Execute automation for known cloud operations tasks and create new automation solutions to streamline processes
  • Software Development: Design, write, and deliver software and automation solutions to enhance the availability, scalability, latency, and efficiency of our PaaS services
  • Root Cause Analysis: Participate in blame-free root cause analysis meetings to promote learning and continuous system improvement in the event of production system incidents
  • Documentation: Create and contribute to policies and runbooks to ensure that operational processes are well-documented and consistently followed
  • On-call Support: Provide on-call support, ensuring our Cloud services follow a 24/7 model by promptly responding to alerts, meeting SLAs, and automating root cause analysis
  • Upgrades and Migrations: Plan and execute software upgrades, including Kubernetes versions. Manage and communicate migrations from Classic Cloud to the new Cloud platform

Share this job

Insights

Top investors

38% female employees

-10% employee growth in 12 months

Company

Company benefits

  • Everyone has unlimited paid holiday
  • We have total flexibility in hours, as we believe creativity flows better when our people are given freedom to decide when they are most productive. Everyone is unique after all
  • Employee share scheme
  • Generous maternity and paternity leave
  • Company retreats
  • Volunteering Days
  • Employee Wellbeing platform

Funding (2 rounds)

Sep 2021

$35m

SERIES B

Apr 2019

$5.4m

SERIES A

Total funding: $40.4m

Our take

Tyk offers an API management platform that enables companies to securely share data with third parties, facilitating the sale of services through non-direct channels. This solution helps businesses harness the power of their APIs on a large scale while ensuring data security and accessibility.

The platform acts as both a management and security layer between APIs, simplifying the process for organisations to manage access. By providing a robust security framework, Tyk ensures that only authorised users can interact with an organisation's APIs, thereby protecting sensitive data.

With millions of transactions processed daily for thousands of organisations, including notable clients like AXA, Cisco, Trip Advisor, Starbucks, Domino's, Sephora, and the UK Ministry of Justice, Tyk has established itself as a trusted leader in API management.

Kirsty headshot

Kirsty

Company Specialist at Welcome to the Jungle