Site Reliability Engineer, Appspace

Salary not provided
MongoDB
AWS
Kubernetes
GCP
Python
macOS
Linux
Terraform
MySQL
Azure
RabbitMQ
Windows
Stackdriver
JIRA
Confluence
Atlassian
Git
Junior, Mid and Senior level
Remote in US
Appspace

Communication & space management platform

Be an early applicant

Appspace

Communication & space management platform

201-500 employees

B2BHRInternal toolsProductivityCommunicationSaaS

Be an early applicant

Salary not provided
MongoDB
AWS
Kubernetes
GCP
Python
macOS
Linux
Terraform
MySQL
Azure
RabbitMQ
Windows
Stackdriver
JIRA
Confluence
Atlassian
Git
Junior, Mid and Senior level
Remote in US

201-500 employees

B2BHRInternal toolsProductivityCommunicationSaaS

Company mission

Appspace is on a mission to help companies create a workplace experience people love, because when people love where they work, they can accomplish incredible things.

Role

Who you are

  • Our Cloud Operations team seeks a Site Reliability Engineer who is passionate about problem-solving, automating, and maintaining Appspace’s Cloud Platform to support the needs of our Engineering and Customer Care teams
  • The ideal candidate will see manual work as an opportunity to exercise automation, will understand SRE best practices, have experience automating infrastructure deployments and developing self-healing solutions to infrastructure issues
  • Must be able to learn new technologies quickly and a desire to be a life-long learner
  • Must communicate well and adapt to working well with others across different countries and cultures
  • Strong background in Containers, Kubernetes, Helm, Linux, Python coding, and some experience with Windows Server OS and MacOS are a must
  • Experience with Google Cloud Platform, Google Kubernetes Engine, Google Compute Engine, and Google Storage is highly desired, but comparable experience with AWS or Azure will be considered
  • Solid troubleshooting experience and the ability to reason through a process workflow to identify a fault or odd behavior (i.e., spending time following log trails) is a must
  • Experience with administering MySQL & MongoDB preferred
  • Experience with administering message brokering systems like RabbitMQ preferred
  • Must be flexible on occasionally attending “off-hour” meetings (we’re a global team supporting a global customer base!)
  • Open to quarterly travel up to 5%

Desirable

  • Experience with Build pipeline tools and the Atlassian suite (JIRA, Confluence, Bitbucket/Git, Bamboo, Octopus)
  • Experience with monitoring and alerting platforms, especially StackDriver
  • Experience with HashiCorp Terraform
  • Experience with IIS

What the job involves

  • You will work closely with a global team of cloud, engineering, product, and service professionals to improve our platform’s resiliency and scalability, which directly improves our customers’ experience with Appspace
  • With this role, you can grow your capabilities as a Site Reliability Engineer given the large-scale size of our cloud platform combined with our smaller-sized Cloud Operations team, which means you will have opportunities to work on all Cloud Infrastructure, end-to-end
  • This is a mission-critical role for Appspace, therefore while we offer flex time, it should be scheduled ahead of time, otherwise shift engagement is mandatory outside lunch and break times
  • On-Call coverage will be required weekly during a limited window of US daytime hours over the weekend. This is your opportunity to be part of an awesome company that is rapidly growing and defining the modern workplace experience market!
  • For this role, you will play a key role in maintaining our cloud platform, which includes an assortment of Kubernetes, Microservices, MongoDB, RabbitMQ, MySQL, Windows Server VM Infrastructure, Orchestration Engines, CI/CD and Monitoring platforms. Your day will consist of:
  • Automating maintenance tasks for our Cloud Platform, therefore strong experience in Python and shell scripting is a must
  • Deploying new features and releases of our software into Kubernetes via Helm, so strong experience in Kubernetes and Helm is a must
  • Troubleshooting performance issues or errors thrown by the cloud platform or application, and either resolving the underlying cause, or forwarding your research to Engineering to address in the product
  • Actioning Request Tickets from other teams in support of their needs to enable and prepare for upcoming releases
  • Monitoring the application’s performance, uptime, and cloud infrastructure’s performance, looking for improvement opportunities, and proactively taking action to solve any negative trends before they become issues
  • Lead, Participate, or Execute within the incident management process when alerts fire, and quickly ascertain root cause, resolve the issue, and find new and creative solutions to prevent recurrence
  • Configure, Monitor, Research, and Evaluate workload performances both on Google Cloud Platform and Microsoft Azure Clouds
  • Collaborating with our Development and Quality Assurance teams to address issues in the product and platform
  • Documenting new or updating existing processes and procedures to share knowledge and improve on standardized approaches to solution

Our take

Founded back in 2002, Appspace has grown into something of a leader in the workplace experience field. What essentially started as an internal content management and digital signage toolkit has since expanded into an end-to-end workplace management platform that caters for both physical and digital co-working and communication.

The caliber of customer that Appspace serves says a lot about its status – Google, Meta, LinkedIn, Coca-Cola, and Pfizer all use the platform. It’s to the company’s credit that it’s stayed abreast of changing needs in the workplace, and added to its product offering at opportune moments, either by developing its own solutions or via strategic acquisitions (such as its 2021 purchase of digital workplace platform Beezy).

Now that a blend of on-site, hybrid, and remote working is the norm, there’s a need for a single platform that ensures good employee (and customer) experiences, rather than companies using a host of distributed apps. It’s a need that Appspace is fulfilling well, continuing its winning streak of partnerships with the likes of Sony and Microsoft in 2022.

Steph headshot

Steph

Company Specialist

Insights

Few candidates hear
back within 2 weeks

8% employee growth in 12 months

Company

Company benefits

  • Flexible work schedules
  • Remote work opportunities (Some jobs are 100% remote, some are hybrid – we have offices across Europe, the Middle East, Asia, the UK, and the US)
  • Generous PTO
  • A casual dress work environment
  • Health Insurance
  • Gym allowance
  • Training allowance
  • Training days off
  • A company provided laptop
  • Appspace Quiet Fridays (No non-essential internal meetings scheduled)

Company values

  • Service Excellence: in how we treat each other and external customers and partners.
  • Principled: we are ethical, act with integrity, and do the right thing.
  • Adaptable: we’re flexible and remain resilient in the presence of change.
  • Camaraderie: we check egos and like to have fun, inside and outside the office.
  • Empowerment: we trust our employees and encourage leadership at all levels.

Company HQ

Farmers Branch, TX

Leadership

Brandon Miles

(Board Member)

Former Morgan Stanley Financial Analyst. Founded and served as CEO & President of Appspace until 2022, when Tony DiBenedetto was hired in the role.

Stan Stephens

(Chief Science Officer)

Co-founded Appspace in 2002 and has served as both CTO and CSO since then. Has a Master's degree in Advanced Distributed Systems from Lancaster University.

Share this job

View 5 more jobs at Appspace