Site Reliability Engineer, LightEdge Solutions

Salary not provided
ServiceNow
Senior level
Austin
Chicago
LightEdge Solutions

Cloud hosting, colocation & consulting

Be an early applicant

LightEdge Solutions

Cloud hosting, colocation & consulting

201-500 employees

B2BData storageComplianceConsultingSaaSCloud Computing

Be an early applicant

Salary not provided
ServiceNow
Senior level
Austin
Chicago

201-500 employees

B2BData storageComplianceConsultingSaaSCloud Computing

Company mission

To create and capture opportunities for our clients through the right solutions targeting the right problems.

Role

Who you are

  • 5 years hands-on experience with enterprise monitoring solutions
  • Must possess knowledge of Network Switches, Server hardware, Storage, and Virtualization Technologies
  • Understanding of VMware Infrastructure
  • Experience working with variety of monitoring systems such as Zabbix, vRealize Operations Manager, Nagios and Science Logic
  • Experience and proficiency in integrating with ServiceNow or similar IT service management platforms
  • Experience with managing automations within a monitoring environment
  • Ability to provide guidance with design, maintenance, and improvements to enterprise level monitoring solutions
  • Excellent verbal and written communication skills, ability to present complex ideas and designs to a variety of technical or non-technical stakeholders
  • Experience with design, implementation, and support of monitoring tools in a complex, multi-platform environment
  • High level of understanding monitoring requirements for Storage, Network, and Compute servers

What the job involves

  • As a Site Reliability Engineer (SRE), you will be an integral part of the team at LightEdge Solutions
  • This position will report to the DevOps Manager, and will be responsible for reliable operation of the organization’s systems and services
  • You will play a key role in identifying our monitoring strategy and vision across multiple products and work with a variety of teams to improve the accuracy of our monitoring systems
  • Monitoring and Observability: Design and implement monitoring solutions to track the performance, availability, and health of various systems and services. Establish robust monitoring frameworks, set up alerts, and analyze system metrics to identify and resolve issues proactively
  • Establish and align metrics, including SLAs, SLOs, and SLIs, to closely tie system performance to business objectives, ensuring that the site reliability engineering efforts support the overall goals and customer satisfaction
  • Utilize AIOPS techniques to leverage automation in Incident Management and Response. Develop and maintain automated incident response systems that can detect and mitigate issues automatically. This includes automated incident triaging, remediation, and escalation workflows to minimize manual intervention and improve response times
  • Leverage the IT Service Management (ITSM) platform’s capabilities to integrate monitoring into incident management, change management, and other operational processes, enhancing the efficiency and effectiveness of site reliability engineering practices
  • Working closely with IT functional owners & SME’s
  • Perform implementation, monitoring system administration and integration functions
  • Tasks will consist of developing detailed designs, execution and troubleshooting of strategic solutions in support of effective monitoring, alerting, escalation, automation, reporting and event correlation

Our take

Founded in 1996, LightEdge Solutions is a leader in public cloud and enterprise IT technology solutions, with a focus on private, public, hybrid and multicloud colocation. Its solutions help companies connect to the cloud at a faster pace and with less hassle. It has supported more than 1300 large organisations through 12 data centres in 8 US markets.

With its solid rooting in the industry and the experience to match, it’s no surprise that LightEdge’s offerings have grown to include a full suite of digitisation solutions. From IT infrastructure optimisation to bolstering cybersecurity, Lightedge benefits from catering to a wide range of enterprise needs.

Since its buyout by GI Partners in 2021, LightEdge has embarked on a series of acquisitions itself. With four companies bought since 2021, it has stepped up successfully into this new portfolio role - and these acquisitions should keep LightEdge stable and ripe for further growth for a long time to come.

Steph headshot

Steph

Company Specialist

Insights

Few candidates hear
back within 2 weeks

Company

Funding (1 round)

Apr 2004

$5m

GROWTH EQUITY VC

Total funding: $5m

Company benefits

  • Health, Vision & Dental
  • Life & Long-Term Disability Insurance
  • 401(K) Match
  • PTO, Sick/Personal Time & 10 Paid Holidays
  • Paid Maternity, Paternity & Adoption Leave

Company values

  • EXCEED EXPECTATIONS - Take pride in your work, and always put the customer first
  • DO THE RIGHT THING - Be honest, understanding, and kind. Leave egos at the door
  • GROW THROUGH INNOVATION - Demonstrate an entrepreneurial spirit in all that you do
  • EMBRACE TEAMWORK - Exhibit locker room leadership, maintain positivity, and celebrate every victory

Company HQ

Downtown Des Moines, Des Moines, IA

Leadership

Jim Masterson

(CEO & President)

Previously SVP of Sales and Marketing for Terabeam, and VP of Applications and Marketing at Rhythms NetConnections. Was also an Executive Director for U S WEST.

Salary benchmarks

We don't have enough data yet to provide salary benchmarks for this role.

Submit your salary to help other candidates with crowdsourced salary estimates.

Share this job

View 8 more jobs at LightEdge Solutions