Staff Reliability Engineer, Cerebras Systems

Salary not provided
Python
Bash
Linux
Senior and Expert level
San Francisco Bay Area

Sunnyvale, CA

Cerebras Systems

AI specific computer chip manufacturer

Job no longer available

Cerebras Systems

AI specific computer chip manufacturer

201-500 employees

B2BArtificial IntelligenceHardware

Job no longer available

Salary not provided
Python
Bash
Linux
Senior and Expert level
San Francisco Bay Area

Sunnyvale, CA

201-500 employees

B2BArtificial IntelligenceHardware

Company mission

Cerebras Systems' mission is to speed up the development of artificial intelligence and change the future of work.

Role

Who you are

  • Experience defining reliability test plans, running reliability testing, and writing test reports for silicon components and high performance compute systems. The reliability testing includes but not limited to: HTOL, HAST, thermal, humidity, transportation testing, ESD, etc
  • Experience writing test scripts with Python, shell, and bash in Linux environment and effectively logging and analyzing a large amount of test results
  • Excellent communication, planning, and coordination skills across Systems, Operations, and Software teams
  • Familiarity with various reliability related failure modes for both silicon and systems and experience debugging hardware reliability issues
  • Experience working with external labs to perform failure analysis

What the job involves

  • Work with the Design team to define and execute the wafer-level reliability qualification plan
  • Work with the Design team to define and execute the system-level reliability validation plan
  • Work with the Design team to define and execute the board and sub-assembly-level reliability validation plan
  • Determine software and fixture requirements for performing reliability validation and work with System Software teams to implement the tests
  • Own the readiness of reliability chambers and external power and cooling infrastructure either through in-house development or working with external labs
  • Derive product burn-in requirements based on the results of the reliability validation
  • Work with the Design team and System Software team to debug issues exposed in reliability validation
  • The reliability engineer role will focus on the reliability validation and issue debugging both at the system level and at the component level

Share this job

Insights

Top investors

41% employee growth in 12 months

Company

Company benefits

  • 401k matching
  • Flexible Spending Account (FSA) program
  • Stock option plan
  • Flexible working hours
  • Work from home opportunities
  • Health insurance

Funding (last 2 of 6 rounds)

Nov 2021

$250m

Nov 2019

$272m

Total funding: $720m

Our take

Cerebras Systems creates super-fast computer chips for high-functioning AI systems. In 2019 they launched the world's largest computer chip, many times larger than those already available. With this one piece of hardware, they pressed fast forward on machine learning, with significant implications for the energy, medical, and defense sectors.

Founded by leading computer architects, the vision behind Cerebras was to enable the growth of AI through better hardware. Many of the management team are ex SeaMicro, a technology hardware company that sold for $334 million in 2012. So it's clear these experienced technologists know how to bring a successful product to market. Cerebras is already outpacing the more established chip makers such as Nvidia and Intel, particularly as neither chip maker is specifically focused on providing for AI systems.

Cerebras secured backing from some of the top technology venture capital firms early on, including big hitters Sequoia Capital. In 2109 Cerebras announced a multi-year partnership with the US Energy Department. Further confirmation that fast-tracked AI has the potential to impact a variety of industries in a big way.

Kirsty headshot

Kirsty

Company Specialist at Welcome to the Jungle