Senior Software Engineer, Onehouse

Open Source

Salary not provided

+ Equity

Java
Linux
Unix
Senior and Expert level
San Francisco Bay Area

Office located in Sunnyvale, CA

Onehouse

Pre-built data lakehouse foundation

Open for applications

21-100 employees

B2B, Enterprise, Big data, SaaS, Data Analysis, Cloud Computing

Company mission

To aid companies of all sizes in supercharging their data engineering/data science, by automating painful data infrastructure buildout.

Role

Who you are

  • 5-7+ years building large-scale data systems
  • You embrace ambiguous/undefined problems with an ability to think abstractly and articulate technical challenges and solutions
  • Positive attitude towards seeking solutions to hard problems, with a bias towards action and forward progress
  • An ability to quickly prototype new directions, shape them into real projects and analyze large/complex data
  • Strong object-oriented design and coding skills with Java, preferably on a UNIX or Linux platform
  • Experience with inner workings of distributed (multi-tiered) systems, algorithms, and relational databases
  • Experience with large scale data compute engines / processing frameworks
  • Experience building distributed and/or data storage systems or query engines
  • An ability to prioritize across feature development and tech debt, balancing urgency and speed
  • An ability to solve complex programming/optimization problems
  • Robust and clear communication skills

Desirable

  • Experience working with open source projects and communities
  • Experience in optimization mathematics (linear programming, nonlinear optimization)
  • Publications on optimizing large-scale data systems at top-tier distributed systems conferences
  • PhD or Master's degree in a related field with industry experience in solving and delivering high-impact optimization projects

What the job involves

  • When you join Onehouse, you're joining a team of passionate professionals tackling the deeply technical challenges of building a 2-sided engineering product
  • Our engineering team serves as the bridge between the worlds of open source and enterprise: contributing directly to and growing Apache Hudi (already used at scale by global enterprises like Uber, Amazon, and ByteDance) and concurrently defining a new industry category, the transactional data lake
  • As an engineer on the Open Source team at Onehouse, you'll play a pivotal role in shaping and realizing the vision and roadmap for Apache Hudi, while also shaping the future of the data lakehouse space
  • Collaborate across multiple teams within Onehouse, serving as the vital bridge between the open-source Apache Hudi project and Onehouse's managed solution, ensuring seamless collaboration and integration
  • Engage closely with community partners and contributors, serving as a steward of the Apache Hudi project, fostering collaboration and guiding its evolution
  • Champion a culture of innovation, quality and timely execution, enabling the team to deliver on the vision of the next-generation data lakehouse
  • Architect and implement solutions that scale to accommodate the rapid growth of our customer base, open source community and the ever-expanding demands of the data lake ecosystem at large
  • Build, design and deliver features/improvements to Apache Hudi
  • Ensure high quality and timely delivery of innovations and improvements in Apache Hudi
  • Dive deep into the architectural details of data ingestion, data storage, data processing and data querying to ensure that Apache Hudi is built to be the most robust, scalable and interoperable data lakehouse
  • Own discussions and work with open source partners/vendors to: troubleshoot issues with Hudi, ensure Hudi support in compute engines like Presto/Trino, and act as the face of Hudi to the community at large via meetups, customer meetings, talks, etc.
  • Partner with and mentor engineers on the team

Our take

Managing the ballooning volume of unstructured data is becoming a tough task for enterprise companies. The traditional solution, data lakes, doesn’t offer management or transaction capabilities. This lack of oversight could lead to data violations, which are becoming more costly as regulations tighten. Onehouse is catering to the growing number of businesses opting for an alternative, the so-called ‘data lakehouse’. It's a hybrid architecture that offers the management and transaction capabilities of a warehouse, with the cost-effectiveness of a data lake.

The Onehouse platform is a management plane that helps businesses set up a data lakehouse without having to invest the time and expertise in building one from scratch. With an open data format, it can be used to work with protected or sensitive data; it also allows companies to easily pull their data from Onehouse without egress fees if they decide to leave the service.

Onehouse has carved out an astute market niche for itself: businesses under increasingly close scrutiny, but with ballooning data pools, who don’t need to build out highly customized lakehouses. For the moment, this tends to be top-tier enterprises, which is how Onehouse has secured deep-pocketed clients like Walmart, Amazon, Zendesk, and Uber. As the first company to make fully managed data lakes possible, it is no surprise that it has received substantial funding. This will allow it to continue advancing the platform and grow its team to meet market demand.

Steph

Company Specialist

Insights

Company

Funding (last 2 of 3 rounds)

Jun 2024: $35m (Series B)
Feb 2023: $25m (Series A)

Total funding: $68m

Company benefits

  • Health, dental, vision
  • Unlimited PTO
  • Paid parental leave
  • Equity
  • Flexible schedule
  • Contribute directly to open source
  • Work and grow with an experienced team

Company HQ

Sharon Heights, Menlo Park, CA

Leadership

Previously a Principal Engineer at Uber, then Confluent, and subsequently served as VP of Apache Hudi at The Apache Software Foundation.
