Software Engineer, Onehouse

Salary not provided
Java
Linux
C++
C
Hadoop
Spark
Flink
Hive
Parquet
Unix
Mid level
San Francisco Bay Area

Office located in Sunnyvale, CA

Onehouse

Pre-built data lakehouse foundation

Open for applications

Onehouse

Pre-built data lakehouse foundation

21-100 employees

B2BEnterpriseBig dataSaaSData AnalysisCloud Computing

Open for applications

Salary not provided
Java
Linux
C++
C
Hadoop
Spark
Flink
Hive
Parquet
Unix
Mid level
San Francisco Bay Area

Office located in Sunnyvale, CA

21-100 employees

B2BEnterpriseBig dataSaaSData AnalysisCloud Computing

Company mission

To aid companies of all sizes in supercharging their data engineering/data science, by automating painful data infrastructure buildout.

Role

Who you are

  • 3+ years of experience as a software engineer with experience developing distributed systems
  • Strong, object-oriented design and coding skills (C/C++ and/or Java preferably on a UNIX or Linux platform)
  • Experience with inner workings of distributed (multi-tiered) systems, algorithms, and relational databases
  • Deal well with ambiguous/undefined problems; ability to think abstractly; articulate technical challenges and solutions
  • Speed and hustle → Ability to prioritize across feature development and tech debt
  • Ability to solve complex programming/optimization problems
  • Ability to quickly prototype optimization solutions and analyze large/complex data
  • Clear communication skills

Desirable

  • Experience working on database systems, Query Engines or Spark codebases
  • Experience working on cloud based (data focused) services
  • Deep understanding of Spark, Flink, Presto, Hive, Parquet internals
  • Hands-on experience with open source projects like Hadoop, Hive, Delta Lake, Hudi, Nifi, Drill, Pulsar, Druid, Pinot, etc

What the job involves

  • Build systems that enable users to manage petabytes of data with a fully managed cloud service
  • Build functionality that enables data systems to be cloud native (self managed), scalable (auto scaling) and secure (different levels of access control)
  • Build scalable job management on Kubernetes to ingest, store, manage and optimize petabytes of data on cloud storage
  • Design systems that help scale and streamline metadata and data access from different query/compute engines
  • Exhibit full ownership of product features, including design and implementation, from concept to completion
  • Be passionate about designing for future scale and high availability, while possessing a deep understanding of common failure patterns and their remediations
  • Uphold a high engineering bar around the code, monitoring, operations, automated testing, release management of the platform

Our take

Managing the ballooning volume of unstructured data is becoming a tough task for enterprise companies. The traditional solution, data lakes, doesn’t offer management or transaction capabilities. This lack of oversight could lead to data violations, that are becoming more costly as regulations tighten. Onehouse is catering to the growing number of businesses opting for an alternative, the so-called ‘data lakehouse’. It's hybrid architecture that offers the management and transaction capabilities of a warehouse, with the cost-effectiveness of a data lake.

The Onehouse platform is a management plane that helps businesses set up a data lakehouse without having to invest the time and expertise in building one from scratch. With an open data format, it can be used to work with protected or sensitive data; it also allows companies to easily pull their data from Onehouse without egress fees if they decide to leave the service.

Onehouse has carved out an astute market niche for itself: businesses under increasingly close scrutiny, but with ballooning data pools, who don’t need to build out highly customized lakehouses. For the moment, this tends to be top-tier enterprises, which is how Onehouse has secure deep pocketed clients like Walmart, Amazon, Zendesk, and Uber. As the first company to make fully managed data lakes possible, it is no surprise that it has received substantial funding. This will allow it to continue advancing the platform and grow its team to meet market demand.

Steph headshot

Steph

Company Specialist

Insights

Company

Funding (last 2 of 3 rounds)

Jun 2024

$35m

SERIES B

Feb 2023

$25m

SERIES A

Total funding: $68m

Company benefits

  • Health, dental, vision
  • Unlimited PTO
  • Paid parental leave
  • Equity
  • Flexible schedule
  • Contribute directly to open source
  • Work and grow with an experienced team

Company HQ

Sharon Heights, Menlo Park, CA

Leadership

Previously a Principal Engineer at Uber, then Confluent, and subsequently served as VP of Apache Hudl at The Apache Software Foundation.

Salary benchmarks

We don't have enough data yet to provide salary benchmarks for this role.

Submit your salary to help other candidates with crowdsourced salary estimates.

Share this job

View 7 more jobs at Onehouse