Data Software Engineer, Twelve Labs

Salary not provided
Python
TensorFlow
PyTorch
Senior and Expert level
San Francisco Bay Area

Twelve Labs

AI-enabled video understanding technology

Open for applications

21-100 employees

B2B, Artificial Intelligence, Content, Machine Learning, SaaS, Video

Company mission

To help developers build programs that can see, listen, and understand the world by giving them the most powerful video understanding infrastructure.

Role

Who you are

  • 7+ years of industry experience (or 4+ with a PhD in a related technical domain)
  • A PhD, or a Master's degree, in machine learning or a closely related discipline
  • Led teams of 3+ engineers as a technical lead
  • Experience building model-bootstrapped language or vision-language datasets (RLAIF, etc.)
  • Managed data acquisition for large generative or contrastive models
  • Experience with FFmpeg or other high-performance image/video processing libraries (bonus points for past work with such processing on GPUs/accelerators)
  • Deep experience as a backend and/or data engineer & an interest in ML/AI systems
  • Strong Python expertise and considerable prior work history with at least one statically typed language (we use Golang)
  • Strong communication skills in written and spoken English

What the job involves

  • As the ML Data Infrastructure Lead at Twelve Labs, you will lead the data team, managing data infrastructure and preparing high-quality video data for our training runs
  • Unlike text or image, video is complex to process (because of size and decoding), multimodal (visual and audio), and has a temporal aspect
  • Information in video can easily become redundant while still depending on earlier information (as in text)
  • Because of the complexity of data processing at Twelve Labs, this role will have a significant impact on the quality of our models
  • Acquire and deliver massive and high-quality datasets for our large training runs
  • Develop and implement best practices and data pipelines (ingest, annotate, and incorporate high-quality datasets into model training and evaluation) by working with internal and external data partners
  • Improve our data infrastructure (e.g., management, versioning) by collaborating with software engineers and security engineers
  • Collaborate with modeling and product teams to evaluate the impact of the data on our models and continuously improve the data quality
  • Hire, provide career growth guidance, coaching, and training for engineers on your team
  • Work across teams to understand and manage project priorities and product deliverables, evaluate trade-offs, and drive technical initiatives from execution to landing

Application process

  • Recruiter Phone Screen
  • Hiring Manager Call
  • Technical Interview and/or Take Home Assignment
  • Culture Interview
  • Reference Checks

Otta's take

Xav Kearney

CTO of Otta

Developing an algorithm that can understand text or images is (relatively) straightforward. However, it becomes a lot more challenging when it’s required to understand video, where these modes fuse with audio and context becomes much harder to gauge.

Twelve Labs has developed a Machine Learning solution that can do just that, then make the inner content of the video indexable for developers and highly searchable for users.

This kind of tech could prove immensely valuable as its use cases go far beyond searchability for the end user. In theory it could be used for more accurate community guidelines monitoring on social media, enterprise knowledge search, and a better overall understanding of the value of video content.

Founded in 2021, Twelve Labs is still a relatively young startup. However, Index Ventures, Radical Ventures, Expa, and Techstars have provided significant backing, showing that there’s plenty of confidence in the company’s potential.

Insights

Led by a woman
Top investors

Some candidates hear back within 2 weeks

Company

Funding (last 2 of 5 rounds)

Jun 2024: $50m (Series A)
Oct 2023: $10m (Early VC)

Total funding: $77.2m

Company benefits

  • Voluntary commuting, voluntary remote work and flexible work system (Work-from-anywhere & anyhow)
  • Home office setup stipend
  • Market-leading competitive compensation packages (salary, stock options, etc.)

Company HQ

SoMa, San Francisco, CA

Founders

Jae Lee

(CEO)

After interning as a Software Engineer at Samsung and Amazon, they joined the Republic of Korea's Cyber Operations Command as a Lead Data Scientist. They co-founded Twelve Labs after this period of military service.

Interned at Korea Advanced Institute of Science and Technology (KAIST) as a Deep Learning Researcher. Completed their National Defense duty as an AI & ML Engineer in the Cyber Operations Command.

Sung Jun (SJ) Kim

(Head Of Software Architecture)

Like co-founders Aiden and Jae Lee, served for the Ministry of National Defense as a Lead Software Engineer in the Cyber Operations Command. Before this they were a Cyber Security Research Scientist at Sungkyunkwan University.

Dave Chung

(Head of Operations)

Previously a Project Team Lead at the Institute of East and West Studies (IEWS) at Yonsei University, focusing on the planning and development of the Korean Web 3.0 ecosystem (Funded by the Ministry of ICT).

Soyoung Lee

(Head of Business Development)

Worked in risk assurance for PricewaterhouseCoopers in Seoul before co-founding Twelve Labs.
