Data Software Engineer, Twelve Labs

Salary not provided
Python
Tensorflow
PyTorch
Senior and Expert level
San Francisco Bay Area

More information about location

Twelve Labs

AI-enabled video understanding technology

Open for applications

Twelve Labs

AI-enabled video understanding technology

21-100 employees

B2BArtificial IntelligenceContentMachine LearningSaaSVideo

Open for applications

Salary not provided
Python
Tensorflow
PyTorch
Senior and Expert level
San Francisco Bay Area

More information about location

21-100 employees

B2BArtificial IntelligenceContentMachine LearningSaaSVideo

Company mission

To build the best video understanding platform driven by leading perceptual-reasoning research

Role

Who you are

  • 7+ years of industry experience (or 4+ with a PhD in a related technical domain)
  • A PhD, or a Master's degree, in machine learning or a closely related discipline
  • Led teams of 3+ engineers as a technical lead
  • Experience building model-bootstrapped language or vision-language datasets (RLAIF, etc.)
  • Managed data acquisition for large generative or contrastive modelsExperience with FFmpeg or other high performance image/video processing libraries (bonus points for past work with such processing on GPUs/accelerators)
  • Deep experience as a backend and/or data engineer & an interest in ML/AI systems
  • Strong Python expertise and considerable prior work history with at least one statically typed language (we use Golang)
  • Strong communication skills in written and spoken English

What the job involves

  • As the ML Data Infrastructure Lead at Twelve Labs, you will lead the data team, managing data infrastructure and preparing high quality video data for our training runs
  • Unlike text or image, video is complex to process (because of size and decoding), multimodal (visual and audio), and has a temporal aspect
  • Information can become easily redundant while being dependent on earlier information (like text)
  • Because of the complexity of data processing at Twelve Labs, this role will have a significant impact on the quality of our models
  • Acquire and deliver massive and high-quality datasets for our large training runs
  • Develop and implement best practices and data pipelines (ingest, annotate, and incorporate high-quality datasets into model training and evaluation) by working with internal and external data partners
  • Improve our data infrastructure (e.g., management, versioning) by collaborating with software engineers and security engineers
  • Collaborate with modeling and product teams to evaluate the impact of the data on our models and continuously improve the data quality
  • Hire, provide career growth guidance, coaching, and training for engineers on your team
  • Work across teams to understand and manage project priorities and product deliverables, evaluate trade-offs, and drive technical initiatives from execution to landing

Application process

  • Recruiter Phone Screen
  • Hiring Manager Call
  • Technical Interview and/or Take Home Assignment
  • Culture Interview
  • Reference Checks

Our take

Developing an algorithm that can understand text or images is (relatively) straightforward. However, the challenge escalates when it comes to understanding video, where these modes merge with audio, and context becomes much harder to grasp.

Twelve Labs has developed a machine learning solution that tackles this challenge by making the inner content of videos both indexable for developers and highly searchable for users. This technology could prove immensely valuable, with use cases extending far beyond simple searchability for end users. It could be employed for more accurate monitoring of community guidelines on social media, enterprise knowledge searches, and a deeper understanding of the value of video content.

With recent funding under its belt, Twelve Labs is set for rapid growth. The investment will drive R&D, nearly double its workforce, and advance its video understanding technology. As it expands, Twelve Labs is well-positioned to lead the future of multimodal AI and revolutionize how organizations extract values from video content.

Steph headshot

Steph

Company Specialist

Insights

Led by a woman
Top investors

Some candidates hear
back within 2 weeks

Company

Funding (last 2 of 5 rounds)

Jun 2024

$50m

SERIES A

Oct 2023

$10m

EARLY VC

Total funding: $77.2m

Company benefits

  • Full health, dental, and vision benefits
  • Extremely flexible PTO and parental leave policy. Office closed the week of Christmas and New Years
  • Remote-flexible, offices in San Francisco and Seoul and coworking stipend
  • VISA support (such as H1B and OPT transfer for US employees)

Company values

  • Curiosity is power: The complexities of understanding videos present incredibly challenging problems. Curiosity fuels us to create technology that is not only adaptable but also exceptionally robust. Our products have unlocked a powerful new way for developers and enterprises to amplify their understanding of petabytes of data, and we are just getting started
  • Embrace the underdog: We strive to be the best by championing innovation, resilience, and empathy. Failure does not intimidate us but encourages us to always speak up and try new things. In the face of adversity, we rally together, and it is our collective purpose that propels us forward achieving things as a team that exceed even our own expectations
  • Thought leaders: We're not just part of the industry; we shape it. At Twelve Labs, ​​you will immerse yourself in an environment brimming with creativity and forward-thinking. Each day presents an opportunity to gain invaluable insights, challenge conventional wisdom, and pioneer new frontiers as we push the boundaries of AI with our research

Company HQ

SoMa, San Francisco, CA

Founders

Jae Lee

(Co-Founder & CEO)

After Interning as a Software Engineer at Samsung and Amazon, they joined the Republic of Korea's Cyber Operations Command as a Lead Data Scientist. Co-founded Twelve Labs after this period of military service.

Aiden Lee

(Co-Founder & CTO)

Interned at Korea Advanced Institute of Science and Technology (KAIST) as a Deep Learning Researcher. Completed their National Defense duty as an AI & ML Engineer in the Cyber Operations Command.

Sung Jun (SJ) Kim

(Co-Founder)

Served for the Ministry of National Defense as a Lead Software Engineer in the Cyber Operations Command. Before this, they were a Cyber Security Research Scientist at Sungkyunkwan University.

Dave Chung

(Co-Founder)

Previously a Project Team Lead at the Institute of East and West Studies (IEWS) at Yonsei University, focusing on the planning and development of the Korean Web 3.0 ecosystem (Funded by the Ministry of ICT).

Soyoung Lee

(Co-Founder)

Worked in Risk Assurance for PricewaterhouseCoopers in Seoul before co-founding Twelve Labs.

Salary benchmarks

We don't have enough data yet to provide salary benchmarks for this role.

Submit your salary to help other candidates with crowdsourced salary estimates.

Share this job

View 8 more jobs at Twelve Labs