ML Infrastructure Engineer, Abridge

$200-265k

Offers Equity

Kubernetes
Python
Tensorflow
Athena
PyTorch
Zoom
Senior and Expert level
San Francisco Bay Area

More information about location

3-5 days a week in office

Abridge

Powering deeper understanding in healthcare through purpose-built AI

Job no longer available

Abridge

Powering deeper understanding in healthcare through purpose-built AI

101-200 employees

HealthcareB2BArtificial IntelligenceEnterpriseMachine LearningSaaSMedTech

Job no longer available

$200-265k

Offers Equity

Kubernetes
Python
Tensorflow
Athena
PyTorch
Zoom
Senior and Expert level
San Francisco Bay Area

More information about location

3-5 days a week in office

101-200 employees

HealthcareB2BArtificial IntelligenceEnterpriseMachine LearningSaaSMedTech

Company mission

To power a deeper understanding in healthcare.

Role

Who you are

  • 5+ years of experience in ML model deployment and scaling, with a focus on production-quality software
  • Strong proficiency in Python and Kubernetes, with experience building scalable ML infrastructure
  • Expertise in designing fault-tolerant, highly available systems
  • Experience working with cloud environments, Infrastructure as Code (IaC), and managing deployments using Kubernetes
  • Proficiency in optimizing system performance, debugging production issues, and designing systems for scalability and security
  • Experience in software design and architecture for highly available machine learning systems for use cases like inference, evaluation, and experimentation
  • Excellent understanding of low-level operating systems concepts, including multi-threading, memory management, networking and storage, performance, and scale
  • Bachelor's/Master’s Degree or greater in Computer Science/Engineering, Statistics, Mathematics, or equivalent
  • Excellent interpersonal and written communication skills

Desirable

  • Experience with large-scale ML platforms like Ray, Databricks, or AnyScale
  • Expertise with ML toolchains such as PyTorch or TensorFlow
  • Proven experience working with distributed systems and handling inference at scale
  • Background in working with teams and leaders to deliver impactful ML-powered solutions in fast-paced environments
  • In machine learning toolchains and techniques, such as Pytorch or Tensorflow
  • Demonstrated experience incubating and productionizing new technology, working closely with research scientists and technical teams from idea generation through implementation

What the job involves

  • As an ML Systems Engineer at Abridge, you will be responsible for scaling and deploying machine learning models to handle increasing traffic demands and integrating them with various platforms
  • You'll play a pivotal role in building a scalable infrastructure that not only supports current deployments but also lays the foundation for long-term growth
  • Your role will be critical in ensuring our AI-driven healthcare platform is powered by robust, scalable, and efficiently deployed models
  • Architect, design, and implement ML software systems for deploying and managing models at scale
  • Stand up ML models for inference, starting with critical models like the 'linkages' model, and ensure they are capable of handling traffic increases
  • Develop and maintain infrastructure that supports efficient ML operations, including model evaluations, deployments, and training at scale
  • Collaborate closely with ML researchers, engineers, and cross-functional teams to ensure seamless integration of models with services like Zoom and Athena
  • Work with stakeholders across machine learning and operations teams to iterate on systems design and implementation
  • Optimize and maintain the performance of ML systems to ensure high availability, fault tolerance, and smooth scalability
  • Troubleshoot production issues and continuously improve systems to enhance performance and efficiency

Share this job

View 26 more jobs at Abridge

Insights

Top investors

Company

Company benefits

  • Unlimited PTO, plus 12 national holidays
  • 16 weeks paid parental leave, for all employees
  • Flexible working hours — we care more about what you accomplish than what specific hours you’re working
  • Remote work environment
  • Equity for all new employees
  • Generous equipment budget for your home office setup ($1600)
  • Opportunity to work and grow with talented individuals, and have ownership and impact at a high growth startup

Funding (last 2 of 6 rounds)

May 2025

$300m

SERIES E

Feb 2025

$250m

SERIES D

Total funding: $757.5m

Our take

Summarizing key elements of healthcare conversations is crucial for both physicians and patients. Medical scribes, though effective, are costly, and the healthcare system faces funding and staffing crises. This has created a demand for more efficient notetaking solutions.

This is where Abridge comes in, it has developed a HIPAA-compliant AI platform that transcribes, structures, summarizes, and provides insights from audio or transcripts of medical conversations. It claims to work across all specialities and offers patient-friendly summaries in easy-to-understand language.

Abridge aims to alleviate pressure on the healthcare system from the patient's perspective, a unique approach among healthtech startups. While companies like Inovia, DOUB, and DeepScribe focus on assisting physicians with medical transcription AI, they lack Abridge's patient-centric features, which are valuable in enhancing the patient experience.

Freddie headshot

Freddie

Company Specialist at Welcome to the Jungle