Machine Learning Engineer, Mistral AI

Salary not provided
TensorFlow
PyTorch
Junior, Mid and Senior level
London
Paris
Mistral AI

Generative AI model developers

101-200 employees

B2B, Artificial Intelligence, Deep Tech, SaaS

Company mission

To make frontier AI ubiquitous, and to provide tailor-made AI to all the builders.

Role

Who you are

  • Master's degree in Computer Science, Machine Learning, Data Science, or a related field
  • Expert programming skills in Python
  • MLOps or FullStack + ML experience
  • Proficiency in frameworks like PyTorch or TensorFlow
  • Adaptable, proactive and autonomous
  • Attention to detail and a drive to go the last mile to build near-perfect tools
  • Deep understanding of machine learning approaches and algorithms
  • Low-ego
  • Collaborative, with a real team-player mindset

Desirable

  • Experience with training and fine-tuning large language models (e.g., distillation, supervised fine-tuning, policy optimization)
  • Worked with LLMs
  • Worked with research teams before

What the job involves

  • You will be in charge of deploying state-of-the-art models in production environments, helping turn research breakthroughs into tangible solutions
  • Create and maintain tooling and services: both internal facing (research & dogfooding) and external facing (product)
  • Collaborate cross-functionally with researchers, software engineers, and product managers to understand complex business challenges and deliver AI-powered solutions
  • Implement and optimize ML pipelines for performance and accuracy, ensuring production readiness and employing cutting-edge technology and innovative approaches
  • Our ML Engineering team is embedded in our Product development organization (SWE & Product) and works very closely with our Science team
  • All our engineers can move fluidly along the production/research spectrum, depending on where the needs are or where their interests lie

Our take

Founded by former Facebook AI and DeepMind researchers, Mistral AI focuses on developing open-source generative AI models to "make AI useful." It emerged in early 2023, quickly achieving a $260 million valuation.

With the upcoming release of Mixtral in 2024, featuring up to 176 billion parameters, Mistral is under pressure to deliver. Mixtral's open-source nature has sparked concerns about potential misuse, yet Mistral remains a formidable competitor in the AI landscape, challenging models from Google and Meta.

Recently, Mistral secured a massive $640 million funding round, raising its valuation to $6 billion. This Series B funding, led by General Catalyst and including investors like Lightspeed Venture Partners and Andreessen Horowitz, underscores significant interest in alternatives to ChatGPT.

Freddie

Company Specialist

Insights

Some candidates hear back within 2 weeks

Company

Funding (last 2 of 4 rounds)

  • Jun 2024: $523.2m (Series B)
  • Feb 2024: $16.8m (Series A)

Total funding: $1.1bn

Company benefits

  • Competitive bonus structure
  • Equity
  • Opportunities for professional growth and development

Company HQ

Villette, Paris, France

Leadership

Previously PhD and Software Engineer Intern at Facebook and Visiting Undergraduate at CAlab UCSD

Guillaume Lample

(Chief Scientist)

Research Student and Scientist at Facebook AI and Research Intern at Jane Street Capital

Previously Staff Research Scientist at DeepMind, Post-doc researcher at École normale supérieure, Visiting Researcher at NYU Courant Institute of Mathematical Sciences and PhD Student at Inria
