Lead machine learning engineer (Model Architectures & Capabilities Team)

  • Full Time
  • London
  • Posted 8 months ago

Nebius

Job title:

Lead machine learning engineer (Model Architectures & Capabilities Team)

Company

Nebius

Job description

The company

Nebius is a modern technology enterprise offering strategic partnerships to leading companies around the world, empowering them to create their own local hyperscaler platforms and become trusted providers of cloud services and technologies in their own regions. In addition to innovative software and hardware, including server racks designed in-house, Nebius provides a launch-ready business model and customizable tools for support, sales, and marketing.

Our aim is to empower our partners to create their own IT infrastructure and deliver cutting-edge, disruptive cloud solutions to local markets while keeping security and compliance with international standards like ISO and GDPR top priorities.

We’re an international company with offices in the Netherlands, Israel and Serbia.

Our team

Nebius was founded by a core team of engineers and business professionals with a proven track record of using cloud technologies to create value for other businesses. We know from experience that cutting-edge technologies can only make an impact if their innovation is matched by the level of the specialists managing them, so as Nebius expands, our core priority is to attract the most qualified, enthusiastic, and driven individuals we can to join our growing team.

The Nebius Large Language Models (LLM) team is dedicated to pushing the boundaries of language modelling technology. We’re focused on developing a state-of-the-art LLM technological stack that spans web-scale data collection, foundational model training and alignment. Our overarching objective is to pioneer cutting-edge language generation technology for both internal use and customer applications, driving the evolution of the next generation of AI-powered products.

The position

We’re currently searching for the team lead for the Model Architectures & Capabilities team. The team is responsible for pushing forward the capabilities of the models, small and large, that we train in-house. This includes finding model architectures that efficiently achieve the desired capabilities, scaling these models to the limits of our hardware, and exploring novel ideas that could potentially expand what’s possible.

In this position, your responsibility will be to:

  • Lead the team responsible for model architectures and capabilities
  • Define strategy and tactics, i.e. decide what research and engineering directions to pursue to push the technology forwards, and help the team plan and execute the experiments that will get us there efficiently
  • Ensure high standards of engineering and research activities across the team
  • Keep improving the design of our internal infrastructure for training large models to ensure it stays fast and flexible as the technology moves forwards
  • Mentor our engineers and researchers

We expect you to have:

  • A profound understanding of theoretical foundations of machine learning
  • Deep expertise in modern deep learning for language processing and generation
  • Substantial experience with pre-training large models on large clusters
  • Good understanding of performance aspects of large neural network training (sharding strategies, custom kernels, hardware features etc.)
  • Strong software engineering skills (we mostly use Python)
  • Deep experience with modern deep learning frameworks (we use JAX)
  • Proficiency in contemporary software engineering approaches, including CI/CD, version control, and unit testing
  • Strong communication and leadership abilities

It would be an added bonus if you had:

  • Bachelor’s degree in Computer Science, Artificial Intelligence, Data Science, or a related field. Master’s or PhD preferred
  • Track record of building and delivering products (not necessarily ML-related) in a dynamic startup-like environment
  • Experience in engineering complex systems, such as large distributed data processing systems or high-load web services
  • Open-source projects that showcase your engineering prowess
  • Excellent command of the English language, alongside superior writing, articulation, and communication skills

Does all that sound like your kind of challenge? Then join us!

Expected salary

Location

London

Job date

Fri, 08 Mar 2024 01:57:29 GMT

To help us track our recruitment effort, please indicate in your email/cover letter where (globalvacancies.org) you saw this job posting.

To apply for this job please visit jobviewtrack.com.

Job Location