ML Research Engineer Internship, SmolLMs pretraining and datasets - US Remote

Hugging Face

At Hugging Face, we’re on a journey to democratize good AI. We are building the fastest-growing platform for AI builders, with over 5 million users and 100k organizations who have collectively shared over 1M models, 300k datasets, and 300k apps. Our open-source libraries have more than 400k stars on GitHub.

About the Role

Smol models are an exciting area of research as they enable cheaper inference and can be run on-device, allowing for more customization and ensuring privacy. The SmolLM team at Hugging Face is pushing the frontier of smol models by building high-quality pre-training and post-training datasets [1,2] and applying the latest architecture and training techniques to develop state-of-the-art models [2,3]. Dataset processing can leverage our scalable CPU cluster, and the models are trained on a state-of-the-art H100 cluster with close to 100 nodes.

In this internship you will work alongside the SmolLM team and work towards...