Machine Learning Engineer Internship, Information Retrieval - US Remote
Hugging Face
At Hugging Face, we’re on a journey to democratize good AI. We are building the fastest-growing platform for AI builders, with over 5 million users & 100k organizations who have collectively shared over 1M models, 300k datasets & 300k apps. Our open-source libraries have more than 400k stars on GitHub.
About the Role
In Information Retrieval, modern search solutions combine semantic search (matching by similar meaning) and lexical search (matching exact keywords). The former is often implemented via a dense bi-encoder (a.k.a. Sentence Transformer) model, whereas the latter usually involves a sparse model or algorithm (e.g. SPLADE, BM25).
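To make the combination concrete, here is a minimal sketch of hybrid scoring in plain Python. It is an illustration only, not how any particular search system or the Sentence Transformers library does it: the lexical side is a small Okapi BM25 implementation, the semantic side uses hand-made two-dimensional vectors standing in for real bi-encoder embeddings, and the weighted sum of min-max-normalized scores (with the hypothetical weight `alpha`) is just one common way to fuse the two signals.

```python
import math
from collections import Counter


def bm25_scores(query, docs, k1=1.5, b=0.75):
    """Score each tokenized document against a tokenized query with Okapi BM25."""
    n = len(docs)
    avgdl = sum(len(d) for d in docs) / n
    # Document frequency: number of documents containing each term.
    df = Counter(term for d in docs for term in set(d))
    scores = []
    for d in docs:
        tf = Counter(d)
        score = 0.0
        for term in query:
            if term not in tf:
                continue
            idf = math.log(1 + (n - df[term] + 0.5) / (df[term] + 0.5))
            length_norm = 1 - b + b * len(d) / avgdl
            score += idf * tf[term] * (k1 + 1) / (tf[term] + k1 * length_norm)
        scores.append(score)
    return scores


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def min_max(xs):
    """Rescale scores to [0, 1] so lexical and semantic scores are comparable."""
    lo, hi = min(xs), max(xs)
    return [0.0] * len(xs) if hi == lo else [(x - lo) / (hi - lo) for x in xs]


def hybrid_scores(lexical, semantic, alpha=0.5):
    """Weighted sum of normalized scores; alpha is the weight of the semantic side."""
    lex, sem = min_max(lexical), min_max(semantic)
    return [alpha * s + (1 - alpha) * l for l, s in zip(lex, sem)]


docs = [
    ["cheap", "flights", "to", "paris"],
    ["affordable", "airfare", "for", "france", "trips"],
    ["paris", "restaurant", "guide"],
]
query = ["cheap", "paris", "flights"]

# Toy embeddings (an assumption): dimension 0 ~ "travel deals", dimension 1 ~ "dining".
doc_vecs = [[0.9, 0.1], [0.8, 0.2], [0.1, 0.9]]
query_vec = [1.0, 0.0]

lexical = bm25_scores(query, docs)
semantic = [cosine(query_vec, v) for v in doc_vecs]
hybrid = hybrid_scores(lexical, semantic)

lexical_ranking = sorted(range(len(docs)), key=lambda i: -lexical[i])
hybrid_ranking = sorted(range(len(docs)), key=lambda i: -hybrid[i])
```

In this toy setup the second document shares no keywords with the query, so BM25 alone ranks it last, while the hybrid score moves it above the restaurant guide because its (made-up) embedding is close to the query's.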
Currently, there is no accessible, de facto solution for training or fine-tuning neural sparse models. To address this, this internship aims to implement an existing neural sparse model architecture and a matching trainer in the Sentence Transformers library, prioritizing ease of use.
About...