facebookresearch / SemDeDup

Code for "SemDeDup", a simple method for identifying and removing semantic duplicates from a dataset (data pairs which are semantically similar, but not exactly identical).
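
As a rough illustration of what "semantic duplicate" means in the description above, the sketch below drops items whose embedding is nearly parallel to one already kept. It is not the repository's implementation: the function names, the 0.95 threshold, and the toy 2-D embeddings are all assumptions made for this example, and it assumes every embedding is a non-zero vector.

```python
import math


def cosine(u, v):
    """Cosine similarity between two non-zero vectors of equal length."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def drop_semantic_duplicates(embeddings, threshold=0.95):
    """Greedy filter (hypothetical helper, not the SemDeDup API).

    Walks the items in order and keeps an item only if its cosine
    similarity to every previously kept item is at most `threshold`.
    Returns the indices of the kept items.
    """
    kept = []
    for i, emb in enumerate(embeddings):
        if all(cosine(emb, embeddings[j]) <= threshold for j in kept):
            kept.append(i)
    return kept


# Toy embeddings (assumed): items 0 and 1 point in almost the same
# direction, item 2 is orthogonal to item 0.
embeddings = [
    [1.0, 0.0],
    [0.99, 0.05],
    [0.0, 1.0],
]
kept = drop_semantic_duplicates(embeddings)
print(kept)
```

This greedy pass compares each item against all kept items, which is quadratic in the worst case; it is meant only to make the idea concrete on small inputs.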
112 · Updated last year

Related projects

Alternatives and complementary repositories for SemDeDup