SeanLee97 / AnglE
Train and Infer Powerful Sentence Embeddings with AnglE | 🔥 SOTA on STS and MTEB Leaderboard
★556 · Updated last week
Alternatives and similar repositories for AnglE
Users who are interested in AnglE are comparing it to the libraries listed below.
- Generative Representational Instruction Tuning ★675 · Updated 4 months ago
- Contriever: Unsupervised Dense Information Retrieval with Contrastive Learning ★759 · Updated 2 years ago
- RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking. ★543 · Updated 3 weeks ago
- Is ChatGPT Good at Search? LLMs as Re-Ranking Agent [EMNLP 2023 Outstanding Paper Award] ★640 · Updated last year
- SGPT: GPT Sentence Embeddings for Semantic Search ★875 · Updated last year
- Tevatron - Unified Document Retrieval Toolkit across Scale, Language, and Modality. Demo in SIGIR 2023, SIGIR 2025. ★701 · Updated 3 weeks ago
- [EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627 ★497 · Updated last year
- Guideline following Large Language Model for Information Extraction ★404 · Updated 11 months ago
- ★369 · Updated last year
- HyDE: Precise Zero-Shot Dense Retrieval without Relevance Labels ★555 · Updated 10 months ago
- Forward-Looking Active REtrieval-augmented generation (FLARE) ★655 · Updated last year
- Easily embed, cluster and semantically label text datasets ★578 · Updated last year
- Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders' ★1,599 · Updated 9 months ago
- SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models ★571 · Updated last year
- Dense X Retrieval: What Retrieval Granularity Should We Use? ★163 · Updated last year
- Code repository for supporting the paper "Atlas Few-shot Learning with Retrieval Augmented Language Models",(https//arxiv.org/abs/2208.03… ★548 · Updated last year
- Multilingual/multidomain question generation datasets, models, and python library for question generation. ★364 · Updated last year
- A large-scale information-rich web dataset, featuring millions of real clicked query-document labels ★345 · Updated 10 months ago
- Code for paper "G-Eval: NLG Evaluation using GPT-4 with Better Human Alignment" ★385 · Updated last year
- All-in-one text de-duplication ★723 · Updated last month
- Official repository for ORPO ★464 · Updated last year
- Calculate perplexity on a text with pre-trained language models. Support MLM (eg. DeBERTa), recurrent LM (eg. GPT3), and encoder-decoder … ★162 · Updated 4 months ago
- ★560 · Updated 2 years ago
- ★544 · Updated 11 months ago
- Data and code for FreshLLMs (https://arxiv.org/abs/2310.03214) ★377 · Updated this week
- RefChecker provides automatic checking pipeline and benchmark dataset for detecting fine-grained hallucinations generated by Large Langua… ★396 · Updated 5 months ago
- Official repository of NEFTune: Noisy Embeddings Improves Instruction Finetuning ★401 · Updated last year
- Train Models Contrastively in Pytorch ★753 · Updated 7 months ago
- [EMNLP 2023] Adapting Language Models to Compress Long Contexts ★314 · Updated last year
- This is the repository of HaluEval, a large-scale hallucination evaluation benchmark for Large Language Models. ★517 · Updated last year