allenai / oocmap
A file-backed dictionary for Python
☆12 · Updated 3 years ago
Alternatives and similar repositories for oocmap
Users who are interested in oocmap are comparing it to the libraries listed below.
- Drop-in replacements for Python's map function ☆15 · Updated 2 years ago
- Code for SaGe subword tokenizer (EACL 2023) ☆27 · Updated last year
- Hyperparameter Search for AllenNLP ☆140 · Updated 11 months ago
- INCOME: An Easy Repository for Training and Evaluation of Index Compression Methods in Dense Retrieval. Includes BPR and JPQ. ☆24 · Updated 2 years ago
- ☆47 · Updated 3 years ago
- ☆75 · Updated 4 years ago
- M2D2: A Massively Multi-domain Language Modeling Dataset (EMNLP 2022) by Machel Reid, Victor Zhong, Suchin Gururangan, Luke Zettlemoyer ☆54 · Updated 3 years ago
- Rust library for indexing and quickly searching large pretraining corpora ☆30 · Updated 3 months ago
- One-stop shop for running and fine-tuning transformer-based language models for retrieval ☆63 · Updated last month
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset. ☆47 · Updated 2 years ago
- ☆38 · Updated last year
- Query-focused summarization data ☆43 · Updated 2 years ago
- ☆89 · Updated 10 months ago
- ☆12 · Updated 3 years ago
- Repro is a library for easily running code from published papers via Docker. ☆41 · Updated 2 years ago
- ☆102 · Updated 3 years ago
- Repository with code for MaChAmp: https://aclanthology.org/2021.eacl-demos.22/ ☆90 · Updated this week
- Code & data for EMNLP 2020 paper "MOCHA: A Dataset for Training and Evaluating Reading Comprehension Metrics". ☆16 · Updated 3 years ago
- ☆72 · Updated 2 years ago
- ☆27 · Updated 11 months ago
- Dense hybrid representations for text retrieval ☆64 · Updated 2 years ago
- The official code of EMNLP 2022, "SCROLLS: Standardized CompaRison Over Long Language Sequences". ☆69 · Updated 2 years ago
- Official repository for our EACL 2023 paper "LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization" (https… ☆44 · Updated last year
- SacreROUGE is a library dedicated to the use and development of text generation evaluation metrics with an emphasis on summarization. ☆148 · Updated 3 years ago
- Chu-Liu-Edmonds decoding extracted from TurboParser ☆14 · Updated 8 years ago
- Starbucks: Improved Training for 2D Matryoshka Embeddings ☆22 · Updated 7 months ago
- Library for fast text representation and classification. ☆31 · Updated 2 years ago
- Automatically detect errors in annotated corpora. ☆48 · Updated 2 years ago
- Using business-level retrieval system (BM25) with Python in just a few lines. ☆31 · Updated 3 years ago
- Implementation of the paper "Fine-Tuning Transformers: Vocabulary Transfer" https://arxiv.org/pdf/2112.14569.pdf ☆20 · Updated 4 years ago