☆30 · May 6, 2026 · Updated last month
Alternatives and similar repositories for diversity
Users that are interested in diversity are comparing it to the libraries listed below.
- Complete set of English dialect transformation rules and evaluation code☆17Jun 7, 2024Updated 2 years ago
- Code for the paper "Greed is All You Need: An Evaluation of Tokenizer Inference Methods"☆13Nov 26, 2024Updated last year
- ROUTE: Robust Multitask Tuning and Collaboration for Text-to-SQL (ICLR 2025 Pytorch Code)☆17May 15, 2025Updated last year
- Split bib files for anthology bibliography for overleaf☆11Aug 25, 2024Updated last year
- Materials for paper "Are Large Language Models Temporally Grounded?"☆14Nov 16, 2023Updated 2 years ago
- Codebase, data and models for hallucination of pruned models☆16Jan 11, 2025Updated last year
- ACL 2021 paper "Style is NOT a single variable: Case Studies for Cross-Style Language Understanding" by Dongyeop Kang and Eduard Hovy☆15Jul 19, 2021Updated 4 years ago
- Dialogue Act classification☆18Jan 15, 2024Updated 2 years ago
- ☆20Mar 3, 2025Updated last year
- A tool for detecting moral values in social discourse☆18Apr 24, 2025Updated last year
- ☆13Jun 4, 2024Updated 2 years ago
- Download and manipulate HathiTrust wordcount data in the tidyverse☆10Jan 31, 2022Updated 4 years ago
- The Code for the EMNLP 2023 main conference paper "Prompt-based Logical Semantics Enhancement for Implicit Discourse Relation Recognition…☆13Dec 10, 2023Updated 2 years ago
- Official implementation of Data Contamination Can Cross Language Barriers☆12Sep 11, 2024Updated last year
- Official Repository for paper "Ontology-Free General-Domain Knowledge Graph-to-Text Generation Dataset Synthesis using Large Language Mod…☆15Nov 25, 2024Updated last year
- EACL 2021☆11May 4, 2021Updated 5 years ago
- A curated collection of papers and related projects on using LLMs for privacy.☆34Oct 8, 2025Updated 8 months ago
- Codebase for character-centric story understanding☆14Jan 20, 2022Updated 4 years ago
- Most basic AI Assistant demo derived from the DeepPavlov Dream AI Assistant.☆14May 22, 2023Updated 3 years ago
- ☆23Aug 10, 2022Updated 3 years ago
- Repository for Findings of EMNLP 2020 "Context-aware Stand-alone Neural Spelling Correction"☆18Dec 21, 2020Updated 5 years ago
- Implementation of the spotlight: a method for discovering systematic errors in deep learning models☆11Oct 5, 2021Updated 4 years ago
- ☆21Feb 15, 2024Updated 2 years ago
- Feature Decay Algorithms☆11Mar 5, 2014Updated 12 years ago
- ☆22Mar 31, 2022Updated 4 years ago
- ☆13Sep 28, 2020Updated 5 years ago
- Experiments on including metadata such as URLs, timestamps, website descriptions and HTML tags during pretraining.☆29Jun 12, 2023Updated 3 years ago
- [EMNLP 2024] This is the code for our paper "BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers".☆26Sep 19, 2024Updated last year
- ISWC2020 Semantic Web Challenge - Product Classification Top1 Solution☆15Nov 18, 2020Updated 5 years ago
- Code for our paper accepted at EMNLP 2023 (Findings)☆14Jan 5, 2024Updated 2 years ago
- Trains small LMs. Designed for training on SimpleStories☆14Sep 15, 2025Updated 9 months ago
- [IJCNN2025] MMSQL: Multi-turn Multi-type text-to-SQL test suit. Repository contains scripts, code, datasets in the paper "Evaluating and …☆21Jan 21, 2026Updated 5 months ago
- [ACL 2025 Main] Official Repo for Paper "Measuring Data Diversity for Instruction Tuning: A Systematic Analysis and A Reliable Metric"☆41Feb 10, 2026Updated 4 months ago
- ☆15Oct 4, 2024Updated last year
- Russian coreference resolution competition☆10Mar 24, 2023Updated 3 years ago
- A toolkit to induce interpretable workflows from raw computer-use activities.☆45Nov 13, 2025Updated 7 months ago
- Hyperparameter tuning with Optuna integrated tensor2tensor.☆10Oct 7, 2020Updated 5 years ago
- Dataset for Checking Consistency between Unstructured Notes and Structured Tables in Electronic Health Records☆30Aug 21, 2024Updated last year
- GisPy: A Tool for Measuring Gist Inference Score in Text https://aclanthology.org/2022.wnu-1.5/☆13Jul 1, 2024Updated 2 years ago