allenai / wimbd
What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets
☆188Updated 2 months ago
Related projects ⓘ
Alternatives and complementary repositories for wimbd
- This project studies the performance and robustness of language models and task-adaptation methods.☆141Updated 5 months ago
- ☆166Updated last year
- A Survey on Data Selection for Language Models☆178Updated 3 weeks ago
- A framework for few-shot evaluation of autoregressive language models.☆101Updated last year
- DSIR large-scale data selection framework for language model training☆227Updated 7 months ago
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆122Updated 7 months ago
- ☆120Updated 2 months ago
- [EMNLP 2023] The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning☆212Updated last year
- Github repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models"☆114Updated last month
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]☆124Updated 2 weeks ago
- LOFT: A 1 Million+ Token Long-Context Benchmark☆138Updated last week
- Scalable training for dense retrieval models.☆270Updated last year
- Benchmarking LLMs with Challenging Tasks from Real Users☆194Updated this week
- A package to generate summaries of long-form text and evaluate the coherence of these summaries. Official package for our ICLR 2024 paper…☆105Updated last month
- Pretraining Efficiently on S2ORC!☆136Updated 2 weeks ago
- Datasets collection and preprocessings framework for NLP extreme multitask learning☆147Updated 3 months ago
- Multilingual Large Language Models Evaluation Benchmark☆105Updated 2 months ago
- [EMNLP 2023] Adapting Language Models to Compress Long Contexts☆275Updated 2 months ago
- ACL2023 - AlignScore, a metric for factual consistency evaluation.☆110Updated 7 months ago
- ☆38Updated 6 months ago
- ☆445Updated last week
- ☆66Updated 9 months ago
- Code and data for "Lost in the Middle: How Language Models Use Long Contexts"☆315Updated 10 months ago
- Inquisitive Parrots for Search☆177Updated 8 months ago
- Tk-Instruct is a Transformer model that is tuned to solve many NLP tasks by following instructions.☆177Updated 2 years ago
- Finetune mistral-7b-instruct for sentence embeddings☆70Updated 6 months ago
- Code and Data for "Long-context LLMs Struggle with Long In-context Learning"☆91Updated 4 months ago
- Official Repo for ICLR 2024 paper MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback by Xingyao Wang*, Ziha…☆104Updated 5 months ago
- Manage scalable open LLM inference endpoints in Slurm clusters☆237Updated 3 months ago
- Positional Skip-wise Training for Efficient Context Window Extension of LLMs to Extremely Length (ICLR 2024)☆200Updated 5 months ago