Data-Provenance-Initiative / Data-Provenance-Collection
☆196Updated 2 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for Data-Provenance-Collection
- What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets☆188Updated 2 months ago
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆122Updated 7 months ago
- Code accompanying "How I learned to start worrying about prompt formatting".☆92Updated last month
- [ICLR 2024 Spotlight] FLASK: Fine-grained Language Model Evaluation based on Alignment Skill Sets☆210Updated 10 months ago
- Datasets collection and preprocessings framework for NLP extreme multitask learning☆147Updated 4 months ago
- This project studies the performance and robustness of language models and task-adaptation methods.☆141Updated 5 months ago
- A framework for few-shot evaluation of autoregressive language models.☆101Updated last year
- The Official Repository for "Bring Your Own Data! Self-Supervised Evaluation for Large Language Models"☆109Updated last year
- The GitHub repo for Goal Driven Discovery of Distributional Differences via Language Descriptions☆68Updated last year
- Benchmarking LLMs with Challenging Tasks from Real Users☆194Updated last week
- ☆258Updated last month
- Pretraining Efficiently on S2ORC!☆136Updated 2 weeks ago
- Manage scalable open LLM inference endpoints in Slurm clusters☆237Updated 3 months ago
- Code and data accompanying our paper on arXiv "Faithful Chain-of-Thought Reasoning".☆156Updated 6 months ago
- This is the repo for the paper Shepherd -- A Critic for Language Model Generation☆211Updated last year
- datasets from the paper "Towards Understanding Sycophancy in Language Models"☆62Updated last year
- Code for In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering☆142Updated 3 weeks ago
- Evaluating LLMs with fewer examples☆133Updated 6 months ago
- ☆445Updated last week
- A Survey on Data Selection for Language Models☆178Updated 3 weeks ago
- ☆149Updated 10 months ago
- AI Logging for Interpretability and Explainability🔬☆87Updated 5 months ago
- Functional Benchmarks and the Reasoning Gap☆78Updated last month
- ☆85Updated 5 months ago
- [ICLR 2024 & NeurIPS 2023 WS] An Evaluator LM that is open-source, offers reproducible evaluation, and inexpensive to use. Specifically d…☆286Updated 11 months ago
- Scaling Data-Constrained Language Models☆321Updated last month
- Website for hosting the Open Foundation Models Cheat Sheet.☆255Updated 4 months ago
- A toolkit for describing model features and intervening on those features to steer behavior.☆69Updated this week
- Erasing concepts from neural representations with provable guarantees☆208Updated 3 weeks ago
- RAGElo is a set of tools that helps you selecting the best RAG-based LLM agents by using an Elo ranker☆105Updated last week