allenai / DataDecideLinks
☆33Updated last month
Alternatives and similar repositories for DataDecide
Users that are interested in DataDecide are comparing it to the libraries listed below
Sorting:
- ☆81Updated last week
- Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".☆216Updated 2 months ago
- ☆72Updated last year
- [NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs☆92Updated 10 months ago
- Functional Benchmarks and the Reasoning Gap☆89Updated last year
- ☆77Updated last week
- This is the official repository for Inheritune.☆113Updated 7 months ago
- Evaluating LLMs with fewer examples☆161Updated last year
- ☆123Updated 7 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆59Updated last year
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆184Updated 6 months ago
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling☆34Updated last month
- Benchmarking LLMs with Challenging Tasks from Real Users☆241Updated 11 months ago
- [ICLR 2025] Monet: Mixture of Monosemantic Experts for Transformers☆73Updated 3 months ago
- Learning to Retrieve by Trying - Source code for Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval☆51Updated 11 months ago
- Replicating O1 inference-time scaling laws☆90Updated 10 months ago
- The official repo for "LLoCo: Learning Long Contexts Offline"☆116Updated last year
- ☆127Updated last year
- ☆99Updated 10 months ago
- ☆104Updated last year
- ☆85Updated last year
- Organize the Web: Constructing Domains Enhances Pre-Training Data Curation☆64Updated 5 months ago
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)☆143Updated 10 months ago
- From GaLore to WeLore: How Low-Rank Weights Non-uniformly Emerge from Low-Rank Gradients. Ajay Jaiswal, Lu Yin, Zhenyu Zhang, Shiwei Liu,…☆48Updated 5 months ago
- Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales"☆108Updated last year
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"☆89Updated last year
- Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps"☆130Updated last year
- Systematic evaluation framework that automatically rates overthinking behavior in large language models.☆93Updated 4 months ago
- Code for reproducing our paper "Not All Language Model Features Are Linear"☆80Updated 10 months ago
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…☆75Updated last year