shachardon / naturally_occurring_feedbackLinks
☆14Updated 2 months ago
Alternatives and similar repositories for naturally_occurring_feedback
Users that are interested in naturally_occurring_feedback are comparing it to the libraries listed below
Sorting:
- My personal web page☆11Updated 3 months ago
- Implementation for MomentumSMoE☆19Updated 9 months ago
- ☆30Updated last year
- ☆61Updated 7 months ago
- Synthetic Data Generation for Evaluation☆13Updated 11 months ago
- Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".☆223Updated last month
- Learning to route instances for Human vs AI Feedback (ACL Main '25)☆26Updated 6 months ago
- Generate a cute welcome message for yourself each day☆22Updated 2 years ago
- 🦄 Unitxt is a Python library for enterprise-grade evaluation of AI performance, offering the world's largest catalog of tools and data …☆213Updated last week
- ☆89Updated last year
- Interpreting Learned Search and Planning: Reverse-engineering recurrent convolutional networks (DRC) that play Sokoban☆16Updated 7 months ago
- [NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs☆94Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆61Updated last year
- Complex Function Calling Benchmark.☆163Updated last year
- Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales"☆112Updated last year
- Functional Benchmarks and the Reasoning Gap☆89Updated last year
- Code for the paper "Fishing for Magikarp"☆179Updated 8 months ago
- ☆129Updated last year
- [ICLR 2025] 🚀 CodeMMLU Evaluator: A framework for evaluating LM models on CodeMMLU MCQs benchmark.☆29Updated 9 months ago
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search☆102Updated last year
- ☆120Updated last year
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆136Updated last year
- ☆54Updated last year
- A package dedicated for running benchmark agreement testing☆17Updated 4 months ago
- Code for Zero-Shot Tokenizer Transfer☆142Updated last year
- Code for ExploreTom☆90Updated 7 months ago
- codebase release for EMNLP2023 paper publication☆19Updated 4 months ago
- The first dense retrieval model that can be prompted like an LM☆90Updated 8 months ago
- Let's build better datasets, together!☆269Updated last year
- The official repo for "LLoCo: Learning Long Contexts Offline"☆118Updated last year