yiyihum / da-code
☆11 Updated 2 weeks ago
Related projects
Alternatives and complementary repositories for da-code
- Evaluate the Quality of Critique ☆35 Updated 5 months ago
- ☆53 Updated 2 months ago
- L-CITEEVAL: DO LONG-CONTEXT MODELS TRULY LEVERAGE CONTEXT FOR RESPONDING? ☆17 Updated 3 weeks ago
- LongHeads: Multi-Head Attention is Secretly a Long Context Processor ☆28 Updated 7 months ago
- Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment" ☆67 Updated 5 months ago
- The official implementation of "ICDPO: Effectively Borrowing Alignment Capability of Others via In-context Direct Preference Optimization… ☆13 Updated 8 months ago
- Code for Suri: Multi-constraint instruction following for long-form text generation (EMNLP’24) ☆17 Updated this week
- ☆25 Updated last month
- BeHonest: Benchmarking Honesty in Large Language Models ☆29 Updated 2 months ago
- [ACL 2024] Code for "MoPS: Modular Story Premise Synthesis for Open-Ended Automatic Story Generation" ☆30 Updated 3 months ago
- [NAACL 2024] A Synthetic, Scalable and Systematic Evaluation Suite for Large Language Models ☆32 Updated 5 months ago
- Code and data for paper "Context-faithful Prompting for Large Language Models". ☆39 Updated last year
- Official Implementation for the paper "Integrative Decoding: Improving Factuality via Implicit Self-consistency" ☆17 Updated last month
- ☆10 Updated 3 months ago
- ☆40 Updated 11 months ago
- Evaluating Mathematical Reasoning Beyond Accuracy ☆37 Updated 7 months ago
- Code for COLING 2022 long paper: Answering Numerical Reasoning Questions in Table-Text Hybrid Contents with Graph-based Encoder and Tree-… ☆21 Updated last year
- [ICLR'24 Spotlight] "Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts" ☆59 Updated 7 months ago
- The official repository of the Omni-MATH benchmark. ☆47 Updated last week
- GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems. ☆46 Updated 4 months ago
- Source code of "Reasons to Reject? Aligning Language Models with Judgments" ☆56 Updated 8 months ago
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator" ☆48 Updated 8 months ago
- BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval ☆56 Updated 3 weeks ago
- Towards Systematic Measurement for Long Text Quality ☆28 Updated 2 months ago
- ☆29 Updated 2 weeks ago
- This is the implementation of LeCo ☆27 Updated 3 months ago
- ☆31 Updated 7 months ago
- "FiD-ICL: A Fusion-in-Decoder Approach for Efficient In-Context Learning" (ACL 2023) ☆13 Updated last year
- ☆63 Updated 5 months ago
- Trending projects & awesome papers about data-centric LLM studies. ☆32 Updated this week