TaiMingLu / know-dont-tell
☆16 Updated 9 months ago
Alternatives and similar repositories for know-dont-tell
Users who are interested in know-dont-tell are comparing it to the libraries listed below.
- ☆17 Updated 3 months ago
- Long Context Extension and Generalization in LLMs ☆58 Updated 10 months ago
- [NeurIPS 2024 Spotlight] Code and data for the paper "Finding Transformer Circuits with Edge Pruning". ☆59 Updated 2 weeks ago
- ☆39 Updated 3 months ago
- Pitfalls of Rule- and Model-based Verifiers: A Case Study on Mathematical Reasoning. ☆22 Updated 2 months ago
- The repository of the project "Fine-tuning Large Language Models with Sequential Instructions", code base comes from open-instruct and LA… ☆29 Updated 8 months ago
- LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation ☆26 Updated last month
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024) ☆114 Updated last year
- This is an official implementation of the Reward rAnked Fine-Tuning Algorithm (RAFT), also known as iterative best-of-n fine-tuning or re… ☆33 Updated 10 months ago
- Code for ICLR 2025 Paper "What is Wrong with Perplexity for Long-context Language Modeling?" ☆92 Updated 2 weeks ago
- A Sober Look at Language Model Reasoning ☆81 Updated last month
- The rule-based evaluation subset and code implementation of Omni-MATH ☆22 Updated 7 months ago
- ☆30 Updated last year
- Implementation for the paper "The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning" ☆85 Updated last month
- The official implementation for Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free ☆46 Updated 2 months ago
- ☆18 Updated last week
- ☆59 Updated 11 months ago
- Official implementation of Bootstrapping Language Models via DPO Implicit Rewards ☆44 Updated 3 months ago
- A curated list of awesome resources dedicated to Scaling Laws for LLMs ☆77 Updated 2 years ago
- Model merging is a highly efficient approach for long-to-short reasoning. ☆77 Updated 2 months ago
- Code for Paper (Preserving Diversity in Supervised Fine-tuning of Large Language Models) ☆36 Updated 2 months ago
- [2025-TMLR] A Survey on the Honesty of Large Language Models ☆58 Updated 8 months ago
- [ICLR 25 Oral] RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style ☆58 Updated 3 weeks ago
- SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Model https://arxiv.org/pdf/2411.02433 ☆28 Updated 8 months ago
- One Network, Many Masks: Towards More Parameter-Efficient Transfer Learning ☆40 Updated 2 years ago
- Official PyTorch Implementation of EMoE: Unlocking Emergent Modularity in Large Language Models [main conference @ NAACL2024] ☆32 Updated last year
- [AAAI 2025 oral] Evaluating Mathematical Reasoning Beyond Accuracy ☆65 Updated 7 months ago
- Test-time-training on nearest neighbors for large language models ☆45 Updated last year
- [NAACL'25 Oral] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering ☆62 Updated 8 months ago
- GenRM-CoT: Data release for verification rationales ☆63 Updated 9 months ago