da03 / Internalize_CoT_Step_by_Step
☆101Updated last month
Related projects ⓘ
Alternatives and complementary repositories for Internalize_CoT_Step_by_Step
- Benchmarking LLMs with Challenging Tasks from Real Users☆194Updated this week
- ☆89Updated 4 months ago
- ☆111Updated last month
- This is the official repository for Inheritune.☆105Updated last month
- A simple unified framework for evaluating LLMs☆138Updated this week
- Positional Skip-wise Training for Efficient Context Window Extension of LLMs to Extremely Length (ICLR 2024)☆200Updated 5 months ago
- ☆61Updated 2 months ago
- 🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Pap…☆106Updated 2 weeks ago
- ☆94Updated 4 months ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆129Updated last month
- Evaluating LLMs with fewer examples☆133Updated 6 months ago
- ☆49Updated 6 months ago
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)☆114Updated 6 months ago
- Code repository for the c-BTM paper☆105Updated last year
- Self-playing Adversarial Language Game Enhances LLM Reasoning, NeurIPS 2024☆95Updated this week
- Reformatted Alignment☆112Updated last month
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆46Updated 2 months ago
- Self-Alignment with Principle-Following Reward Models☆148Updated 8 months ago
- ☆99Updated 3 months ago
- Functional Benchmarks and the Reasoning Gap☆78Updated last month
- Evaluating LLMs with CommonGen-Lite☆84Updated 7 months ago
- LOFT: A 1 Million+ Token Long-Context Benchmark☆138Updated last week
- Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'☆160Updated last month
- BABILong is a benchmark for LLM evaluation using the needle-in-a-haystack approach.☆150Updated 2 months ago
- Can Language Models Solve Olympiad Programming?☆100Updated 3 months ago
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]☆124Updated 2 weeks ago
- Code for In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering☆142Updated 3 weeks ago
- Attribute (or cite) statements generated by LLMs back to in-context information.☆141Updated last month
- Code for reproducing our paper "Not All Language Model Features Are Linear"☆60Updated last month
- ☆41Updated this week