MingLiiii / Layer_Gradient
What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective
☆40Updated 3 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for Layer_Gradient
- Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps"☆109Updated 3 months ago
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models☆103Updated 6 months ago
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)☆115Updated last week
- ☆90Updated 4 months ago
- The official implementation of "Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks"☆50Updated 7 months ago
- Code and example data for the paper: Rule Based Rewards for Language Model Safety☆158Updated 4 months ago
- SuperCorrect: Supervising and Correcting Language Models with Error-Driven Insights☆35Updated last month
- ☆63Updated last month
- The official implementation of Self-Exploring Language Models (SELM)☆55Updated 5 months ago
- [NeurIPS 2024] A task generation and model evaluation system for multimodal language models.☆56Updated last month
- Benchmarking LLMs with Challenging Tasks from Real Users☆195Updated 2 weeks ago
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.☆63Updated last month
- Conifer: Improving Complex Constrained Instruction-Following Ability of Large Language Models☆80Updated 7 months ago
- This is the official repository for Inheritune.☆105Updated last month
- "Exploring Concept Depth: How Large Language Models Acquire Knowledge at Different Layers?"☆58Updated last month
- Exploring Model Kinship for Merging Large Language Models☆20Updated last week
- InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales☆54Updated last week
- ☆128Updated 5 months ago
- Code for In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering☆144Updated last month
- HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models☆30Updated this week
- The Paper List on Data Contamination for Large Language Models Evaluation.☆76Updated this week
- [NeurIPS 2024 D&B Track] GTA: A Benchmark for General Tool Agents☆46Updated 2 weeks ago
- Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales"☆96Updated last month
- Code and Data for "Long-context LLMs Struggle with Long In-context Learning"☆91Updated 4 months ago
- ☆151Updated 9 months ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆75Updated last month
- A simple unified framework for evaluating LLMs☆145Updated last week
- ☆103Updated last month
- ☆46Updated 2 weeks ago
- Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment"☆68Updated 5 months ago