GeniusHTX / TALE
☆76Updated this week
Alternatives and similar repositories for TALE:
Users that are interested in TALE are comparing it to the libraries listed below
- SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Model https://arxiv.org/pdf/2411.02433☆23Updated 3 months ago
- Code for ICLR 2025 Paper "What is Wrong with Perplexity for Long-context Language Modeling?"☆44Updated last month
- [ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction☆61Updated 3 weeks ago
- Long Context Extension and Generalization in LLMs☆50Updated 6 months ago
- A Lightweight Visual Reasoning Benchmark for Evaluating Large Multimodal Models through Complex Diagrams in Coding Tasks☆6Updated 3 weeks ago
- ☆83Updated last week
- official implementation of paper "Process Reward Model with Q-value Rankings"☆51Updated last month
- ☆65Updated 4 months ago
- [NeurIPS'24] Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models☆56Updated 3 months ago
- [ICLR'24] RAIN: Your Language Models Can Align Themselves without Finetuning☆90Updated 10 months ago
- [NeurIPS 2024] Fast Best-of-N Decoding via Speculative Rejection☆40Updated 4 months ago
- Training and Benchmarking LLMs for Code Preference.☆33Updated 4 months ago
- This is the official repo of "QuickLLaMA: Query-aware Inference Acceleration for Large Language Models"☆45Updated 8 months ago
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.☆101Updated this week
- An official implementation of "Catastrophic Failure of LLM Unlearning via Quantization" (ICLR 2025)☆26Updated last month
- ☆59Updated 6 months ago
- The official repository of the Omni-MATH benchmark.☆74Updated 3 months ago
- [ICLR 2025] Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates (Oral)☆74Updated 5 months ago
- e☆25Updated this week
- Large Language Models Can Self-Improve in Long-context Reasoning☆66Updated 3 months ago