magicproduct / hash-hop
Long context evaluation for large language models
☆207 Updated last month
Alternatives and similar repositories for hash-hop:
Users who are interested in hash-hop are comparing it to the repositories listed below
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding. ☆171 Updated 3 months ago
- Experiments on speculative sampling with Llama models ☆125 Updated last year
- ☆71 Updated this week
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024) ☆189 Updated 10 months ago
- ☆108 Updated 4 months ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens ☆139 Updated 2 months ago
- A comprehensive repository of reasoning tasks for LLMs (and beyond) ☆432 Updated 6 months ago
- Code for exploring Based models from "Simple linear attention language models balance the recall-throughput tradeoff" ☆232 Updated 2 months ago
- EvaByte: Efficient Byte-level Language Models at Scale ☆88 Updated this week
- Manage scalable open LLM inference endpoints in Slurm clusters ☆254 Updated 9 months ago
- ☆529 Updated 8 months ago
- Repo for "LoLCATs: On Low-Rank Linearizing of Large Language Models" ☆231 Updated 2 months ago
- Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization' ☆187 Updated 4 months ago
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars… ☆320 Updated 4 months ago
- A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning ☆139 Updated this week
- The simplest, fastest repository for training/finetuning medium-sized GPTs. ☆105 Updated 5 months ago
- Draw more samples ☆189 Updated 10 months ago
- Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters ☆126 Updated 4 months ago
- Positional Skip-wise Training for Efficient Context Window Extension of LLMs to Extreme Length (ICLR 2024) ☆205 Updated 11 months ago
- 🌾 OAT: A research-friendly framework for LLM online alignment, including preference learning, reinforcement learning, etc. ☆331 Updated last week
- ☆166 Updated last week
- Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples" ☆299 Updated last year
- ☆122 Updated last month
- MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user… ☆170 Updated this week
- Repository for the paper Stream of Search: Learning to Search in Language ☆145 Updated 2 months ago
- Train your own SOTA deductive reasoning model ☆88 Updated last month
- Exploring Applications of GRPO ☆185 Updated last week
- An efficient implementation of the method proposed in "The Era of 1-bit LLMs" ☆154 Updated 6 months ago
- [NeurIPS'24] SelfCodeAlign: Self-Alignment for Code Generation ☆304 Updated 2 months ago
- Normalized Transformer (nGPT) ☆171 Updated 5 months ago