shirley-wu / cot_decodingLinks
☆45Updated last year
Alternatives and similar repositories for cot_decoding
Users that are interested in cot_decoding are comparing it to the libraries listed below
Sorting:
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆140Updated 4 months ago
- Easy to use, High Performant Knowledge Distillation for LLMs☆88Updated 2 months ago
- ☆52Updated last year
- Train your own SOTA deductive reasoning model☆96Updated 4 months ago
- Lightweight toolkit package to train and fine-tune 1.58bit Language models☆81Updated last month
- ☆104Updated 2 months ago
- Low-Rank adapter extraction for fine-tuned transformers models☆173Updated last year
- entropix style sampling + GUI☆26Updated 8 months ago
- ☆52Updated 8 months ago
- GPT-4 Level Conversational QA Trained In a Few Hours☆62Updated 10 months ago
- Load multiple LoRA modules simultaneously and automatically switch the appropriate combination of LoRA modules to generate the best answe…☆156Updated last year
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"☆113Updated 10 months ago
- Multi-Granularity LLM Debugger☆82Updated last week
- ☆49Updated this week
- EvaByte: Efficient Byte-level Language Models at Scale☆103Updated 2 months ago
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss.☆132Updated last year
- accompanying material for sleep-time compute paper☆97Updated 2 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆91Updated 5 months ago
- Glyphs, acting as collaboratively defined symbols linking related concepts, add a layer of multidimensional semantic richness to user-AI …☆49Updated 5 months ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆173Updated 6 months ago
- ☆88Updated 8 months ago
- Official repo for "Make Your LLM Fully Utilize the Context"☆252Updated last year
- ☆118Updated 10 months ago
- Simple examples using Argilla tools to build AI☆53Updated 7 months ago
- The first dense retrieval model that can be prompted like an LM☆80Updated 2 months ago
- an implementation of Self-Extend, to expand the context window via grouped attention☆118Updated last year
- Evaluating LLMs with CommonGen-Lite☆90Updated last year
- One Line To Build Zero-Data Classifiers in Minutes☆58Updated 9 months ago
- ☆56Updated 7 months ago
- Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache☆112Updated this week