aksh555 / deciphering_cot
[EMNLP 2024 Findings] Code for deciphering CoT using shift ciphers
☆12Updated 3 months ago
Alternatives and similar repositories for deciphering_cot:
Users that are interested in deciphering_cot are comparing it to the libraries listed below
- Latent Large Language Models☆17Updated 6 months ago
- Fast approximate inference on a single GPU with sparsity aware offloading☆38Updated last year
- GPT-4 Level Conversational QA Trained In a Few Hours☆58Updated 6 months ago
- Data preparation code for CrystalCoder 7B LLM☆44Updated 10 months ago
- An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace.☆25Updated 8 months ago
- A library for simplifying fine tuning with multi gpu setups in the Huggingface ecosystem.☆16Updated 4 months ago
- LLM Divergent Thinking Creativity Benchmark. LLMs generate 25 unique words that start with a given letter with no connections to each oth…☆30Updated 2 weeks ago
- entropix style sampling + GUI☆25Updated 4 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated last year
- LLM plugin for models hosted by Anyscale Endpoints☆33Updated 10 months ago
- Training hybrid models for dummies.☆20Updated last month
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta☆13Updated 4 months ago
- ☆15Updated 5 months ago
- ☆10Updated 4 months ago
- LLMs as Collaboratively Edited Knowledge Bases☆44Updated last year
- ☆20Updated last year
- [WIP] Transformer to embed Danbooru labelsets☆13Updated 11 months ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆25Updated 4 months ago
- Efficient and Scalable Estimation of Tool Representations in Vector Space☆21Updated 6 months ago
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆43Updated 11 months ago
- This repository implements DSPy programs to tasks in Indian Languages☆12Updated last year
- ☆19Updated last week
- Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044☆32Updated 5 months ago
- Optimal Chunk Size for Large Document Summarization☆21Updated last year
- Code and data releases for the paper -- DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory☆37Updated last month