aksh555 / deciphering_cot
[EMNLP 2024 Findings] Code for deciphering CoT using shift ciphers
☆12Updated 3 months ago
Alternatives and similar repositories for deciphering_cot:
Users that are interested in deciphering_cot are comparing it to the libraries listed below
- Latent Large Language Models☆17Updated 5 months ago
- Training hybrid models for dummies.☆20Updated last month
- An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace.☆24Updated 7 months ago
- A library for simplifying fine tuning with multi gpu setups in the Huggingface ecosystem.☆16Updated 3 months ago
- Modified Beam Search with periodical restart☆12Updated 5 months ago
- implementation of https://arxiv.org/pdf/2312.09299☆20Updated 7 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆21Updated 2 months ago
- Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044☆32Updated 4 months ago
- Code and data releases for the paper -- DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory☆36Updated last week
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆48Updated 2 months ago
- ☆9Updated 10 months ago
- Data preparation code for CrystalCoder 7B LLM☆44Updated 9 months ago
- Nexusflow function call, tool use, and agent benchmarks.☆19Updated 2 months ago
- 🚀 Automatically convert unstructured data into a high-quality 'textbook' format, optimized for fine-tuning Large Language Models (LLMs)☆26Updated last year
- Fast approximate inference on a single GPU with sparsity aware offloading☆38Updated last year
- Measuring and Controlling Persona Drift in Language Model Dialogs☆16Updated 11 months ago
- ☆13Updated 2 months ago
- Tools for merging pretrained large language models.☆19Updated 8 months ago
- ☆48Updated 3 months ago
- ☆11Updated 3 months ago
- GoldFinch and other hybrid transformer components☆43Updated 7 months ago
- A framework for high-fidelity retrieval augmented generation in industrial knowledge bases. Integrates jargon identification, context rec…☆28Updated 6 months ago
- entropix style sampling + GUI☆25Updated 3 months ago
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya☆107Updated 2 weeks ago
- ☆42Updated 11 months ago