aksh555 / deciphering_cot
[EMNLP 2024 Findings] Code for deciphering CoT using shift ciphers
☆12Updated 6 months ago
Alternatives and similar repositories for deciphering_cot
Users that are interested in deciphering_cot are comparing it to the libraries listed below
Sorting:
- Training hybrid models for dummies.☆21Updated 4 months ago
- An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace.☆26Updated 10 months ago
- This repository implements DSPy programs to tasks in Indian Languages☆13Updated last year
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta☆13Updated 6 months ago
- A text-to-SQL prototype on the northwind sqlite dataset☆12Updated 7 months ago
- Latent Large Language Models☆18Updated 8 months ago
- Simple GRPO scripts and configurations.☆58Updated 3 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆53Updated 3 months ago
- A library for simplifying fine tuning with multi gpu setups in the Huggingface ecosystem.☆16Updated 6 months ago
- MPI Code Generation through Domain-Specific Language Models☆14Updated 5 months ago
- Verifiers for LLM Reinforcement Learning☆50Updated last month
- LLM Divergent Thinking Creativity Benchmark. LLMs generate 25 unique words that start with a given letter with no connections to each oth…☆32Updated last month
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback.☆23Updated last month
- Public Goods Game (PGG) Benchmark: Contribute & Punish is a multi-agent benchmark that tests cooperative and self-interested strategies a…☆36Updated last month
- GPT-4 Level Conversational QA Trained In a Few Hours☆61Updated 8 months ago
- A novel approach for transformer model introspection that enables saving, compressing, and manipulating internal thought states for advan…☆19Updated last month
- Nexusflow function call, tool use, and agent benchmarks.☆19Updated 5 months ago
- Trying to deconstruct RWKV in understandable terms☆14Updated 2 years ago
- ☆20Updated 2 months ago
- ☆27Updated 2 weeks ago
- An intelligent code optimization system leveraging AI analysis, automated refactoring, and test generation. Built with DSPy and Gradio, i…☆18Updated 3 months ago
- a version of baby agi using dspy and typed predictors☆17Updated last year
- The original BabyAGI, updated with LiteLLM and no vector database reliance (csv instead)☆21Updated 7 months ago
- LLM plugin for models hosted by Anyscale Endpoints☆33Updated last year
- ☆45Updated last year
- Rust bindings for CTranslate2☆14Updated last year
- ☆43Updated 3 months ago
- ☆10Updated 6 months ago
- Fast approximate inference on a single GPU with sparsity aware offloading☆38Updated last year
- NanoGPT (124M) quality in 2.67B tokens☆28Updated 2 weeks ago