aksh555 / deciphering_cot
[EMNLP 2024 Findings] Code for deciphering CoT using shift ciphers
☆12Updated 4 months ago
Alternatives and similar repositories for deciphering_cot:
Users that are interested in deciphering_cot are comparing it to the libraries listed below
- This repository implements DSPy programs to tasks in Indian Languages☆13Updated last year
- Training hybrid models for dummies.☆20Updated 2 months ago
- A text-to-SQL prototype on the northwind sqlite dataset☆12Updated 6 months ago
- ☆15Updated 5 months ago
- A star for organising blocks and playing with transformers.☆23Updated 10 months ago
- An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace.☆25Updated 9 months ago
- OpenPipe Reinforcement Learning Experiments☆20Updated last week
- A playground to make it easy to try crazy things☆33Updated last week
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta☆13Updated 4 months ago
- Automated Capability Discovery via Foundation Model Self-Exploration☆42Updated last month
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆39Updated last month
- ☆9Updated 11 months ago
- Public Goods Game (PGG) Benchmark: Contribute & Punish is a multi-agent benchmark that tests cooperative and self-interested strategies a…☆27Updated this week
- An intelligent code optimization system leveraging AI analysis, automated refactoring, and test generation. Built with DSPy and Gradio, i…☆18Updated last month
- LLM Divergent Thinking Creativity Benchmark. LLMs generate 25 unique words that start with a given letter with no connections to each oth…☆32Updated this week
- ☆32Updated 2 weeks ago
- Nexusflow function call, tool use, and agent benchmarks.☆19Updated 3 months ago
- Submission to the inverse scaling prize☆23Updated last year
- LLMs as Collaboratively Edited Knowledge Bases☆45Updated last year
- ☆19Updated 3 weeks ago
- implementation of https://arxiv.org/pdf/2312.09299☆20Updated 8 months ago
- A library for simplifying fine tuning with multi gpu setups in the Huggingface ecosystem.☆16Updated 4 months ago
- Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044☆32Updated 5 months ago
- Really quick-and-dirty example of AI recursive learning☆26Updated 4 months ago
- Official Repo for The Paper "Talk Structurally, Act Hierarchically: A Collaborative Framework for LLM Multi-Agent Systems"☆46Updated last month
- ☆38Updated 7 months ago
- Modified Beam Search with periodical restart☆12Updated 6 months ago
- ☆20Updated last year
- Synthetic data generation and benchmark implementation for "Episodic Memories Generation and Evaluation Benchmark for Large Language Mode…☆37Updated last month
- "Syntriever: How to Train Your Retriever with Synthetic Data from LLMs" the Nations of the Americas Chapter of the Association for Comput…☆24Updated 3 weeks ago