cvndsh / rebus
REBUS: A Robust Evaluation Benchmark of Understanding Symbols
☆13 Updated 9 months ago
Alternatives and similar repositories for rebus
Users who are interested in rebus are comparing it to the repositories listed below
- Minimum Description Length probing for neural network representations ☆19 Updated 4 months ago
- Elevate your language models with insightful diversity metrics. ☆11 Updated last year
- ☆20 Updated 3 weeks ago
- ☆19 Updated this week
- Benchmark structured generation libraries ☆27 Updated 7 months ago
- ☆38 Updated 10 months ago
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset ☆16 Updated last year
- Automated Capability Discovery via Foundation Model Self-Exploration ☆49 Updated 3 months ago
- Engineering the state of RNN language models (Mamba, RWKV, etc.) ☆32 Updated last year
- A framework for pitting LLMs against each other in an evolving library of games ⚔ ☆32 Updated last month
- Residual Quantization Autoencoder, used for interpreting LLMs ☆12 Updated 5 months ago
- FMS Model Optimizer is a framework for developing reduced precision neural network models. ☆20 Updated this week
- Simple repository for training small reasoning models ☆31 Updated 3 months ago
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, … ☆40 Updated last year
- gzip Predicts Data-dependent Scaling Laws ☆35 Updated last year
- This repository implements DSPy programs to tasks in Indian Languages ☆13 Updated last year
- Separation of planning concerns in ReAct-style LLM agents. Planner fine-tuning on synthetic trajectories. ☆18 Updated 10 months ago
- Small, simple agent task environments for training and evaluation ☆18 Updated 7 months ago
- Approximating the joint distribution of language models via MCTS ☆21 Updated 7 months ago
- ☆13 Updated last year
- ☆9 Updated last month
- ☆21 Updated last year
- ☆49 Updated 6 months ago
- ☆11 Updated 10 months ago
- An OpenAI wrapper for PyReason to use in a Grid World reinforcement learning setting ☆31 Updated last year
- Download, parse, and filter data from Phil Papers. Data-ready for The-Pile. ☆16 Updated last year
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback. ☆24 Updated last month
- Interpretability tools for recurrent convolutional networks (DRC) that play Sokoban ☆13 Updated 2 months ago
- ☆29 Updated last year
- LeanAgent is a novel lifelong learning framework for formal theorem proving that continuously generalizes to and improves on ever-expandi… ☆26 Updated last month