cvndsh / rebus
REBUS: A Robust Evaluation Benchmark of Understanding Symbols
☆13 · Updated 7 months ago
Alternatives and similar repositories for rebus:
Users who are interested in rebus are comparing it to the repositories listed below.
- Automated Capability Discovery via Foundation Model Self-Exploration ☆42 · Updated last month
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, … ☆40 · Updated last year
- Residual Quantization Autoencoder, used for interpreting LLMs ☆11 · Updated 2 months ago
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" using PyTorch and Zeta ☆13 · Updated 4 months ago
- Elevate your language models with insightful diversity metrics. ☆11 · Updated last year
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset ☆15 · Updated last year
- Latent Large Language Models ☆17 · Updated 7 months ago
- Get language models to generate responses in a specific format reliably. Open source implementation of Synchromesh: Reliable code generat… ☆27 · Updated last year
- Generative cellular automaton-like learning environments for RL. ☆19 · Updated last month
- This repository implements DSPy programs for tasks in Indian languages ☆13 · Updated last year
- Enjoy puzzle-solving directly in your browser. ☆23 · Updated 2 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔ ☆32 · Updated this week
- ARLC, a probabilistic abductive reasoner for solving Raven's progressive matrices. ☆15 · Updated last week
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models ☆41 · Updated 9 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's PyTorch Lightning suite. ☆33 · Updated last year
- Benchmark structured generation libraries ☆26 · Updated 4 months ago
- Interpretability tools for recurrent convolutional networks (DRC) that play Sokoban ☆11 · Updated 2 weeks ago
- Never forget anything again! Combine AI and intelligent tooling for a local knowledge base to track, catalogue, annotate, and plan for you… ☆37 · Updated 10 months ago
- This is the official repository for all the code of TheoremLlama ☆39 · Updated 5 months ago
- A new way to generate large quantities of high quality synthetic data (on par with GPT-4), with better controllability, at a fraction of … ☆22 · Updated 5 months ago
- Nexusflow function call, tool use, and agent benchmarks. ☆19 · Updated 3 months ago
- Very minimal (and stateless) agent framework ☆41 · Updated 2 months ago
- LILO: Library Induction with Language Observations ☆84 · Updated 6 months ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks ☆31 · Updated 10 months ago