cvndsh / rebus
REBUS: A Robust Evaluation Benchmark of Understanding Symbols
☆13Updated 6 months ago
Alternatives and similar repositories for rebus:
Users that are interested in rebus are comparing it to the libraries listed below
- Get language models to generate responses in a specific format reliably. Open source implementation of Synchromesh: Reliable code generat…☆27Updated 11 months ago
- ☆48Updated 3 months ago
- Generative cellular automaton-like learning environments for RL.☆19Updated 3 weeks ago
- ☆38Updated 6 months ago
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset☆14Updated 11 months ago
- ☆21Updated last year
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.☆17Updated 2 weeks ago
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, …☆40Updated last year
- Elevate your language models with insightful diversity metrics.☆11Updated last year
- An OpenAI wrapper for PyReason to use in a Grid World reinforcement learning setting☆27Updated last year
- Small, simple agent task environments for training and evaluation☆18Updated 3 months ago
- Training hybrid models for dummies.☆20Updated last month
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta☆13Updated 3 months ago
- ☆20Updated 3 months ago
- Latent Large Language Models☆17Updated 5 months ago
- Minimum Description Length probing for neural network representations☆18Updated 3 weeks ago
- Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models☆50Updated 2 weeks ago
- LMQL implementation of tree of thoughts☆33Updated last year
- Transformer with Mu-Parameterization, implemented in Jax/Flax. Supports FSDP on TPU pods.☆30Updated 2 months ago
- Enjoy puzzle-solving directly in your browser.☆23Updated last month
- ☆14Updated 4 months ago
- alternative way to calculating self attention☆18Updated 8 months ago
- Repository for "I am a Strange Dataset: Metalinguistic Tests for Language Models"☆41Updated last year
- Implementation of Spectral State Space Models☆16Updated 11 months ago
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32Updated 8 months ago
- gzip Predicts Data-dependent Scaling Laws☆34Updated 8 months ago