Qualcomm-AI-research / codeitLinks
☆27Updated last year
Alternatives and similar repositories for codeit
Users that are interested in codeit are comparing it to the libraries listed below
Sorting:
- Official Code Repository for EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents (COLM 2024)☆38Updated last year
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Updated 2 years ago
- OMNI: Open-endedness via Models of human Notions of Interestingness☆57Updated 10 months ago
- Solving the Abstraction & Reasoning Corpus with DreamCoder☆53Updated last year
- Code for☆27Updated 11 months ago
- Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models☆65Updated 9 months ago
- A testbed for agents and environments that can automatically improve models through data generation.☆27Updated 9 months ago
- Official repository for the paper "Approximating Two-Layer Feedforward Networks for Efficient Transformers"☆38Updated 5 months ago
- GoldFinch and other hybrid transformer components☆45Updated last year
- Code for Discovering Preference Optimization Algorithms with and for Large Language Models☆64Updated last year
- Minimum Description Length probing for neural network representations☆20Updated 10 months ago
- Sparse and discrete interpretability tool for neural networks☆64Updated last year
- Efficient World Models with Context-Aware Tokenization. ICML 2024☆114Updated last year
- SCREWS: A Modular Framework for Reasoning with Revisions☆27Updated 2 years ago
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆41Updated last year
- ☆56Updated last year
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers☆23Updated 9 months ago
- A Data Source for Reasoning Embodied Agents☆19Updated 2 years ago
- ☆24Updated 8 months ago
- ☆128Updated last year
- Official implementation of Regularized Policy Gradient (RPG) (https://arxiv.org/abs/2505.17508)☆54Updated last month
- implementation of dualformer☆24Updated 9 months ago
- Pytorch implementation of the Gato paper from Deepmind☆12Updated 2 years ago
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32Updated last year
- Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients☆26Updated last year
- Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"☆27Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆60Updated last year
- Official repo for Learning to Reason for Long-Form Story Generation☆72Updated 7 months ago
- Repository for the paper Stream of Search: Learning to Search in Language☆151Updated 10 months ago
- Memoria is a human-inspired memory architecture for neural networks.☆78Updated last year