lamm-mit / LifeGPT
☆51 · Updated last month
Alternatives and similar repositories for LifeGPT:
Users who are interested in LifeGPT are comparing it to the repositories listed below:
- Optimizing causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna ☆39 · Updated 2 months ago
- ☆17 · Updated 6 months ago
- Very minimal (and stateless) agent framework ☆42 · Updated 3 months ago
- Alternative way of calculating self-attention ☆18 · Updated 10 months ago
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback. ☆22 · Updated 2 weeks ago
- Repository for "Traveling Waves Integrate Spatial Information Through Time" ☆50 · Updated last month
- Generative cellular automaton-like learning environments for RL. ☆19 · Updated 2 months ago
- ☆46 · Updated last month
- Synthetic data generation and benchmark implementation for "Episodic Memories Generation and Evaluation Benchmark for Large Language Mode… ☆39 · Updated this week
- ☆38 · Updated 8 months ago
- Using multiple LLMs for ensemble forecasting ☆16 · Updated last year
- Implementation of Spectral State Space Models ☆16 · Updated last year
- An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace. ☆25 · Updated 9 months ago
- Graph-Aware Attention for Adaptive Dynamics in Transformers ☆58 · Updated 3 months ago
- QAlign is a new test-time alignment approach that improves language model performance by using Markov chain Monte Carlo methods. ☆21 · Updated last week
- Agentic Knowledgeable Self-awareness ☆47 · Updated this week
- Flexible, efficient, and context-aware generation from large unstructured knowledge sources. ☆16 · Updated 11 months ago
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset ☆16 · Updated last year
- ☆16 · Updated last month
- ☆18 · Updated last month
- Code for ☆27 · Updated 4 months ago
- BH hackathon ☆14 · Updated last year
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" in PyTorch and Zeta ☆13 · Updated 5 months ago
- Latent Large Language Models ☆17 · Updated 7 months ago
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers ☆17 · Updated last month
- ☆11 · Updated 8 months ago
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models ☆41 · Updated 10 months ago
- ☆50 · Updated 4 months ago
- The official evaluation suite and dynamic data release for MixEval. ☆11 · Updated 6 months ago
- NanoGPT (124M) quality in 2.67B tokens ☆28 · Updated this week