kyegomez / OpenStrawberry
An open-source replication of the strawberry method that leverages Monte Carlo Search with PPO and/or DPO
☆20 Updated last week
Related projects
Alternatives and complementary repositories for OpenStrawberry
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant … ☆14 Updated 8 months ago
- Implementation of https://arxiv.org/pdf/2312.09299 ☆19 Updated 4 months ago
- Training hybrid models for dummies. ☆15 Updated 2 weeks ago
- Latent Large Language Models ☆16 Updated 2 months ago
- My personal implementation of the model from "Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities", they haven't rel… ☆11 Updated 9 months ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min… ☆23 Updated last week
- Official repository for the paper "Approximating Two-Layer Feedforward Networks for Efficient Transformers" ☆36 Updated 11 months ago
- Alpha-Zero Connect Four NN trained via self-play ☆13 Updated last month
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" in PyTorch and Zeta ☆13 Updated this week
- Zeta implementation of a reusable, plug-and-play feedforward from the paper "Exponentially Faster Language Modeling" ☆15 Updated this week
- Implementation of Spectral State Space Models ☆17 Updated 8 months ago
- My implementation of Q-Sparse: All Large Language Models can be Fully Sparsely-Activated ☆30 Updated 2 months ago
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM ☆49 Updated 7 months ago
- [ICML 24 NGSM workshop] Associative Recurrent Memory Transformer implementation and scripts for training and evaluating ☆31 Updated this week
- CogNetX is an advanced, multimodal neural network architecture inspired by human cognition. It integrates speech, vision, and video proce… ☆12 Updated last week
- The simplest, fastest repository for training/finetuning medium-sized xLSTMs. ☆38 Updated 5 months ago
- ☆49 Updated 7 months ago
- A list of language models with permissive licenses such as MIT or Apache 2.0 ☆22 Updated last week
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset ☆13 Updated 8 months ago
- Genetics for Language Models ☆11 Updated 4 months ago
- Lottery Ticket Adaptation ☆36 Updated last month
- ☆11 Updated 3 weeks ago
- ☆31 Updated 2 months ago
- ☆40 Updated this week
- ☆43 Updated 3 months ago
- An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace. ☆21 Updated 4 months ago
- MPI Code Generation through Domain-Specific Language Models ☆13 Updated 8 months ago
- GoldFinch and other hybrid transformer components ☆39 Updated 3 months ago
- Collection of autoregressive model implementations ☆66 Updated last week
- A forest of autonomous agents. ☆18 Updated this week