chi2liu / mamba-gpt-3b
It is almost the best 3B model in the current open source industry, surpassing Dolly v2-3b, open lama-3b, and even outperforming the EleutherAI/pythia-12b model in terms of performance. Can refer to open_llm_leaderboard
☆13Updated last year
Alternatives and similar repositories for mamba-gpt-3b:
Users that are interested in mamba-gpt-3b are comparing it to the libraries listed below
- ☆53Updated last year
- ☆37Updated 2 years ago
- Eh, simple and works.☆27Updated last year
- inference code for mixtral-8x7b-32kseqlen☆99Updated last year
- Train your own SOTA deductive reasoning model☆88Updated last month
- an implementation of Self-Extend, to expand the context window via grouped attention☆119Updated last year
- ☆53Updated 10 months ago
- Codebase for the arxiver dataset☆14Updated 4 months ago
- Simple GRPO scripts and configurations.☆58Updated 2 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆39Updated 2 months ago
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback.☆22Updated 3 weeks ago
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆29Updated this week
- The Next Generation Multi-Modality Superintelligence☆71Updated 7 months ago
- A library for squeakily cleaning and filtering language datasets.☆47Updated last year
- ☆61Updated last year
- Very minimal (and stateless) agent framework☆42Updated 3 months ago
- Simple Implementation of TinyGPTV in super simple Zeta lego blocks☆16Updated 5 months ago
- ☆55Updated 3 months ago
- Low-Rank Adaptation of Large Language Models clean implementation☆8Updated last year
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated last year
- ☆38Updated last year
- ☆64Updated this week
- Small and Efficient Mathematical Reasoning LLMs☆71Updated last year
- Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…☆167Updated last year
- Community Open Source Implementation of GPT4o in PyTorch☆29Updated last week
- Simple Model Similarities Analysis☆21Updated last year
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.☆36Updated 11 months ago
- LLMs as Collaboratively Edited Knowledge Bases☆45Updated last year
- LLM reads a paper and produce a working prototype☆52Updated 2 weeks ago
- Lego for GRPO☆27Updated 3 weeks ago