chi2liu / mamba-gpt-3b
It is almost the best 3B model in the current open source industry, surpassing Dolly v2-3b, open lama-3b, and even outperforming the EleutherAI/pythia-12b model in terms of performance. Can refer to open_llm_leaderboard
☆13Updated last year
Alternatives and similar repositories for mamba-gpt-3b:
Users that are interested in mamba-gpt-3b are comparing it to the libraries listed below
- inference code for mixtral-8x7b-32kseqlen☆99Updated last year
- ☆60Updated last year
- ☆52Updated 11 months ago
- ☆64Updated last week
- LLMs as Collaboratively Edited Knowledge Bases☆45Updated last year
- ☆38Updated last year
- A repository re-creating the PromptBreeder Evolutionary Algorithm from the DeepMind Paper in Python using LMQL as the backend.☆27Updated last year
- GPU accelerated client-side embeddings for vector search, RAG etc.☆66Updated last year
- LLM reads a paper and produce a working prototype☆51Updated 2 weeks ago
- Simple GRPO scripts and configurations.☆59Updated last month
- ☆53Updated 10 months ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆26Updated 4 months ago
- A data-centric AI package for ML/AI. Get the best high-quality data for the best results. Discord: https://discord.gg/t6ADqBKrdZ☆64Updated last year
- Latent Large Language Models☆17Updated 7 months ago
- FMS Model Optimizer is a framework for developing reduced precision neural network models.☆16Updated this week
- ☆73Updated last year
- ☆32Updated last year
- Eh, simple and works.☆27Updated last year
- Community Open Source Implementation of GPT4o in PyTorch☆29Updated 2 weeks ago
- Very minimal (and stateless) agent framework☆41Updated 2 months ago
- ☆54Updated last year
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.☆35Updated 11 months ago
- GPT-4 Level Conversational QA Trained In a Few Hours☆59Updated 7 months ago
- Generate High Quality textual or multi-modal datasets with Agents☆18Updated last year
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆39Updated last month
- Merge LLM that are split in to parts☆26Updated last year
- ☆46Updated 8 months ago
- ☆87Updated last year
- A repository for research on medium sized language models.☆76Updated 10 months ago
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆28Updated 3 weeks ago