chi2liu / mamba-gpt-3bLinks

It is almost the best 3B model in the current open source industry, surpassing Dolly v2-3b, open lama-3b, and even outperforming the EleutherAI/pythia-12b model in terms of performance. Can refer to open_llm_leaderboard

☆13

Alternatives and similar repositories for mamba-gpt-3b

Users that are interested in mamba-gpt-3b are comparing it to the libraries listed below

Sorting:

CogNLP / CogAGENT
☆36Updated 2 years ago
character-ai / MuKoe
☆54Updated last year
kyegomez / Kosmos-X
The Next Generation Multi-Modality Superintelligence
☆71Updated 10 months ago
sambanova / generative_data_prep
☆64Updated 2 months ago
kyegomez / Finetuning-Suite
Finetune any model on HF in less than 30 seconds
☆57Updated 3 months ago
vikhyat / mixtral-inference
inference code for mixtral-8x7b-32kseqlen
☆100Updated last year
bdytx5 / mistral7B_finetune
fine tuning mistral 7B using Huggingface, Weights and Biases, Choline, and Vast AI
☆38Updated last year
TobiasNorlund / UI-Act
An AI agent for interacting with a computer using the graphical user interface
☆76Updated last year
geronimi73 / mamba
☆31Updated last year
kailashsp / SELF-DISCOVER
☆32Updated last year
automix-llm / automix
Mixing Language Models with Self-Verification and Meta-Verification
☆106Updated 6 months ago
NousResearch / StripedHyenaTrainer
☆61Updated last year
data-for-agents / insta
Official Repo for InSTA: Towards Internet-Scale Training For Agents
☆48Updated this week
Glavin001 / Data2AITextbook
🚀 Automatically convert unstructured data into a high-quality 'textbook' format, optimized for fine-tuning Large Language Models (LLMs)
☆25Updated last year
geronimi73 / phi2-finetune
☆87Updated last year
official-elinas / zeus-llm-trainer
Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models
☆69Updated last year
agiresearch / Formal-LLM
Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agents
☆124Updated last year
kyegomez / Andromeda
An all-new Language Model That Processes Ultra-Long Sequences of 100,000+ Ultra-Fast
☆151Updated 10 months ago
Agora-Lab-AI / The-Distiller
Generate High Quality textual or multi-modal datasets with Agents
☆18Updated 2 years ago
allenai / recoma
Reasoning by Communicating with Agents
☆29Updated 2 months ago
lucidrains / mind-evolution
Implementation of Mind Evolution, Evolving Deeper LLM Thinking, from Deepmind
☆55Updated last month
facebookresearch / dual-system-for-visual-language-reasoning
Github repo for Peifeng's internship project
☆13Updated last year
discus-labs / discus
A data-centric AI package for ML/AI. Get the best high-quality data for the best results. Discord: https://discord.gg/t6ADqBKrdZ
☆63Updated last year
akjindal53244 / Arithmo
Small and Efficient Mathematical Reasoning LLMs
☆71Updated last year
huu4ontocord / MDEL
Multi-Domain Expert Learning
☆67Updated last year
zozoheir / tinyllm
Develop, evaluate and monitor LLM applications at scale
☆100Updated 7 months ago
tanyuqian / cappy
NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer
☆43Updated last year
leehanchung / lora-instruct
Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA
☆104Updated last month
dsdanielpark / open-llm-leaderboard-report
Weekly visualization report of Open LLM model performance based on 4 metrics.
☆87Updated last year
argilla-io / notus
Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…
☆168Updated last year