chi2liu / mamba-gpt-3bLinks
It is almost the best 3B model in the current open source industry, surpassing Dolly v2-3b, open lama-3b, and even outperforming the EleutherAI/pythia-12b model in terms of performance. Can refer to open_llm_leaderboard
☆13Updated last year
Alternatives and similar repositories for mamba-gpt-3b
Users that are interested in mamba-gpt-3b are comparing it to the libraries listed below
Sorting:
- ☆36Updated 2 years ago
- ☆54Updated last year
- The Next Generation Multi-Modality Superintelligence☆71Updated 10 months ago
- ☆64Updated 2 months ago
- Finetune any model on HF in less than 30 seconds☆57Updated 3 months ago
- inference code for mixtral-8x7b-32kseqlen☆100Updated last year
- fine tuning mistral 7B using Huggingface, Weights and Biases, Choline, and Vast AI☆38Updated last year
- An AI agent for interacting with a computer using the graphical user interface☆76Updated last year
- ☆31Updated last year
- ☆32Updated last year
- Mixing Language Models with Self-Verification and Meta-Verification☆106Updated 6 months ago
- ☆61Updated last year
- Official Repo for InSTA: Towards Internet-Scale Training For Agents☆48Updated this week
- 🚀 Automatically convert unstructured data into a high-quality 'textbook' format, optimized for fine-tuning Large Language Models (LLMs)☆25Updated last year
- ☆87Updated last year
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models☆69Updated last year
- Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agents☆124Updated last year
- An all-new Language Model That Processes Ultra-Long Sequences of 100,000+ Ultra-Fast☆151Updated 10 months ago
- Generate High Quality textual or multi-modal datasets with Agents☆18Updated 2 years ago
- Reasoning by Communicating with Agents☆29Updated 2 months ago
- Implementation of Mind Evolution, Evolving Deeper LLM Thinking, from Deepmind☆55Updated last month
- Github repo for Peifeng's internship project☆13Updated last year
- A data-centric AI package for ML/AI. Get the best high-quality data for the best results. Discord: https://discord.gg/t6ADqBKrdZ☆63Updated last year
- Small and Efficient Mathematical Reasoning LLMs☆71Updated last year
- Multi-Domain Expert Learning☆67Updated last year
- Develop, evaluate and monitor LLM applications at scale☆100Updated 7 months ago
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆43Updated last year
- Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA☆104Updated last month
- Weekly visualization report of Open LLM model performance based on 4 metrics.☆87Updated last year
- Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…☆168Updated last year