kyegomez / GPT3
An implementation of the base GPT-3 Model architecture from the paper by OPENAI "Language Models are Few-Shot Learners"
☆16Updated 6 months ago
Alternatives and similar repositories for GPT3:
Users that are interested in GPT3 are comparing it to the libraries listed below
- The open source implementation of the base model behind GPT-4 from OPENAI [Language + Multi-Modal]☆11Updated last year
- Collection of autoregressive model implementation☆76Updated last week
- Community Implementation of the paper: "Multi-Head Mixture-of-Experts" In PyTorch☆20Updated 2 months ago
- Train a production grade GPT in less than 400 lines of code. Better than Karpathy's verison and GIGAGPT☆17Updated this week
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM☆50Updated 9 months ago
- Unofficial Implementation of Evolutionary Model Merging☆33Updated 9 months ago
- ⚡ OVM for Planning in Mathematical Reasoning☆10Updated 10 months ago
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆26Updated this week
- This is a simple torch implementation of the high performance Multi-Query Attention☆16Updated last year
- Prune transformer layers☆67Updated 7 months ago
- ☆69Updated 5 months ago
- Implementation of Adepts Fuyu all-new Multi-Modality model in pytorch☆24Updated 2 months ago
- Here we will test various linear attention designs.☆58Updated 8 months ago
- The Next Generation Multi-Modality Superintelligence☆70Updated 4 months ago
- HGRN2: Gated Linear RNNs with State Expansion☆52Updated 4 months ago
- Official Implementation Of The Paper: `DeciMamba: Exploring the Length Extrapolation Potential of Mamba'☆22Updated 5 months ago
- Implementation of the Mamba SSM with hf_integration.☆56Updated 4 months ago