kyegomez / GPT4o
Community Open Source Implementation of GPT4o in PyTorch
☆29Updated 3 weeks ago
Alternatives and similar repositories for GPT4o
Users who are interested in GPT4o are comparing it to the libraries listed below
- Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTO…☆55Updated 2 weeks ago
- Implementation of Mind Evolution, Evolving Deeper LLM Thinking, from Deepmind☆55Updated last month
- The Next Generation Multi-Modality Superintelligence☆70Updated 10 months ago
- Unofficial implementation and experiments related to Set-of-Mark (SoM) 👁️☆87Updated last year
- Lightweight toolkit package to train and fine-tune 1.58bit Language models☆80Updated 2 months ago
- ☆66Updated 3 months ago
- ☆61Updated last year
- Cerule - A Tiny Mighty Vision Model☆66Updated 10 months ago
- Finetune any model on HF in less than 30 seconds☆57Updated 3 months ago
- ☆63Updated last year
- ☆24Updated 10 months ago
- Enhancement in Multimodal Representation Learning.☆40Updated last year
- An open source replication of the strawberry method that leverages Monte Carlo Search with PPO and/or DPO☆30Updated this week
- Code and pretrained models for the paper: "MatMamba: A Matryoshka State Space Model"☆59Updated 7 months ago
- The simplest, fastest repository for training/finetuning medium-sized xLSTMs.☆41Updated last year
- ☆35Updated 2 years ago
- A repository for research on medium sized language models.☆77Updated last year
- minimal scripts for 24GB VRAM GPUs. training, inference, whatever☆41Updated last month
- Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper☆39Updated 4 months ago
- Verifiers for LLM Reinforcement Learning☆65Updated 3 months ago
- Official repository for the paper "SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention"☆98Updated 9 months ago
- XmodelLM☆39Updated 8 months ago
- A list of language models with permissive licenses such as MIT or Apache 2.0☆24Updated 4 months ago
- This repository contains the code for the paper: SirLLM: Streaming Infinite Retentive LLM☆59Updated last year
- Code for this paper "HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts via HyperNetwork"☆33Updated last year
- HuggingChat like UI in Gradio☆71Updated 2 years ago
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya☆117Updated last week
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆35Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆60Updated 10 months ago
- Small and Efficient Mathematical Reasoning LLMs☆71Updated last year