kyegomez / GPT4o
Community Open Source Implementation of GPT4o in PyTorch
☆29Updated last week
Alternatives and similar repositories for GPT4o:
Users who are interested in GPT4o are comparing it to the libraries listed below.
- Simple Implementation of TinyGPTV in super simple Zeta lego blocks☆16Updated 5 months ago
- An open source replication of the strawberry method that leverages Monte Carlo Search with PPO and/or DPO☆29Updated this week
- Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTO…☆55Updated this week
- The open source implementation of the base model behind GPT-4 from OPENAI [Language + Multi-Modal]☆11Updated last year
- Implementation of Mind Evolution, Evolving Deeper LLM Thinking, from DeepMind☆48Updated 2 months ago
- OmniByteFormer is a generalized Transformer model that can process any type of data by converting it into byte sequences, bypassing tradi …☆10Updated this week
- ☆62Updated 3 weeks ago
- ☆16Updated last month
- A simple reproducible template to implement AI research papers☆23Updated 7 months ago
- ☆24Updated 7 months ago
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.☆20Updated 10 months ago
- The simplest, fastest repository for training/finetuning medium-sized xLSTMs.☆42Updated 11 months ago
- HuggingChat-like UI in Gradio☆72Updated last year
- RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best…☆44Updated last month
- Nexusflow function call, tool use, and agent benchmarks.☆19Updated 4 months ago
- Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper☆32Updated last month
- Tina: Tiny Reasoning Models via LoRA☆55Updated this week
- ☆37Updated 2 years ago
- ☆53Updated 10 months ago
- LLM reads a paper and produces a working prototype☆52Updated 2 weeks ago
- Mamba R1 represents a novel architecture that combines the efficiency of Mamba's state space models with the scalability of Mixture of Ex…☆20Updated this week
- The implementation of the paper: "Visualization-of-Thought Elicits Spatial Reasoning in Large Language Models"☆29Updated last year
- ☆18Updated last month
- Data preparation code for CrystalCoder 7B LLM☆44Updated 11 months ago
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…☆13Updated this week
- A repository for research on medium sized language models.☆76Updated 11 months ago
- Code and data from the paper 'Human Feedback is not Gold Standard'☆19Updated 9 months ago
- aesthetic tensor visualiser☆15Updated this week
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's PyTorch Lightning suite.☆33Updated last year
- Official repository for the paper "SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention"☆97Updated 6 months ago