mmhamdy / open-language-models
A list of language models with permissive licenses such as MIT or Apache 2.0
☆24Updated last month
Alternatives and similar repositories for open-language-models:
Users that are interested in open-language-models are comparing it to the libraries listed below
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆21Updated 8 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated last year
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆29Updated this week
- OmegaViT (ΩViT) is a cutting-edge vision transformer architecture that combines multi-query attention, rotary embeddings, state space mod…☆14Updated last week
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆19Updated 6 months ago
- Modified Beam Search with periodical restart☆12Updated 7 months ago
- Code, results and other artifacts from the paper introducing the WildChat-50m dataset and the Re-Wild model family.☆29Updated last week
- implementation of https://arxiv.org/pdf/2312.09299☆20Updated 9 months ago
- Using multiple LLMs for ensemble Forecasting☆16Updated last year
- BH hackathon☆14Updated last year
- ☆16Updated last month
- Train, tune, and infer Bamba model☆88Updated 2 months ago
- ☆20Updated 10 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆21Updated 4 months ago
- Zeta implementation of a reusable and plug in and play feedforward from the paper "Exponentially Faster Language Modeling"☆16Updated 5 months ago
- ☆17Updated this week
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.☆20Updated 10 months ago
- Visualize multi-model embedding spaces. The first goal is to quickly get a lay of the land of any embedding space. Then be able to scroll…☆27Updated 10 months ago
- Official implementation of ECCV24 paper: POA☆24Updated 8 months ago
- alternative way to calculating self attention☆18Updated 10 months ago
- https://x.com/BlinkDL_AI/status/1884768989743882276☆27Updated 2 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆39Updated 2 months ago
- ☆48Updated 5 months ago
- ☆14Updated 2 weeks ago
- Lightweight package that tracks and summarizes code changes using LLMs (Large Language Models)☆32Updated last month
- ☆17Updated 2 weeks ago
- ☆29Updated last year
- ☆16Updated 6 months ago
- Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812)☆27Updated last month
- Notebook and Scripts that showcase running quantized diffusion models on consumer GPUs☆38Updated 5 months ago