yandex / YaLM-100B
Pretrained language model with 100B parameters
☆3,757 Updated 2 years ago
Alternatives and similar repositories for YaLM-100B
Users who are interested in YaLM-100B are comparing it to the repositories listed below
- Generate images from texts. In Russian ☆1,648 Updated 3 years ago
- Russian GPT3 models. ☆2,096 Updated 3 years ago
- Model parallel transformers in JAX and Haiku ☆6,363 Updated 3 years ago
- Stable diffusion for real-time music generation (web app) ☆2,679 Updated last year
- A list of totally open alternatives to ChatGPT ☆4,736 Updated 2 years ago
- Code for GPT-4chan ☆634 Updated 3 years ago
- Fork of Facebook's LLaMA model to run on CPU ☆771 Updated 2 years ago
- ☆768 Updated 5 years ago
- min(DALL·E) is a fast, minimal port of DALL·E Mini to PyTorch ☆3,494 Updated 9 months ago
- Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM ☆7,876 Updated 3 months ago
- CodeGen is a family of open-source models for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex. ☆5,170 Updated 3 months ago
- Model API for GALACTICA ☆2,740 Updated 2 years ago
- A collection of libraries to optimise AI model performance ☆8,354 Updated last year
- Kandinsky 2: multilingual text2image latent diffusion model ☆2,820 Updated last year
- An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries ☆7,371 Updated last month
- Language modeling and instruction tuning for Russian ☆466 Updated last year
- An unnecessarily tiny implementation of GPT-2 in NumPy. ☆3,438 Updated 2 years ago
- An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library. ☆8,288 Updated 3 years ago
- ☆116 Updated 5 years ago
- Simple UI for LLM Model Finetuning ☆2,062 Updated 2 years ago
- Locally run an Instruction-Tuned Chat-Style LLM ☆10,190 Updated 2 years ago
- Reference implementation of the Transformer architecture optimized for Apple Neural Engine (ANE) ☆2,672 Updated 2 years ago
- High-speed download of LLaMA, Facebook's 65B parameter GPT model ☆4,146 Updated 2 years ago
- Pre-crisis Risk Management for Personal Finance ☆617 Updated 5 months ago
- Repo for external large-scale work ☆6,542 Updated last year
- StableLM: Stability AI Language Models ☆15,771 Updated last year
- ☆1,275 Updated 3 years ago
- A simple notebook demonstrating prompt-based music generation via Mubert API ☆2,741 Updated 2 years ago
- The simplest way to run LLaMA on your local machine ☆13,005 Updated last year
- 🔍 Search Engine for a Procedural Simulation of the Web with GPT-3. ☆520 Updated 3 years ago