yandex / YaLM-100B
Pretrained language model with 100B parameters
☆3,757 Updated 2 years ago
Alternatives and similar repositories for YaLM-100B
Users who are interested in YaLM-100B are comparing it to the repositories listed below
- Generate images from texts. In Russian ☆1,649 Updated 2 years ago
- Russian GPT3 models. ☆2,101 Updated 2 years ago
- ☆767 Updated 4 years ago
- Fork of Facebook's LLaMA model to run on CPU ☆772 Updated 2 years ago
- YaFSDP: Yet another Fully Sharded Data Parallel ☆975 Updated 2 months ago
- A modern, scalable analytics system ☆1,522 Updated last week
- Repo for external large-scale work ☆6,543 Updated last year
- CodeGen is a family of open-source models for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex. ☆5,133 Updated 7 months ago
- Language modeling and instruction tuning for Russian ☆466 Updated last year
- YTsaurus is a scalable and fault-tolerant open-source big data platform. ☆2,075 Updated last week
- ☆118 Updated 4 years ago
- An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries ☆7,295 Updated last month
- A Stable Diffusion desktop frontend with inpainting, img2img and more! ☆1,273 Updated 2 years ago
- RuLeanALBERT is a pretrained masked language model for the Russian language that uses a memory-efficient architecture. ☆93 Updated 2 years ago
- ☆2,162 Updated last year
- Locally run an Instruction-Tuned Chat-Style LLM ☆10,212 Updated 2 years ago
- Backend that powers the dataset viewer on Hugging Face dataset pages through a public API. ☆796 Updated last month
- Code for GPT-4chan ☆637 Updated 3 years ago
- Pre-crisis Risk Management for Personal Finance ☆615 Updated 2 weeks ago
- A list of totally open alternatives to ChatGPT ☆4,696 Updated 2 years ago
- min(DALL·E) is a fast, minimal port of DALL·E Mini to PyTorch ☆3,493 Updated 4 months ago
- Stable diffusion for real-time music generation (web app) ☆2,669 Updated last year
- ☆789 Updated 2 years ago
- A simple notebook demonstrating prompt-based music generation via Mubert API ☆2,739 Updated 2 years ago
- Model parallel transformers in JAX and Haiku ☆6,351 Updated 2 years ago
- Guide to using pre-trained large language models of source code ☆1,838 Updated last year
- Chat with Meta's LLaMA models at home made easy ☆838 Updated 2 years ago
- ☆2,880 Updated last week
- Drive a browser with GPT-3 ☆1,928 Updated last year
- An unnecessarily tiny implementation of GPT-2 in NumPy. ☆3,409 Updated 2 years ago