yandex / YaLM-100B
Pretrained language model with 100B parameters
☆3,755 Updated last year
Alternatives and similar repositories for YaLM-100B:
Users who are interested in YaLM-100B are comparing it to the repositories listed below.
- Generate images from texts. In Russian ☆1,647 Updated 2 years ago
- Russian GPT3 models. ☆2,094 Updated 2 years ago
- Model parallel transformers in JAX and Haiku ☆6,327 Updated 2 years ago
- YaFSDP: Yet another Fully Sharded Data Parallel ☆960 Updated last month
- An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries ☆7,165 Updated this week
- min(DALL·E) is a fast, minimal port of DALL·E Mini to PyTorch ☆3,488 Updated 2 years ago
- 🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading ☆9,595 Updated 7 months ago
- Language modeling and instruction tuning for Russian ☆467 Updated 8 months ago
- ☆772 Updated 4 years ago
- Fork of Facebook's LLaMA model to run on CPU ☆773 Updated 2 years ago
- Running large language models on a single GPU for throughput-oriented scenarios. ☆9,310 Updated 5 months ago
- An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library. ☆8,288 Updated 3 years ago
- Repo for external large-scale work ☆6,524 Updated 11 months ago
- An unnecessarily tiny implementation of GPT-2 in NumPy. ☆3,349 Updated 2 years ago
- YTsaurus is a scalable and fault-tolerant open-source big data platform. ☆2,012 Updated this week
- 🔍 Search Engine for a Procedural Simulation of the Web with GPT-3. ☆517 Updated 2 years ago
- Irina is a Russian voice assistant that works offline. Supports skills via plugins. ☆902 Updated last month
- 🌊 A Human-in-the-Loop workflow for creating HD images from text ☆2,836 Updated last year
- Reference implementation of the Transformer architecture optimized for Apple Neural Engine (ANE) ☆2,614 Updated 2 years ago
- Locally run an Instruction-Tuned Chat-Style LLM ☆10,234 Updated 2 years ago
- RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable)… ☆13,531 Updated 2 weeks ago
- ☆983 Updated 2 years ago
- Open STT ☆795 Updated 3 years ago
- Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM ☆7,788 Updated this week
- A list of totally open alternatives to ChatGPT ☆4,622 Updated last year
- Model API for GALACTICA ☆2,718 Updated 2 years ago
- Training and serving large-scale neural networks with auto parallelization. ☆3,125 Updated last year
- High-speed download of LLaMA, Facebook's 65B parameter GPT model ☆4,165 Updated last year
- Drive a browser with GPT-3 ☆1,921 Updated 10 months ago
- Implementation of Imagen, Google's Text-to-Image Neural Network, in PyTorch ☆8,246 Updated 6 months ago