yandex / YaLM-100B
Pretrained language model with 100B parameters
☆3,754 Updated 2 years ago
Alternatives and similar repositories for YaLM-100B
Users who are interested in YaLM-100B are comparing it to the repositories listed below.
- Russian GPT3 models. ☆2,099 Updated 2 years ago
- Generate images from texts. In Russian ☆1,651 Updated 2 years ago
- min(DALL·E) is a fast, minimal port of DALL·E Mini to PyTorch ☆3,491 Updated 6 months ago
- ☆768 Updated 4 years ago
- YaFSDP: Yet another Fully Sharded Data Parallel ☆983 Updated last week
- Model parallel transformers in JAX and Haiku ☆6,354 Updated 2 years ago
- Language modeling and instruction tuning for Russian ☆466 Updated last year
- Repo for external large-scale work ☆6,546 Updated last year
- ☆117 Updated 5 years ago
- YTsaurus is a scalable and fault-tolerant open-source big data platform. ☆2,102 Updated this week
- A modern, scalable analytics system ☆1,618 Updated 2 weeks ago
- 🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading ☆9,832 Updated last year
- A Stable Diffusion desktop frontend with inpainting, img2img and more! ☆1,271 Updated 2 years ago
- Irina: a Russian voice assistant that works offline. Supports skills via plugins. ☆1,079 Updated 3 months ago
- Model API for GALACTICA ☆2,738 Updated 2 years ago
- Fork of Facebook's LLaMA model to run on CPU ☆771 Updated 2 years ago
- Simple UI for LLM Model Finetuning ☆2,064 Updated last year
- Drive a browser with GPT-3 ☆1,933 Updated last year
- ☆1,707 Updated 3 years ago
- Library for industrial alignment. ☆404 Updated last month
- RuLeanALBERT is a pretrained masked language model for the Russian language that uses a memory-efficient architecture. ☆93 Updated 2 years ago
- An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries ☆7,337 Updated last month
- Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM ☆7,872 Updated last month
- Russian-language generative chatbot with a profile and facts ☆262 Updated 2 years ago
- We are against discrediting open source software ☆621 Updated 3 weeks ago
- Kandinsky 2: multilingual text2image latent diffusion model ☆2,809 Updated last year
- High-speed download of LLaMA, Facebook's 65B parameter GPT model ☆4,154 Updated 2 years ago
- Running large language models on a single GPU for throughput-oriented scenarios. ☆9,377 Updated last year
- Chat with Meta's LLaMA models at home made easy ☆838 Updated 2 years ago
- ☆791 Updated 3 years ago