yandex / YaLM-100BLinks
Pretrained language model with 100B parameters
☆3,758Updated last year
Alternatives and similar repositories for YaLM-100B
Users that are interested in YaLM-100B are comparing it to the libraries listed below
Sorting:
- Russian GPT3 models.☆2,095Updated 2 years ago
- Model parallel transformers in JAX and Haiku☆6,338Updated 2 years ago
- Language modeling and instruction tuning for Russian☆467Updated 10 months ago
- min(DALL·E) is a fast, minimal port of DALL·E Mini to PyTorch☆3,490Updated last month
- Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple☆5,340Updated last year
- YaFSDP: Yet another Fully Sharded Data Parallel☆970Updated this week
- Locally run an Instruction-Tuned Chat-Style LLM☆10,227Updated 2 years ago
- Fork of Facebooks LLaMa model to run on CPU☆773Updated 2 years ago
- Model API for GALACTICA☆2,725Updated 2 years ago
- Ирина - русский голосовой ассистент для работы оффлайн. Поддерживает скиллы через плагины.☆927Updated last month
- An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.☆8,296Updated 3 years ago
- ☆2,147Updated last year
- Repo for external large-scale work☆6,530Updated last year
- 🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading☆9,675Updated 9 months ago
- High-speed download of LLaMA, Facebook's 65B parameter GPT model☆4,163Updated last year
- YTsaurus is a scalable and fault-tolerant open-source big data platform.☆2,050Updated this week
- Reference implementation of the Transformer architecture optimized for Apple Neural Engine (ANE)☆2,634Updated 2 years ago
- CodeGen is a family of open-source model for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.☆5,102Updated 4 months ago
- Cramming the training of a (BERT-type) language model into limited compute.☆1,336Updated last year
- Bringing stable diffusion models to web browsers. Everything runs inside the browser with no server support.☆3,668Updated last year
- Running large language models on a single GPU for throughput-oriented scenarios.☆9,338Updated 7 months ago
- ☆2,834Updated 2 weeks ago
- C++ implementation for BLOOM☆810Updated 2 years ago
- Python package for analyzing Telegram chats and finding correlations between people☆145Updated 2 years ago
- Run LLaMA (and Stanford-Alpaca) inference on Apple Silicon GPUs.☆588Updated 2 years ago
- An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries☆7,228Updated 2 weeks ago
- Fast Differentiable Tensor Library in JavaScript and TypeScript with Bun + Flashlight☆1,157Updated 11 months ago
- The simplest way to run LLaMA on your local machine☆13,072Updated last year
- The web scraping open project repository aims to share knowledge and experiences about web scraping with Python☆1,645Updated last year
- Lightweight inference library for ONNX files, written in C++. It can run Stable Diffusion XL 1.0 on a RPI Zero 2 (or in 298MB of RAM) but…☆1,956Updated last month