yandex / YaLM-100B
Pretrained language model with 100B parameters
☆3,752 Updated last year
Alternatives and similar repositories for YaLM-100B:
Users who are interested in YaLM-100B are comparing it to the libraries listed below.
- Russian GPT3 models. ☆2,088 Updated 2 years ago
- Generate images from texts. In Russian ☆1,648 Updated 2 years ago
- min(DALL·E) is a fast, minimal port of DALL·E Mini to PyTorch ☆3,484 Updated 2 years ago
- Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple ☆5,144 Updated last year
- Fork of Facebook's LLaMA model to run on CPU ☆772 Updated last year
- Running large language models on a single GPU for throughput-oriented scenarios. ☆9,261 Updated 3 months ago
- Drive a browser with GPT-3 ☆1,913 Updated 8 months ago
- An unnecessarily tiny implementation of GPT-2 in NumPy. ☆3,311 Updated last year
- Language modeling and instruction tuning for Russian ☆466 Updated 6 months ago
- ☆1,276 Updated 2 years ago
- An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries ☆7,101 Updated this week
- Locally run an Instruction-Tuned Chat-Style LLM ☆10,236 Updated last year
- Model API for GALACTICA ☆2,701 Updated last year
- A list of totally open alternatives to ChatGPT ☆4,585 Updated last year
- Reference implementation of the Transformer architecture optimized for Apple Neural Engine (ANE) ☆2,594 Updated last year
- A collection of libraries to optimise AI model performances ☆8,371 Updated 6 months ago
- YaFSDP: Yet another Fully Sharded Data Parallel ☆900 Updated this week
- ☆773 Updated 4 years ago
- Code for GPT-4chan ☆631 Updated 2 years ago
- An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library. ☆8,270 Updated 2 years ago
- Stable diffusion for real-time music generation (web app) ☆2,628 Updated 6 months ago
- ☆1,533 Updated last year
- YTsaurus is a scalable and fault-tolerant open-source big data platform. ☆1,981 Updated this week
- Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM ☆7,752 Updated this week
- AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (N… ☆4,598 Updated 2 months ago
- Model parallel transformers in JAX and Haiku ☆6,323 Updated 2 years ago
- The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts". ☆1,305 Updated last year
- ☆4 Updated 11 months ago
- ☆121 Updated 4 years ago
- C++ implementation for BLOOM ☆810 Updated last year