☆44Mar 29, 2023Updated 3 years ago
Alternatives and similar repositories for RWKV-LM-deepspeed
Users that are interested in RWKV-LM-deepspeed are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Framework agnostic python runtime for RWKV models☆147Aug 24, 2023Updated 2 years ago
- This project aims to make RWKV Accessible to everyone using a Hugging Face like interface, while keeping it close to the R and D RWKV bra…☆65May 14, 2023Updated 3 years ago
- tinygrad port of the RWKV large language model.☆44Mar 9, 2025Updated last year
- RWKV-v2-RNN trained on the Pile. See https://github.com/BlinkDL/RWKV-LM for details.☆67Sep 14, 2022Updated 3 years ago
- ☆32Mar 30, 2023Updated 3 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- A go wrapper around the rwkv.cpp library☆20Mar 4, 2024Updated 2 years ago
- BlinkDL's RWKV-v4 running in the browser☆48Mar 2, 2023Updated 3 years ago
- 实现Blip2RWKV+QFormer的多模态图文对话大模型,使用Two-Step Cognitive Psychology Prompt方法,仅3B参数的模型便能够出现类人因果思维链。对标MiniGPT-4,ImageBind等图文对话大语言模型,力求以更小的算力和资源实…☆42Jul 17, 2023Updated 2 years ago
- Gradio UI for RWKV LLM☆27Feb 21, 2023Updated 3 years ago
- JAX implementations of RWKV☆18Sep 26, 2023Updated 2 years ago
- Fine-tuning RWKV-World model☆26Jun 6, 2023Updated 2 years ago
- Exploring finetuning public checkpoints on filter 8K sequences on Pile☆116Mar 22, 2023Updated 3 years ago
- ☆13Jun 3, 2023Updated 2 years ago
- Enhancing LangChain prompts to work better with RWKV models☆34May 30, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A converter and basic tester for rwkv onnx☆44Jan 29, 2024Updated 2 years ago
- A Structured Span Selector (NAACL 2022). A structured span selector with a WCFG for span selection tasks (coreference resolution, semanti…☆21Jul 11, 2022Updated 3 years ago
- RWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best …☆413Jul 11, 2023Updated 2 years ago
- 一个基于Flask实现的RWKV_Role_Playing项目的API。☆32Jun 26, 2024Updated last year
- ☆10Oct 17, 2022Updated 3 years ago
- Exploring different Deep learning problems primarily (but not limited to) using FastAI2, HuggingFace and Pytorch Library☆11Jul 19, 2021Updated 4 years ago
- High level Lean 4 FFI for Rust☆14Mar 16, 2024Updated 2 years ago
- A QQ Chatbot based on RWKV (W.I.P.)☆80Nov 16, 2023Updated 2 years ago
- GoldFinch and other hybrid transformer components☆46Jul 20, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code created for blog series on unsupervised feature/topic extraction from corporate email content. An implementation for cleaning raw e…☆10Oct 21, 2021Updated 4 years ago
- CCQA A New Web-Scale Question Answering Dataset for Model Pre-Training☆32Jul 20, 2022Updated 3 years ago
- https://datahack.analyticsvidhya.com/contest/american-express-amexpert-2018/☆10Nov 29, 2018Updated 7 years ago
- Anh - LAION's multilingual assistant datasets and models☆28Apr 5, 2023Updated 3 years ago
- ChatGPT-like Web UI for RWKVstic☆99Apr 18, 2023Updated 3 years ago
- ☆10Sep 7, 2020Updated 5 years ago
- The CUDA version of the RWKV language model ( https://github.com/BlinkDL/RWKV-LM )☆232Dec 10, 2025Updated 5 months ago
- ☆14May 8, 2023Updated 3 years ago
- Superposition prover☆17Feb 16, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- RADLADS training code☆43May 7, 2025Updated last year
- RWKV-7: Surpassing GPT☆104Nov 17, 2024Updated last year
- Codes written for some competitions☆13Dec 8, 2016Updated 9 years ago
- Observe the slow deterioration of my mental sanity in the github commit history☆12May 31, 2023Updated 2 years ago
- Interpretability analysis of language model outlier and attempts to distill the model☆13May 8, 2023Updated 3 years ago
- The accompanying code for "Simplifying and Understanding State Space Models with Diagonal Linear RNNs" (Ankit Gupta, Harsh Mehta, Jonatha…☆23Dec 30, 2022Updated 3 years ago
- [NeurIPS 2023] Sparse Modular Activation for Efficient Sequence Modeling☆40Dec 2, 2023Updated 2 years ago