☆44Mar 29, 2023Updated 3 years ago
Alternatives and similar repositories for RWKV-LM-deepspeed
Users that are interested in RWKV-LM-deepspeed are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Run ONNX RWKV-v4 models with GPU acceleration using DirectML [Windows], or just on CPU [Windows AND Linux]; Limited to 430M model at this…☆21Mar 16, 2023Updated 3 years ago
- Framework agnostic python runtime for RWKV models☆147Aug 24, 2023Updated 2 years ago
- This project aims to make RWKV Accessible to everyone using a Hugging Face like interface, while keeping it close to the R and D RWKV bra…☆65May 14, 2023Updated 2 years ago
- tinygrad port of the RWKV large language model.☆45Mar 9, 2025Updated last year
- RWKV centralised docs for the community☆32Jan 17, 2026Updated 3 months ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- BlinkDL's RWKV-v4 running in the browser☆48Mar 2, 2023Updated 3 years ago
- A method for evaluating the high-level coherence of machine-generated texts. Identifies high-level coherence issues in transformer-based …☆11Mar 18, 2023Updated 3 years ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Aug 12, 2023Updated 2 years ago
- Script and instruction how to fine-tune large RWKV model on your data for Alpaca dataset☆30Apr 2, 2023Updated 3 years ago
- Exploring finetuning public checkpoints on filter 8K sequences on Pile☆116Mar 22, 2023Updated 3 years ago
- ☆13Jun 3, 2023Updated 2 years ago
- A converter and basic tester for rwkv onnx☆44Jan 29, 2024Updated 2 years ago
- A Structured Span Selector (NAACL 2022). A structured span selector with a WCFG for span selection tasks (coreference resolution, semanti…☆21Jul 11, 2022Updated 3 years ago
- 📖 — Notebooks related to RWKV☆58May 13, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- RWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best …☆413Jul 11, 2023Updated 2 years ago
- ☆10Oct 17, 2022Updated 3 years ago
- High level Lean 4 FFI for Rust☆14Mar 16, 2024Updated 2 years ago
- Solution for the Foursquare - Location Matching competition☆14Jul 8, 2022Updated 3 years ago
- GoldFinch and other hybrid transformer components☆46Jul 20, 2024Updated last year
- Code created for blog series on unsupervised feature/topic extraction from corporate email content. An implementation for cleaning raw e…☆10Oct 21, 2021Updated 4 years ago
- CCQA A New Web-Scale Question Answering Dataset for Model Pre-Training☆32Jul 20, 2022Updated 3 years ago
- A simple REPL for Lean 4, returning information about errors and sorries.☆12Jun 19, 2023Updated 2 years ago
- https://datahack.analyticsvidhya.com/contest/american-express-amexpert-2018/☆10Nov 29, 2018Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Anh - LAION's multilingual assistant datasets and models☆28Apr 5, 2023Updated 3 years ago
- ChatGPT-like Web UI for RWKVstic☆99Apr 18, 2023Updated 3 years ago
- The CUDA version of the RWKV language model ( https://github.com/BlinkDL/RWKV-LM )☆233Dec 10, 2025Updated 4 months ago
- possibly useful materials for learning RWKV language model.☆26Jun 8, 2023Updated 2 years ago
- Superposition prover☆17Feb 16, 2023Updated 3 years ago
- ☆14May 8, 2023Updated 2 years ago
- 😜Constrative Learning of Sentence Embedding using LoRA (EECS487 final project)☆13Apr 19, 2023Updated 3 years ago
- Codes written for some competitions☆13Dec 8, 2016Updated 9 years ago
- RWKV-7: Surpassing GPT☆104Nov 17, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆17Aug 1, 2023Updated 2 years ago
- Winners' solution approach and code for WNS Analytics Wizard 2019☆11Jul 6, 2023Updated 2 years ago
- Observe the slow deterioration of my mental sanity in the github commit history☆12May 31, 2023Updated 2 years ago
- ☆14Dec 26, 2023Updated 2 years ago
- ☆16Jul 3, 2023Updated 2 years ago
- Interpretability analysis of language model outlier and attempts to distill the model☆13May 8, 2023Updated 2 years ago
- INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model☆1,568Mar 23, 2025Updated last year