☆44Mar 29, 2023Updated 2 years ago
Alternatives and similar repositories for RWKV-LM-deepspeed
Users that are interested in RWKV-LM-deepspeed are comparing it to the libraries listed below
Sorting:
- Run ONNX RWKV-v4 models with GPU acceleration using DirectML [Windows], or just on CPU [Windows AND Linux]; Limited to 430M model at this…☆21Mar 16, 2023Updated 2 years ago
- Framework agnostic python runtime for RWKV models☆147Aug 24, 2023Updated 2 years ago
- ☆32Mar 30, 2023Updated 2 years ago
- A simple REPL for Lean 4, returning information about errors and sorries.☆12Jun 19, 2023Updated 2 years ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Aug 12, 2023Updated 2 years ago
- A method for evaluating the high-level coherence of machine-generated texts. Identifies high-level coherence issues in transformer-based …☆11Mar 18, 2023Updated 2 years ago
- tinygrad port of the RWKV large language model.☆45Mar 9, 2025Updated last year
- High level Lean 4 FFI for Rust☆14Mar 16, 2024Updated last year
- JavaScript port of the path tracing algorithm from Peter Shirley's "Ray Tracing in One Weekend"☆11Jul 5, 2016Updated 9 years ago
- ☆13Jun 3, 2023Updated 2 years ago
- Superposition prover☆17Feb 16, 2023Updated 3 years ago
- 📖 — Notebooks related to RWKV☆58May 13, 2023Updated 2 years ago
- CCQA A New Web-Scale Question Answering Dataset for Model Pre-Training☆32Jul 20, 2022Updated 3 years ago
- 一个简单的,由ChatGPT主导编写的api,使用简单的请求访问ChatRWKV☆15May 19, 2023Updated 2 years ago
- Plain-text declaration export for Lean 4☆27Updated this week
- ☆16Jul 3, 2023Updated 2 years ago
- This project aims to make RWKV Accessible to everyone using a Hugging Face like interface, while keeping it close to the R and D RWKV bra…☆65May 14, 2023Updated 2 years ago
- Documenting common pitfalls and footguns in Lean☆37Aug 26, 2025Updated 6 months ago
- 实现Blip2RWKV+QFormer的多模态图文对话大模型,使用Two-Step Cognitive Psychology Prompt方法,仅3B参数的模型便能够出现类人因果思维链。对标MiniGPT-4,ImageBind等图文对话大语言模型,力求以更小的算力和资源实…☆41Jul 17, 2023Updated 2 years ago
- Generic interface for hooking up to any Interactive Theorem Prover (ITP) and collecting data for training ML models for AI in formal theo…☆18Feb 19, 2026Updated 2 weeks ago
- Let us make Psychohistory (as in Asimov) a reality, and accessible to everyone. Useful for LLM grounding and games / fiction / business /…☆40Apr 9, 2023Updated 2 years ago
- A go wrapper around the rwkv.cpp library☆20Mar 4, 2024Updated 2 years ago
- The accompanying code for "Simplifying and Understanding State Space Models with Diagonal Linear RNNs" (Ankit Gupta, Harsh Mehta, Jonatha…☆23Dec 30, 2022Updated 3 years ago
- GoldFinch and other hybrid transformer components☆45Jul 20, 2024Updated last year
- A Structured Span Selector (NAACL 2022). A structured span selector with a WCFG for span selection tasks (coreference resolution, semanti…☆21Jul 11, 2022Updated 3 years ago
- continous batching and parallel acceleration for RWKV6☆22Jun 28, 2024Updated last year
- ChatGPT-like Web UI for RWKVstic☆100Apr 18, 2023Updated 2 years ago
- Anh - LAION's multilingual assistant datasets and models☆27Apr 5, 2023Updated 2 years ago
- LeelaZero + PhoenixGo's weights☆20Nov 13, 2018Updated 7 years ago
- A finetuning pipeline for instruct tuning Raven 14bn using QLORA 4bit and the Ditty finetuning library☆28Jun 5, 2024Updated last year
- Fine-tuning RWKV-World model☆26Jun 6, 2023Updated 2 years ago
- Gradio UI for RWKV LLM☆28Feb 21, 2023Updated 3 years ago
- Formalization of the Millennium Problems in Lean 4☆44Jan 16, 2026Updated last month
- Codes accompanying the paper "LaProp: a Better Way to Combine Momentum with Adaptive Gradient"☆29Jul 30, 2020Updated 5 years ago
- ☆13Oct 5, 2025Updated 5 months ago
- ☆35Apr 12, 2024Updated last year
- ☆34Jul 21, 2024Updated last year
- BANG is a new pretraining model to Bridge the gap between Autoregressive (AR) and Non-autoregressive (NAR) Generation. AR and NAR generat…☆28Feb 6, 2022Updated 4 years ago
- A Learnable LSH Framework for Efficient NN Training☆34Jul 22, 2021Updated 4 years ago