☆44Mar 29, 2023Updated 2 years ago
Alternatives and similar repositories for RWKV-LM-deepspeed
Users that are interested in RWKV-LM-deepspeed are comparing it to the libraries listed below
Sorting:
- Run ONNX RWKV-v4 models with GPU acceleration using DirectML [Windows], or just on CPU [Windows AND Linux]; Limited to 430M model at this…☆21Mar 16, 2023Updated 2 years ago
- Framework agnostic python runtime for RWKV models☆147Aug 24, 2023Updated 2 years ago
- ☆32Mar 30, 2023Updated 2 years ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Aug 12, 2023Updated 2 years ago
- A simple REPL for Lean 4, returning information about errors and sorries.☆12Jun 19, 2023Updated 2 years ago
- A method for evaluating the high-level coherence of machine-generated texts. Identifies high-level coherence issues in transformer-based …☆11Mar 18, 2023Updated 2 years ago
- ☆13Jun 3, 2023Updated 2 years ago
- High level Lean 4 FFI for Rust☆14Mar 16, 2024Updated last year
- RWKV centralised docs for the community☆31Jan 17, 2026Updated last month
- 📖 — Notebooks related to RWKV☆58May 13, 2023Updated 2 years ago
- Superposition prover☆17Feb 16, 2023Updated 3 years ago
- CCQA A New Web-Scale Question Answering Dataset for Model Pre-Training☆32Jul 20, 2022Updated 3 years ago
- 一个简单的,由ChatGPT主导编写的api,使用简单的请求访问ChatRWKV☆15May 19, 2023Updated 2 years ago
- Project Ivory is a minimalism PHP forum, with a clean UI for minimalists.☆71Nov 7, 2018Updated 7 years ago
- This project aims to make RWKV Accessible to everyone using a Hugging Face like interface, while keeping it close to the R and D RWKV bra…☆65May 14, 2023Updated 2 years ago
- Documenting common pitfalls and footguns in Lean☆37Aug 26, 2025Updated 6 months ago
- Plain-text declaration export for Lean 4☆27Updated this week
- 实现Blip2RWKV+QFormer的多模态图文对话大模型,使用Two-Step Cognitive Psychology Prompt方法,仅3B参数的模型便能够出现类人因果思维链。对标MiniGPT-4,ImageBind等图文对话大语言模型,力求以更小的算力和资源实…☆41Jul 17, 2023Updated 2 years ago
- Generic interface for hooking up to any Interactive Theorem Prover (ITP) and collecting data for training ML models for AI in formal theo…☆18Feb 19, 2026Updated 2 weeks ago
- Let us make Psychohistory (as in Asimov) a reality, and accessible to everyone. Useful for LLM grounding and games / fiction / business /…☆40Apr 9, 2023Updated 2 years ago
- A go wrapper around the rwkv.cpp library☆20Mar 4, 2024Updated 2 years ago
- Code for the paper "Decomposing the Enigma: Subgoal-based Demonstration Learning for Formal Theorem Proving"☆19May 25, 2023Updated 2 years ago
- The accompanying code for "Simplifying and Understanding State Space Models with Diagonal Linear RNNs" (Ankit Gupta, Harsh Mehta, Jonatha…☆23Dec 30, 2022Updated 3 years ago
- GoldFinch and other hybrid transformer components☆45Jul 20, 2024Updated last year
- RWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best …☆413Jul 11, 2023Updated 2 years ago
- A Structured Span Selector (NAACL 2022). A structured span selector with a WCFG for span selection tasks (coreference resolution, semanti…☆21Jul 11, 2022Updated 3 years ago
- JAX implementations of RWKV☆19Sep 26, 2023Updated 2 years ago
- continous batching and parallel acceleration for RWKV6☆22Jun 28, 2024Updated last year
- Posterior Control of Blackbox Generation☆23May 2, 2020Updated 5 years ago
- ChatGPT-like Web UI for RWKVstic☆100Apr 18, 2023Updated 2 years ago
- Anh - LAION's multilingual assistant datasets and models☆27Apr 5, 2023Updated 2 years ago
- Elm ports' wrapper for uncomplicated request-response-style communication☆29Apr 20, 2021Updated 4 years ago
- Gradio UI for RWKV LLM☆28Feb 21, 2023Updated 3 years ago
- Modelling the new Lead-Copper apatite proposed room temperature supeconductor☆32Aug 8, 2023Updated 2 years ago
- A finetuning pipeline for instruct tuning Raven 14bn using QLORA 4bit and the Ditty finetuning library☆28Jun 5, 2024Updated last year
- Fine-tuning RWKV-World model☆26Jun 6, 2023Updated 2 years ago
- Formalization of the Millennium Problems in Lean 4☆44Jan 16, 2026Updated last month
- Codes accompanying the paper "LaProp: a Better Way to Combine Momentum with Adaptive Gradient"☆29Jul 30, 2020Updated 5 years ago
- CuriousWall is a simple PHP forum, with a clean UI.☆197Nov 7, 2018Updated 7 years ago