☆17Aug 1, 2023Updated 2 years ago
Alternatives and similar repositories for RWKV-Pytorch
Users that are interested in RWKV-Pytorch are comparing it to the libraries listed below.
- This is an inference framework for the RWKV large language model implemented purely in native PyTorch. The official native implementation…☆133Jul 20, 2024Updated last year
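- Script and instructions on how to fine-tune a large RWKV model on your data for the Alpaca dataset☆31Apr 2, 2023Updated 2 years ago
- Implements a Blip2RWKV+QFormer multimodal image-text dialogue large model; using the Two-Step Cognitive Psychology Prompt method, a model with only 3B parameters can exhibit human-like causal chains of thought. Benchmarked against image-text dialogue large language models such as MiniGPT-4 and ImageBind, it aims to use less compute and fewer resources to…☆42Jul 17, 2023Updated 2 years ago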
- ☆32Mar 30, 2023Updated 2 years ago
- Fine-tuning RWKV-World model☆26Jun 6, 2023Updated 2 years ago
- Gradio UI for RWKV LLM☆27Feb 21, 2023Updated 3 years ago
- ☆10Feb 21, 2023Updated 3 years ago
- ☆13Jan 22, 2025Updated last year
- A semantic segmentation method for high-resolution images☆12Jul 1, 2022Updated 3 years ago
- Neural Processing Letters: End-to-End Entity Detection with Proposer and Regressor☆12Jun 6, 2023Updated 2 years ago
- ☆10Jun 10, 2023Updated 2 years ago
- ☆44Mar 29, 2023Updated 2 years ago
- ☆10May 1, 2023Updated 2 years ago
- VQ-TR repository☆12Apr 18, 2024Updated last year
- Use ChatGPT for free without registration; follow the WeChat official account 【胖竹同学】.☆10Apr 4, 2023Updated 2 years ago
- Fine-grained Chinese named entity recognition dataset processing: converts JSON data into BIOES-tagged data. CLUENER dataset preprocessing☆11Jun 29, 2020Updated 5 years ago
- This is the official implementation of TAGCOS: Task-agnostic Gradient Clustered Coreset Selection for Instruction Tuning Data☆14Jul 21, 2024Updated last year
- RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best …☆10Nov 3, 2023Updated 2 years ago
- ☆12Apr 19, 2024Updated last year
- An implementation of DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs☆16May 2, 2018Updated 7 years ago
- [ICML 2024] PyTorch implementation for "Diversified Batch Selection for Training Acceleration"☆10Jul 30, 2024Updated last year
- ☆13Aug 8, 2024Updated last year
- PyTorch implementation of "Segmenter: Transformer for Semantic Segmentation" Strudel et al. (2021)☆18May 23, 2021Updated 4 years ago
- CCKS2023-PromptCBLUE: Code implementation for the TianChi competition☆20Feb 27, 2024Updated 2 years ago
- A PyTorch implementation of DANet based on CVPR 2019 paper "Dual Attention Network for Scene Segmentation"☆11Oct 30, 2019Updated 6 years ago
- ☆15Aug 15, 2023Updated 2 years ago
- Trying to deconstruct RWKV in understandable terms☆14May 6, 2023Updated 2 years ago
- E-NER: Evidential Deep Learning for Trustworthy Named Entity Recognition☆25Jul 18, 2023Updated 2 years ago
- Variance Covariance Regularization☆14Jun 22, 2023Updated 2 years ago
- Feature Importance Analysis of Models☆11Mar 23, 2022Updated 4 years ago
- Scalable Kubernetes-native implementation of the Open Data Fabric protocol for global collaborative data processing☆22Updated this week
- Time series data contribution via influence functions☆17Jan 18, 2025Updated last year
- Boosting Natural Language Generation from Instructions with Meta-Learning☆11Dec 20, 2022Updated 3 years ago
- ☆15Feb 26, 2024Updated 2 years ago
- Dirfuzz by golang☆15Feb 28, 2023Updated 3 years ago
- Source code for Leveraging 2D Information for Long-term Time Series Forecasting with Vanilla Transformers☆18May 29, 2024Updated last year
- Thinking in C++ (C++编程思想)☆14Mar 9, 2021Updated 5 years ago
- A gMLP (gated MLP) implementation in Tensorflow 1.x, as described in the paper "Pay Attention to MLPs" (2105.08050).☆16Aug 31, 2021Updated 4 years ago
- Code for "CANet: Context Aware Network for 3D Brain Tumor Segmentation"☆13Mar 20, 2024Updated 2 years ago