RWKV-Wiki/MultilingualShareGPT

MultilingualShareGPT, the free multi-language corpus for LLM training

☆72

Alternatives and similar repositories for MultilingualShareGPT

Users who are interested in MultilingualShareGPT are comparing it to the libraries listed below.

OpenLLMAI / OpenLLMDE
OpenLLMDE: An open source data engineering framework for LLMs
☆18 · Sep 9, 2023 · Updated 2 years ago
rosinality / melgan-pytorch
MelGAN and Tacotron 2 in PyTorch
☆11 · Oct 22, 2019 · Updated 6 years ago
EricLee8 / BiDeN
The official code of our paper at EMNLP 2022: Back to the Future: Bidirectional Information Decoupling Network for Multi-turn Dialogue Mo…
☆16 · Feb 17, 2023 · Updated 3 years ago
MichaelZhouwang / VLUE
This repo contains codes and instructions for baselines in the VLUE benchmark.
☆41 · Jul 16, 2022 · Updated 4 years ago
Strong-AI-Lab / Logical-Equivalence-driven-AMR-Data-Augmentation-for-Representation-Learning
The source code for Abstract Meaning Representation-Based Logic-Driven Data Augmentation for Logical Reasoning. #1 on the ReClor Leaderbo…
☆18 · Dec 2, 2025 · Updated 7 months ago
silverriver / MMChat
[LREC] MMChat: Multi-Modal Chat Dataset on Social Media
☆110 · Sep 25, 2022 · Updated 3 years ago
y-chan / hifi-gan-misrnet
unofficial pytorch implementation of HiFi-GAN with fast MISR.
☆15 · Mar 21, 2023 · Updated 3 years ago
andy-yangz / Awesome-RLHF
Awesome Reinforcement Learning from Human Feedback, the secret behind ChatGPT XD
☆23 · Dec 13, 2022 · Updated 3 years ago
gmftbyGMFTBY / Rep-Dropout
[NeurIPS 2023] Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective
☆41 · Oct 17, 2023 · Updated 2 years ago
2003pro / ScaleBiO
This is the official implementation of ScaleBiO: Scalable Bilevel Optimization for LLM Data Reweighting
☆25 · Jul 30, 2024 · Updated last year
google-research-datasets / maxm
MaXM is a suite of test-only benchmarks for multilingual visual question answering in 7 languages: English (en), French (fr), Hindi (hi),…
☆13 · Jan 16, 2024 · Updated 2 years ago
jokieleung / Maria
PyTorch implementation for ACL 2021 paper "Maria: A Visual Experience Powered Conversational Agent".
☆23 · Sep 19, 2021 · Updated 4 years ago
adobe-research / Cross-lingual-Test-Dataset-XTD10
☆17 · Dec 22, 2021 · Updated 4 years ago
EagleW / ReviewRobot
Code for ReviewRobot: Explainable Paper Review Generation based on Knowledge Synthesis
☆30 · May 31, 2021 · Updated 5 years ago
FlagOpen / FlagInstruct
☆173 · Apr 20, 2023 · Updated 3 years ago
lancopku / FedMNMT
[Findings of ACL 2023] Communication Efficient Federated Learning for Multilingual Machine Translation with Adapter
☆12 · Sep 4, 2023 · Updated 2 years ago
Neutralzz / RefQA
The implementation of the paper "Harvesting and Refining Question-Answer Pairs for Unsupervised QA"
☆33 · Nov 25, 2020 · Updated 5 years ago
Academic-Hammer / HammerLLM
1.4B sLLM for Chinese and English - HammerLLM🔨
☆44 · Apr 7, 2024 · Updated 2 years ago
allenai / allennlp-reading-comprehension-research
☆41 · Feb 12, 2019 · Updated 7 years ago
AkariAsai / logic_guided_qa
The official implementation of ACL 2020, "Logic-Guided Data Augmentation and Regularization for Consistent Question Answering".
☆71 · Jul 25, 2024 · Updated last year
launchnlp / LitCab
☆25 · Jun 10, 2025 · Updated last year
PersuGPT / PersuGPT.github.io
TL;DR: We propose a large-scale cross-domain persuasion dataset covers 13,000 scenarios in 35 domains, with the developed PersuGPT model …
☆17 · Feb 12, 2025 · Updated last year
yuweihao / reclor
Code for "ReClor: A Reading Comprehension Dataset Requiring Logical Reasoning" (ICLR 2020)
☆83 · Jul 2, 2024 · Updated 2 years ago
yuchenlin / awesome-commonsense
[Work in progress] A reading list for machine commonsense reasoning
☆34 · Apr 14, 2020 · Updated 6 years ago
maziao / T2I-Eval
[ACL 2025 Main] Open-source toolkit for automatic evaluation of text-to-image generation task, including training & test datasets and a d…
☆20 · Jul 5, 2025 · Updated last year
erogol / ParallelWaveGAN
ParallelWaveGAN adaptation for Mozilla TTS
☆15 · May 23, 2020 · Updated 6 years ago
cpii-cai / PunCantonese
A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts
☆15 · Dec 3, 2024 · Updated last year
frozentoad9 / CMST
Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages
☆13 · Oct 12, 2022 · Updated 3 years ago
SiyuanWangw / StepwiseQA
The code of Paper "Locate Then Ask: Interpretable Stepwise Reasoning for Multi-hop Question Answering".
☆22 · Sep 1, 2022 · Updated 3 years ago
GeekDream-x / IDOL
Repo for paper "IDOL: Indicator-oriented Logic Pre-training for Logical Reasoning" accepted to the Findings of ACL 2023
☆22 · Nov 7, 2023 · Updated 2 years ago
howard-hou / VisualRWKV
VisualRWKV is the visual-enhanced version of the RWKV language model, enabling RWKV to handle various visual tasks.
☆246 · Jan 13, 2026 · Updated 6 months ago
IBM / Neural_Unification_for_Logic_Reasoning_over_Language
Neural Unification for Logic Reasoning over Language
☆22 · Nov 15, 2021 · Updated 4 years ago
bajibabu / postfilt_gan
This is an implementation of "Generative adversarial network-based postfilter for statistical parametric speech synthesis"
☆16 · Jun 27, 2018 · Updated 8 years ago
ymcui / ACL2020-PC-Blogs-Chinese
Chinese Version of ACL 2020 PC Blogs （ACL 2020程序委员会博文中文版）
☆15 · Apr 15, 2020 · Updated 6 years ago
vzhong / e3
Dockerized code for E3: Entailment-driven Extracting and Editing for Conversational Machine Reading.
☆48 · Jul 22, 2023 · Updated 2 years ago
Aman-4-Real / awesome-multimodal-dialogue
Paper, dataset and code list for multimodal dialogue.
☆22 · Jan 2, 2025 · Updated last year
PengjieRen / CaSE_WISE
This repo contains the code and data used in the paper "Wizard of Search Engine: Access to Information Through Conversations with Search …
☆21 · Apr 30, 2021 · Updated 5 years ago
TTS-Research / PEL-TTS
☆14 · Aug 16, 2023 · Updated 2 years ago
openaudiolab / LLaST
LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models
☆26 · Aug 11, 2024 · Updated last year