MultilingualShareGPT, the free multi-language corpus for LLM training
☆73Apr 6, 2023Updated 2 years ago
Alternatives and similar repositories for MultilingualShareGPT
Users that are interested in MultilingualShareGPT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Let us make Psychohistory (as in Asimov) a reality, and accessible to everyone. Useful for LLM grounding and games / fiction / business /…☆40Apr 9, 2023Updated 2 years ago
- OpenLLMDE: An open source data engineering framework for LLMs☆18Sep 9, 2023Updated 2 years ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Aug 18, 2023Updated 2 years ago
- The official code of our paper at EMNLP 2022: Back to the Future: Bidirectional Information Decoupling Network for Multi-turn Dialogue Mo…☆16Feb 17, 2023Updated 3 years ago
- text to speech☆10Mar 19, 2024Updated 2 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- The source code for Abstract Meaning Representation-Based Logic-Driven Data Augmentation for Logical Reasoning. #1 on the ReClor Leaderbo…☆18Dec 2, 2025Updated 3 months ago
- [LREC] MMChat: Multi-Modal Chat Dataset on Social Media☆108Sep 25, 2022Updated 3 years ago
- unofficial pytorch implementation of HiFi-GAN with fast MISR.☆15Mar 21, 2023Updated 3 years ago
- https://liuzeming01.github.io/XDailyDialog/☆14Jun 25, 2023Updated 2 years ago
- Awesome Reinforcement Learning from Human Feedback, the secret behind ChatGPT XD☆23Dec 13, 2022Updated 3 years ago
- MelGAN and Tacotron 2 in PyTorch☆11Oct 22, 2019Updated 6 years ago
- [NeurIPS 2023] Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective☆41Oct 17, 2023Updated 2 years ago
- NAACL 2022: MCSE: Multimodal Contrastive Learning of Sentence Embeddings☆58Jun 10, 2024Updated last year
- PyTorch implementation for ACL 2021 paper "Maria: A Visual Experience Powered Conversational Agent".☆23Sep 19, 2021Updated 4 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- MaXM is a suite of test-only benchmarks for multilingual visual question answering in 7 languages: English (en), French (fr), Hindi (hi),…☆13Jan 16, 2024Updated 2 years ago
- ☆15Dec 22, 2021Updated 4 years ago
- Code for ReviewRobot: Explainable Paper Review Generation based on Knowledge Synthesis☆30May 31, 2021Updated 4 years ago
- [Findings of ACL 2023] Communication Efficient Federated Learning for Multilingual Machine Translation with Adapter☆12Sep 4, 2023Updated 2 years ago
- TL;DR: We propose a large-scale cross-domain persuasion dataset covers 13,000 scenarios in 35 domains, with the developed PersuGPT model …☆17Feb 12, 2025Updated last year
- [Work in progress] A reading list for machine commonsense reasoning☆34Apr 14, 2020Updated 5 years ago
- Code for "ReClor: A Reading Comprehension Dataset Requiring Logical Reasoning" (ICLR 2020)☆83Jul 2, 2024Updated last year
- 1.4B sLLM for Chinese and English - HammerLLM🔨☆42Apr 7, 2024Updated last year
- ParallelWaveGAN adaptation for Mozilla TTS☆15May 23, 2020Updated 5 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Oct 12, 2022Updated 3 years ago
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆16Dec 3, 2024Updated last year
- Repo for paper "IDOL: Indicator-oriented Logic Pre-training for Logical Reasoning" accepted to the Findings of ACL 2023☆22Nov 7, 2023Updated 2 years ago
- VisualRWKV is the visual-enhanced version of the RWKV language model, enabling RWKV to handle various visual tasks.☆244Jan 13, 2026Updated 2 months ago
- Codebase for EnterpriseOps-Gym from ServiceNow☆71Mar 22, 2026Updated last week
- ☆25Jun 10, 2025Updated 9 months ago
- Neural Unification for Logic Reasoning over Language☆22Nov 15, 2021Updated 4 years ago
- This is an implementation of "Generative adversarial network-based postfilter for statistical parametric speech synthesis"☆16Jun 27, 2018Updated 7 years ago
- Chinese Version of ACL 2020 PC Blogs (ACL 2020程序委员会博文中文版)☆15Apr 15, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- WIP Tensorflow implementation of https://github.com/mozilla/TTS☆15Apr 11, 2020Updated 5 years ago
- LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models☆27Aug 11, 2024Updated last year
- This repo contains the code and data used in the paper "Wizard of Search Engine: Access to Information Through Conversations with Search …☆21Apr 30, 2021Updated 4 years ago
- A Wikipedia-based summarization dataset☆14Mar 27, 2023Updated 3 years ago
- GraphRetriever in the paper "Knowledge Guided Text Retrieval and Reading for Open Domain Question Answering"☆39Nov 22, 2021Updated 4 years ago
- AdaLoGN: Adaptive Logic Graph Network for Reasoning-Based Machine Reading Comprehension (ACL 2022)☆27May 20, 2022Updated 3 years ago
- A span-based joint named entity recognition (NER) and relation extraction model.☆11Aug 5, 2020Updated 5 years ago