[ACL'24] MC^2: A Multilingual Corpus of Minority Languages in China (Tibetan, Uyghur, Kazakh, and Mongolian)
☆33Jan 17, 2026Updated 3 months ago
Alternatives and similar repositories for mc2_corpus
Users that are interested in mc2_corpus are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆11May 28, 2024Updated last year
- Code for ProTrix: Building Models for Planning and Reasoning over Tables with Sentence Context☆17Nov 15, 2024Updated last year
- A Multi-tasking and Multi-stage Chinese Minority Pre-Trained Language Model☆12Jul 24, 2023Updated 2 years ago
- Source code and data for Counterfactual Recipe Generation: Exploring Models’ Compositional Generalization Ability in a Realistic Scenario…☆15Oct 25, 2022Updated 3 years ago
- ☆17May 17, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- CINO: Pre-trained Language Models for Chinese Minority (少数民族语言预训练模型)☆266Jul 15, 2025Updated 9 months ago
- 🈵 Collected resources to learn/study Manchu (Manchurian Language). 满语滿族満州語入門。☆18Jun 7, 2023Updated 2 years ago
- ☆19Dec 29, 2024Updated last year
- 面向大模型的民族文化数据集☆12May 26, 2025Updated 11 months ago
- ☆21Oct 26, 2021Updated 4 years ago
- ☆13Nov 17, 2024Updated last year
- Code for Paper "Target-oriented Fine-tuning for Zero-Resource Named Entity Recognition"☆20Sep 28, 2022Updated 3 years ago
- ☆12May 13, 2023Updated 2 years ago
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Feb 22, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Are LLMs Capable of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Data☆47Feb 18, 2025Updated last year
- Inference Code for Paper "Harder Tasks Need More Experts: Dynamic Routing in MoE Models"☆71Jul 30, 2024Updated last year
- The guideline for pod.☆10Jun 19, 2020Updated 5 years ago
- ROCK Framework for Commonsense Causality Reasoning (CCR)☆10Jun 28, 2023Updated 2 years ago
- LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding☆37Apr 25, 2026Updated last week
- Learnable Semi-structured Sparsity for Vision Transformers and Diffusion Transformers☆15Feb 7, 2025Updated last year
- Check your grade automatically and send e-mail when new grade comes☆12Feb 7, 2018Updated 8 years ago
- Code for "Self-Lifting: A Novel Framework For Unsupervised Voice-Face Association Learning,ICMR,2022"☆15Oct 25, 2024Updated last year
- ☆23Oct 14, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 古籍识别☆15May 19, 2021Updated 4 years ago
- Tower Parse: Low-Resource Dependency Parsing via Hierarchical Source Selection☆15Aug 20, 2021Updated 4 years ago
- EmotionCircuits-LLM: A complete, reproducible framework for discovering and controlling emotion circuits in large language models.☆55Apr 7, 2026Updated 3 weeks ago
- RO-ViT CVPR 2023 "Region-Aware Pretraining for Open-Vocabulary Object Detection with Vision Transformers"☆17Aug 24, 2023Updated 2 years ago
- The code for "VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by VIdeo SpatioTemporal Augmentation" [CVPR2025]☆21Feb 27, 2025Updated last year
- 受到self-instruct启发,除了通用LLM还能做垂直领域的小LLM实现定制效果,通过GPT获得question和answer来作为训练数据☆18May 12, 2023Updated 2 years ago
- a paper reading list on Document level Relation Extraction☆60Nov 19, 2021Updated 4 years ago
- Repository for ACL2021 paper: <Zero-shot Event Extraction via Transfer Learning: Challenges and Insights>.☆30Jan 5, 2023Updated 3 years ago
- Codes for paper Exploring Sequence-to-Sequence Learning for Aspect Term Extraction.☆13Nov 27, 2019Updated 6 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- URIEL+ knowledge base for natural language processing☆17Dec 16, 2025Updated 4 months ago
- 【大模型 & NLP & 算法大礼包】提供大量NLP、大模型和算法付费干货,一套拥有,学习&科研&工作不愁!☆30Sep 18, 2024Updated last year
- Accurate Retraining-free Pruning for Pretrained Encoder-based Language Models (ICLR 2024)☆14May 31, 2025Updated 11 months ago
- Generate a 1 million-sample warm-up dataset for neural machine translation from a 700 million-word Mongolian text corpus using the Google…☆18Jun 27, 2025Updated 10 months ago
- Tibetan-English translator for CLI☆16Jan 26, 2026Updated 3 months ago
- The official dataset of paper "Goal-Oriented Prompt Attack and Safety Evaluation for LLMs".☆21Feb 5, 2024Updated 2 years ago
- The project page for "SCITAB: A Challenging Benchmark for Compositional Reasoning and Claim Verification on Scientific Tables"☆23Dec 21, 2023Updated 2 years ago