[ACL'24] MC^2: A Multilingual Corpus of Minority Languages in China (Tibetan, Uyghur, Kazakh, and Mongolian)
☆35Jan 17, 2026Updated 4 months ago
Alternatives and similar repositories for mc2_corpus
Users that are interested in mc2_corpus are comparing it to the libraries listed below.
- ☆11May 28, 2024Updated 2 years ago
- A highlight tool for reading ArXiv papers☆31May 30, 2021Updated 5 years ago
- A Multi-tasking and Multi-stage Chinese Minority Pre-Trained Language Model☆12Jul 24, 2023Updated 2 years ago
- Source code and data for Counterfactual Recipe Generation: Exploring Models’ Compositional Generalization Ability in a Realistic Scenario…☆15Oct 25, 2022Updated 3 years ago
- Code and data for "Dynosaur: A Dynamic Growth Paradigm for Instruction-Tuning Data Curation" (EMNLP 2023)☆64Nov 30, 2023Updated 2 years ago
- Extractive text summarization based on multi-level LSTM☆12Aug 20, 2024Updated last year
- A curated list of papers on LLMs and agents for scientific research and development☆91Dec 11, 2024Updated last year
- Are LLMs Capable of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Data☆48Feb 18, 2025Updated last year
- Inference Code for Paper "Harder Tasks Need More Experts: Dynamic Routing in MoE Models"☆74Jul 30, 2024Updated last year
- Chinese legal LLaMA (LLaMA for the Chinese legal domain)☆993Aug 28, 2024Updated last year
- MergeNet-filter-ldr2hdr; details in the paper "Reconstructing HDR Image from a Single Filtered LDR Image Base on a Deep HDR Merger Network"☆10Sep 11, 2019Updated 6 years ago
- ROCK Framework for Commonsense Causality Reasoning (CCR)☆10Jun 28, 2023Updated 2 years ago
- Learnable Semi-structured Sparsity for Vision Transformers and Diffusion Transformers☆15Feb 7, 2025Updated last year
- Towards Meta-Pruning via Optimal Transport, ICLR 2024 (Spotlight)☆18Dec 5, 2024Updated last year
- Code for "Self-Lifting: A Novel Framework For Unsupervised Voice-Face Association Learning" (ICMR 2022)☆15Oct 25, 2024Updated last year
- ☆24Oct 14, 2024Updated last year
- ☆15Jul 12, 2025Updated 11 months ago
- [Machine Learning 2023] NaCL: Noise-Robust Cross-Domain Contrastive Learning for Unsupervised Domain Adaptation☆12Jul 8, 2023Updated 2 years ago
- Ancient book recognition☆15May 19, 2021Updated 5 years ago
- Tower Parse: Low-Resource Dependency Parsing via Hierarchical Source Selection☆15Aug 20, 2021Updated 4 years ago
- Motion Generation from Fine-grained Textual Descriptions (LREC-COLING 2024)☆15Jun 13, 2024Updated 2 years ago
- [ACL 25] SafeChain: Safety of Language Models with Long Chain-of-Thought Reasoning Capabilities☆30Apr 2, 2025Updated last year
- ☆10Mar 22, 2024Updated 2 years ago
- URIEL+ knowledge base for natural language processing☆17May 5, 2026Updated last month
- Accurate Retraining-free Pruning for Pretrained Encoder-based Language Models (ICLR 2024)☆14May 31, 2025Updated last year
- Make the Best of Cross-lingual Transfer: Evidence from POS Tagging with over 100 Languages (ACL 2022)☆19May 17, 2022Updated 4 years ago
- The project page for "SCITAB: A Challenging Benchmark for Compositional Reasoning and Claim Verification on Scientific Tables"☆23Dec 21, 2023Updated 2 years ago
- Tibetan-English translator for CLI☆16Jan 26, 2026Updated 4 months ago
- A Manchu dictionary website☆13Feb 26, 2026Updated 3 months ago
- FastNLP implementation of several ABSA subtasks and models; also available at https://gitee.com/ROGERDJQ/FastABSA.git.☆17Mar 18, 2023Updated 3 years ago
- Code for paper: PoisonPrompt: Backdoor Attack on Prompt-based Large Language Models, IEEE ICASSP 2024. Demo//124.220.228.133:11107☆21Aug 10, 2024Updated last year
- A simple, Python-based, command-line runner for MGIZA++.☆10Mar 24, 2022Updated 4 years ago
- [EMNLP 2024] The official GitHub repo for the paper "Course-Correction: Safety Alignment Using Synthetic Preferences"☆20Oct 2, 2024Updated last year
- [ACL 2026] Code, benchmark and environment for "OS-Sentinel: Towards Safety-Enhanced Mobile GUI Agents via Hybrid Validation in Realistic…☆48May 31, 2026Updated 2 weeks ago
- ☆22Jun 1, 2023Updated 3 years ago
- Seq2seqAttGeneration, a basic implementation of text generation that uses a seq2seq attention model to generate poem series. This project…☆18Jan 11, 2021Updated 5 years ago
- Papers about Opinion Triplet Extraction, including two subtasks: Aspect Sentiment Triplet Extraction (ASTE) and Aspect Sentiment Opinion…☆19Nov 17, 2021Updated 4 years ago
- ☆24Jan 18, 2022Updated 4 years ago
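- Ziya-LLaMA-13B is a 13-billion-parameter large-scale pre-trained model built by IDEA on LLaMA, with capabilities in translation, programming, text classification, information extraction, summarization, copywriting, commonsense question answering, and mathematical calculation. The Ziya general-purpose large model has completed a three-stage training process: large-scale pre-training, multi-task supervised fine-tuning, and learning from human feedback. This document is mainly for Ziya-…☆46Jun 9, 2023Updated 3 years ago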