[ACL'24] MC^2: A Multilingual Corpus of Minority Languages in China (Tibetan, Uyghur, Kazakh, and Mongolian)
☆31Jan 17, 2026Updated 2 months ago
Alternatives and similar repositories for mc2_corpus
Users who are interested in mc2_corpus are comparing it to the libraries listed below.
- [ACL'24 Findings] Teaching Large Language Models an Unseen Language on the Fly☆25Jan 6, 2026Updated 2 months ago
- A highlight tool for reading ArXiv papers☆31May 30, 2021Updated 4 years ago
- Code for ProTrix: Building Models for Planning and Reasoning over Tables with Sentence Context☆17Nov 15, 2024Updated last year
- ☆17May 17, 2022Updated 3 years ago
- 🈵 Collected resources to learn/study Manchu (Manchurian Language). An introduction to the Manchu language and the Manchu people.☆18Jun 7, 2023Updated 2 years ago
- Code and data for "Dynosaur: A Dynamic Growth Paradigm for Instruction-Tuning Data Curation" (EMNLP 2023)☆64Nov 30, 2023Updated 2 years ago
- An ethnic culture dataset for large models☆12May 26, 2025Updated 10 months ago
- ☆21Oct 26, 2021Updated 4 years ago
- Code for Paper "Target-oriented Fine-tuning for Zero-Resource Named Entity Recognition"☆20Sep 28, 2022Updated 3 years ago
- [ICLR 2025] Adaptive prompt tailored pruning of T2I diffusion models.☆15Feb 1, 2025Updated last year
- This is the official repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Feb 22, 2024Updated 2 years ago
- Are LLMs Capable of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Data☆46Feb 18, 2025Updated last year
- Inference Code for Paper "Harder Tasks Need More Experts: Dynamic Routing in MoE Models"☆69Jul 30, 2024Updated last year
- Learnable Semi-structured Sparsity for Vision Transformers and Diffusion Transformers☆14Feb 7, 2025Updated last year
- ROCK Framework for Commonsense Causality Reasoning (CCR)☆10Jun 28, 2023Updated 2 years ago
- LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding☆36Jan 16, 2026Updated 2 months ago
- Checks your grades automatically and sends an e-mail when a new grade arrives☆12Feb 7, 2018Updated 8 years ago
- [CVPR'24] Once for Both: Single Stage of Importance and Sparsity Search for Vision Transformer Compression☆15Jul 1, 2024Updated last year
- ☆24Oct 14, 2024Updated last year
- ☆14Jul 12, 2025Updated 8 months ago
- Implementation of the DPD architecture and related experiments for the ACL 2024 paper "Semisupervised Neural Proto-Language Reconstructio…☆11Jul 22, 2024Updated last year
- Ancient book recognition☆15May 19, 2021Updated 4 years ago
- Tower Parse: Low-Resource Dependency Parsing via Hierarchical Source Selection☆15Aug 20, 2021Updated 4 years ago
- Research on the Manchu hypothesis of the Voynich manuscript. It's an Oracle database with tables, DML scripts, PL/SQL functions, and queries.☆16Jun 11, 2014Updated 11 years ago
- RO-ViT CVPR 2023 "Region-Aware Pretraining for Open-Vocabulary Object Detection with Vision Transformers"☆17Aug 24, 2023Updated 2 years ago
- The code for "VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by VIdeo SpatioTemporal Augmentation" [CVPR2025]☆21Feb 27, 2025Updated last year
- [ACL 25] SafeChain: Safety of Language Models with Long Chain-of-Thought Reasoning Capabilities☆29Apr 2, 2025Updated 11 months ago
- ☆10Mar 22, 2024Updated 2 years ago
- A paper reading list on document-level relation extraction☆60Nov 19, 2021Updated 4 years ago
- Accurate Retraining-free Pruning for Pretrained Encoder-based Language Models (ICLR 2024)☆14May 31, 2025Updated 9 months ago
- Instruction-tuning data generation using an LLM in a specific scenario.☆23May 2, 2024Updated last year
- Generate a 1 million-sample warm-up dataset for neural machine translation from a 700 million-word Mongolian text corpus using the Google…☆18Jun 27, 2025Updated 8 months ago
- ☆17Mar 6, 2024Updated 2 years ago
- A simple, Python-based, command-line runner for MGIZA++.☆10Mar 24, 2022Updated 4 years ago
- Code for paper: PoisonPrompt: Backdoor Attack on Prompt-based Large Language Models, IEEE ICASSP 2024. Demo//124.220.228.133:11107☆20Aug 10, 2024Updated last year
- [EMNLP 2024] The official GitHub repo for the paper "Course-Correction: Safety Alignment Using Synthetic Preferences"☆20Oct 2, 2024Updated last year
- ☆22Jun 1, 2023Updated 2 years ago
- A tool for extracting plain text and internal Wikipedia links from Wikipedia dumps☆11Apr 18, 2019Updated 6 years ago
- Seq2seqAttGeneration, a basic implementation of text generation that uses a seq2seq attention model to generate poem series. This project…☆18Jan 11, 2021Updated 5 years ago