C4RepSet: Representative Subset from C4 data for Training Pre-trained LMs
☆11Jan 13, 2023Updated 3 years ago
Alternatives and similar repositories for c4repset
Users who are interested in c4repset are comparing it to the libraries listed below.
- LLM Assistant with Chat Integration☆13Sep 5, 2024Updated last year
- website for MS Marco☆34Mar 26, 2025Updated last year
- Memestra plugin for the Python Language Server☆20Feb 23, 2022Updated 4 years ago
- LibreGraph Identity Management☆31Mar 18, 2026Updated last week
- Code for "On Long-Tailed Phenomena in NMT".☆10Jan 10, 2021Updated 5 years ago
- A webhook that integrates the W&B model registry with Modal Labs☆15Dec 24, 2023Updated 2 years ago
- Arabic - English emotion lexicon☆12Apr 24, 2017Updated 8 years ago
- NIILC QA data☆18Nov 20, 2015Updated 10 years ago
- repository for the project of building large arabic multidomain lexicon for sentiment analysis using feature selection from multiple reso…☆16Jan 21, 2015Updated 11 years ago
- Meeting minutes meta-dataset☆12Jun 10, 2018Updated 7 years ago
- ArSarcasm-v2 is an extension to the original ArSarcasm dataset. It was used for the shared task on sarcasm detection and sentiment analys…☆12Jan 26, 2022Updated 4 years ago
- The official Qt World Summit conference app by Felgo☆38Oct 28, 2019Updated 6 years ago
- Improved GPT-3.5 Agent (with tools) for GPT-3.5☆20May 1, 2023Updated 2 years ago
- ☆16Oct 27, 2025Updated 5 months ago
- Code for paper: Weakly- and Semi-supervised Evidence Extraction☆15Apr 12, 2021Updated 4 years ago
- Sync Settings Locale/Llama-Plugin / Launcher Shortcuts for Android☆29Oct 15, 2019Updated 6 years ago
- Partial code for "Skill Extraction from Job Postings using Weak Supervision" at RecSysHR 2022.☆13May 19, 2023Updated 2 years ago
- Example input for the Wales Group code☆11Oct 20, 2024Updated last year
- Word embeddings that do not go through word segmentation☆14Mar 19, 2017Updated 9 years ago
- An open platform for training, serving, and evaluating large language model for tool learning.☆20Aug 2, 2023Updated 2 years ago
- Scripts for creating a Japanese-English parallel corpus and training NMT models☆18Nov 9, 2021Updated 4 years ago
- ☆18Oct 22, 2024Updated last year
- Code for the paper: On Symmetric Losses for Learning from Corrupted Labels☆19May 11, 2019Updated 6 years ago
- A Molecular cluster tools built on RDKit☆11Dec 22, 2016Updated 9 years ago
- Introductory material for Solr, focused on LAMP setups.☆129Jan 28, 2013Updated 13 years ago
- A remark plugin for making interactive markdown documents with Tangle.☆13Oct 25, 2021Updated 4 years ago
- a boilerplate removal algorithm☆12Mar 22, 2016Updated 10 years ago
- Work done for "From Nand to Tetris: Building a Modern Computer from First Principles"☆11Jan 7, 2016Updated 10 years ago
- ☆10Sep 29, 2017Updated 8 years ago
- This is a new version of Learning Active Learning which uses reinforcement learning☆13May 17, 2022Updated 3 years ago
- Github for the NIPS 2020 paper "Learning outside the black-box: at the pursuit of interpretable models"☆14Sep 7, 2022Updated 3 years ago
- ☆17Jul 31, 2021Updated 4 years ago
- Japanese tutorial for Vespa☆20Mar 14, 2018Updated 8 years ago
- GloSAT Historical Measurement Table Dataset☆11Dec 3, 2025Updated 3 months ago
- A python program that applies a choice of nonnegative matrix factorization (NMF) algorithms to a dataset for clustering.☆13Jun 10, 2019Updated 6 years ago
- The code to reproduce CVPR 2021 paper "Towards Robust Classification Model by Counterfactual and Invariant Data Generation"☆17Jul 29, 2021Updated 4 years ago
- Nile University's Arabic sentiment Lexicon☆17Nov 24, 2016Updated 9 years ago
- ☆28Sep 4, 2023Updated 2 years ago
- Free, opensource, serverless learning platform☆60Mar 14, 2023Updated 3 years ago