Generate synthetic labeled data for extremely low-resource languages using bilingual lexicons.
☆19Oct 3, 2024Updated last year
Alternatives and similar repositories for LexC-Gen
Users that are interested in LexC-Gen are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Crosslingual Reasoning through Test-Time Scaling☆19May 13, 2025Updated 10 months ago
- Code for "Preference Tuning For Toxicity Mitigation Generalizes Across Languages." Paper accepted at Findings of EMNLP 2024☆18Mar 25, 2025Updated 11 months ago
- A Flexible Toolkit for Dense Retrieval☆44Nov 12, 2025Updated 4 months ago
- A system for prompted weak supervision. Alfred is a powerful tool that leverages large language models to accelerate data annotation.☆58Apr 3, 2025Updated 11 months ago
- Follow-Up Differential Descriptions: Language Models Resolve Ambiguities for Image Classification☆11Nov 15, 2023Updated 2 years ago
- Can LLMs generate code-mixed sentences through zero-shot prompting?☆11Apr 18, 2023Updated 2 years ago
- Official Implementation of K-Paths: Reasoning over Graph Paths for Drug Repurposing and Drug Interaction Prediction.☆18Jul 8, 2025Updated 8 months ago
- Exploring prompt tuning with pseudolabels for multiple modalities, learning settings, and training strategies.☆49Nov 8, 2024Updated last year
- Dataset and benchmark for assessing LLMs in translating natural language descriptions of planning problems into PDDL☆65Oct 16, 2024Updated last year
- ☆22Jul 16, 2024Updated last year
- Python application, generating parallel corpus for any language pairs, can be used for training nmt (Neural Machine Translation) systems☆12Dec 8, 2022Updated 3 years ago
- Framework for zero-shot learning with knowledge graphs.☆113Mar 28, 2023Updated 2 years ago
- NLRB data scraper by LexPredict☆12Dec 8, 2022Updated 3 years ago
- Forcing Diffuse Distributions out of Language Models☆18Sep 10, 2024Updated last year
- finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests☆14Jan 24, 2017Updated 9 years ago
- SemEval 2019 Task 4: Hyperpartisan News Detection☆10Nov 9, 2019Updated 6 years ago
- A library for research in unnatural language semantics☆14Mar 5, 2026Updated 2 weeks ago
- A recurrent neural network model to analyze how travelers expressed their feelings on Twitter☆12Jun 30, 2019Updated 6 years ago
- Automatic OCR of clipboard contents.☆14Aug 12, 2022Updated 3 years ago
- Decrypts WhatsApp msgstore.db.crypt14 files.☆10Jan 1, 2022Updated 4 years ago
- ☆11Jun 23, 2022Updated 3 years ago
- A neural parser for QA-SRL.☆23Apr 29, 2019Updated 6 years ago
- Towards Few-Shot Fact-Checking via Perplexity☆14Jun 11, 2021Updated 4 years ago
- Generalized Data Augmentation for Low-Resource Translation☆12Jul 30, 2019Updated 6 years ago
- This repository contains a brief info about me(spidy20).☆12Jul 24, 2024Updated last year
- An Interactive Tool for Annotating Discourse Structure and Text Improvement☆16Sep 15, 2021Updated 4 years ago
- Natural Language Processing in Ethiopian Languages: Current State, Challenges, and Opportunities☆17Jun 4, 2025Updated 9 months ago
- Crowd-sourced lists of urls to help Common Crawl crawl under-resourced languages. See https://github.com/commoncrawl/web-languages-code/ …☆69Jan 7, 2026Updated 2 months ago
- ☆15Nov 29, 2018Updated 7 years ago
- Multi-GPU supported kmeans clustering for cluser-clip☆15Jun 3, 2024Updated last year
- A simple web application for searching Word2Vec embeddings derived from approximately 2,000 law reports published by the The Incorporated…☆27Oct 4, 2022Updated 3 years ago
- English Resource Grammar☆25Mar 15, 2026Updated last week
- CNN and Contrastive Autoencoder (CAE) on EMNIST using Tensorflow☆10Oct 7, 2018Updated 7 years ago
- My personal website and blog☆20Jan 29, 2026Updated last month
- The website of the Oscar Project☆11Mar 27, 2025Updated 11 months ago
- [WIP] AI that "reads" live TV and writes it as a movie script in real-time.☆23Jun 3, 2025Updated 9 months ago
- Conversational Neuro-Symbolic Commonsense Reasoning☆26Jun 18, 2020Updated 5 years ago
- Accepted to ICLR 2025. MetaMetrics is a calibrated meta-metric designed to evaluate generation tasks across different modalities aligned …☆14Dec 30, 2024Updated last year
- COLING 2018 Tutorial on Multilingual FrameNet: Automatic semantic role labeling for FrameNet☆25Aug 29, 2018Updated 7 years ago