Generate synthetic labeled data for extremely low-resource languages using bilingual lexicons.
☆20Oct 3, 2024Updated last year
Alternatives and similar repositories for LexC-Gen
Users that are interested in LexC-Gen are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A weak supervision framework for (partial) labeling functions☆16Jul 15, 2024Updated last year
- Crosslingual Reasoning through Test-Time Scaling☆19May 13, 2025Updated 11 months ago
- Code for "Preference Tuning For Toxicity Mitigation Generalizes Across Languages." Paper accepted at Findings of EMNLP 2024☆18Mar 25, 2025Updated last year
- A Flexible Toolkit for Dense Retrieval☆47Nov 12, 2025Updated 5 months ago
- A system for prompted weak supervision. Alfred is a powerful tool that leverages large language models to accelerate data annotation.☆58Apr 3, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Follow-Up Differential Descriptions: Language Models Resolve Ambiguities for Image Classification☆11Nov 15, 2023Updated 2 years ago
- Official Implementation of K-Paths: Reasoning over Graph Paths for Drug Repurposing and Drug Interaction Prediction.☆20Jul 8, 2025Updated 9 months ago
- Exploring prompt tuning with pseudolabels for multiple modalities, learning settings, and training strategies.☆49Nov 8, 2024Updated last year
- Dataset and benchmark for assessing LLMs in translating natural language descriptions of planning problems into PDDL☆66Oct 16, 2024Updated last year
- ☆22Jul 16, 2024Updated last year
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources☆152Oct 2, 2025Updated 7 months ago
- Framework for weakly supervised deep sequence taggers, focused on named entity recognition☆77Feb 10, 2023Updated 3 years ago
- Framework for zero-shot learning with knowledge graphs.☆113Mar 28, 2023Updated 3 years ago
- NLRB data scraper by LexPredict☆12Dec 8, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ✍️ A browser add-on (Firefox, Chrome, Thunderbird) that allows you to autocorrect common text sequences and convert text characters to a …☆12Updated this week
- ☆13Oct 3, 2024Updated last year
- Forcing Diffuse Distributions out of Language Models☆18Sep 10, 2024Updated last year
- A PHP-based application to create and manage anonymous surveys with restricted access for selected participants.☆10Nov 20, 2024Updated last year
- A home for collaboration on construction of a multilingual FrameNet☆13Aug 25, 2017Updated 8 years ago
- finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests☆14Jan 24, 2017Updated 9 years ago
- Dataset Catalogue Homepage for Indonesian Languages☆10Feb 19, 2024Updated 2 years ago
- ☆16Jun 10, 2024Updated last year
- LOW-RESOURCE NEURAL MACHINE TRANSLATION: A BENCHMARK FOR FIVE AFRICAN LANGUAGES☆16Jul 27, 2020Updated 5 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Word-level language identification for Bangla-English code-mixed social media data, using a BiLSTM with subword embeddings.☆10Aug 13, 2023Updated 2 years ago
- A library for research in unnatural language semantics☆14Mar 5, 2026Updated last month
- MegaDetector models served over FastAPI & visualized with Streamlit☆10May 9, 2023Updated 2 years ago
- Automatic OCR of clipboard contents.☆14Aug 12, 2022Updated 3 years ago
- Este repositorio contiene el dataset para el entrenamiento de una CNN que clasifica 10 diferentes tipos de especies de aves☆13Oct 8, 2021Updated 4 years ago
- A neural parser for QA-SRL.☆23Apr 29, 2019Updated 7 years ago
- ☆11Jun 23, 2022Updated 3 years ago
- Towards Few-Shot Fact-Checking via Perplexity☆13Jun 11, 2021Updated 4 years ago
- A python module to process data for Frame Semantic Parsing☆23Nov 3, 2020Updated 5 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- TransOMCS is a commonsense knowledge resource transferred from ASER. It is in the format of OMCS but two orders of magnitude larger.☆70Aug 25, 2020Updated 5 years ago
- Code for RSS 2020 paper: Robot Object Retrieval with Contextual Natural Language Queries☆14Apr 21, 2022Updated 4 years ago
- ☆45Jul 5, 2022Updated 3 years ago
- Crowd-sourced lists of urls to help Common Crawl crawl under-resourced languages. See https://github.com/commoncrawl/web-languages-code/ …☆68Updated this week
- ☆15Nov 29, 2018Updated 7 years ago
- A simple web application for searching Word2Vec embeddings derived from approximately 2,000 law reports published by the The Incorporated…☆27Oct 4, 2022Updated 3 years ago
- CNN and Contrastive Autoencoder (CAE) on EMNIST using Tensorflow☆10Oct 7, 2018Updated 7 years ago