Train transformer-based models.
☆28Jan 23, 2026Updated 2 months ago
Alternatives and similar repositories for zeldarose
Users that are interested in zeldarose are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Source code of "Leaky Thoughts: Large Reasoning Models Are Not Private Thinkers" EMNLP 2025☆17Jan 12, 2026Updated 2 months ago
- Arabic News Stance Corpus☆11Feb 5, 2021Updated 5 years ago
- A small python library to parse and write TSV files generated by the WebAnno software.☆11Apr 14, 2025Updated 11 months ago
- Coala is a python package for Contextual Answer Sentence Selection.☆15Jun 12, 2023Updated 2 years ago
- Code for the paper "Modelling Latent Translations for Cross-Lingual Transfer"☆17Nov 22, 2021Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Converter from UD-trees to BART representation☆35Mar 6, 2024Updated 2 years ago
- ☆30Feb 11, 2022Updated 4 years ago
- Code for the paper Neural Pipeline for Zero-Shot Data-to-Text Generation☆16Aug 26, 2024Updated last year
- LegionTools is a toolkit + UI that provides an easy way to recruit and route workers from Amazon Mechanical Turk to real-time and synchro…☆23Jun 5, 2018Updated 7 years ago
- Grobid module for superconductor material and properties extraction☆22May 17, 2025Updated 10 months ago
- WiNER-fr is a free named entity corpus using French Wikinews texts.☆17Feb 12, 2021Updated 5 years ago
- NLQuAD: A Non-Factoid Long Question Answering Data Set. To be published at EACL2021☆13May 18, 2021Updated 4 years ago
- Lazily regularized updates for Adagrad with sparse features. Implemented in Cython for efficiency.☆11Jan 2, 2021Updated 5 years ago
- A collection of datasets for language model pretraining including scripts for downloading, preprocesssing, and sampling.☆64Jul 29, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Material parsers and other tools, scripts Initially developed for Grobid Superconductor☆13Feb 21, 2025Updated last year
- Material for the MASH course on introduction to scikit-learn☆16Apr 15, 2017Updated 8 years ago
- Auxiliary GAN for WE post-specialisation☆24Feb 22, 2019Updated 7 years ago
- Triplet neural network for joint representation learning for text and images☆10Mar 17, 2019Updated 7 years ago
- Simple CORPORA list crawler☆10Dec 2, 2016Updated 9 years ago
- Conditionally Adaptive Multi-Task Learning: Improving Transfer Learning in NLP Using Fewer Parameters & Less Data☆57Aug 5, 2021Updated 4 years ago
- Build 2019 Demos for Knowledge Mining Session☆10May 17, 2019Updated 6 years ago
- GISTEmbed: Guided In-sample Selection of Training Negatives for Text Embeddings☆44Mar 6, 2024Updated 2 years ago
- Repository hosting the common code for the entity-fishing clients☆10Jun 10, 2025Updated 9 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- The grobidmonkey package is an open-source package designed for postprocessing GROBID outputs.☆12Mar 27, 2024Updated 2 years ago
- Analytic platform for the HAL research archive (in development)☆13Oct 2, 2020Updated 5 years ago
- Specification of a stand-off element for the TEI guidelines☆12Apr 29, 2021Updated 4 years ago
- AutoML Workshop (Azure Machine Learning mainly)☆13Jan 5, 2020Updated 6 years ago
- ☆11Dec 8, 2022Updated 3 years ago
- Backend, IA-specific tools for crawling and processing the scholarly web. Content ends up in https://fatcat.wiki☆28Jul 31, 2024Updated last year
- A machine learning software for extracting astronomical entities from scholarly documents☆10Oct 31, 2022Updated 3 years ago
- 🕸 GlotWeb: Web Indexing for Minority Languages (WWW 2026)☆17Feb 27, 2026Updated last month
- ML Reproducibility Challenge 2020: Electra reimplementation using PyTorch and Transformers☆12Apr 16, 2021Updated 4 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- WindSR Dataset contains more than 22,000 pairs of HR/LR wind speed images, which are processed using the NASA's GEOS-5 Nature Run dataset…☆12Jan 18, 2024Updated 2 years ago
- Microsoft question-answering dataset☆10Jun 16, 2023Updated 2 years ago
- Line shuffler for huge text file which does not fit in memory☆13Dec 1, 2022Updated 3 years ago
- Tools for Creating Universal Numeric Fingerprints for Data☆22Apr 12, 2022Updated 3 years ago
- Natural language detection, Java bindings for CLD2☆17Feb 26, 2026Updated last month
- Depth-Bounded PCFG Induction☆13Apr 19, 2019Updated 6 years ago
- Utility to compile string of chemical terms into data structure with chemical formula and composition☆13Sep 17, 2021Updated 4 years ago