argilla-io/biome-text

Custom Natural Language Processing with big and small models 🌲🌱

☆66

Alternatives and similar repositories for biome-text

Users that are interested in biome-text are comparing it to the libraries listed below.

argilla-io / awesome-llm-datasets
👩🤝🤖 A curated list of datasets for large language models (LLMs), RLHF and related resources (continually updated)
☆26 · May 2, 2023 · Updated 3 years ago
aistairc / kirt_bert_on_abci
Training BERT on ABCI
☆19 · Aug 31, 2021 · Updated 4 years ago
hotchpotch / yast
YAST - Yet Another SPLADE or Sparse Trainer
☆21 · Jun 16, 2025 · Updated last year
ausgerechnet / cwb-ccc
Python wrapper for the CWB to extract concordances and score frequency lists
☆22 · May 11, 2026 · Updated 2 months ago
jmzhao / pbos
☆19 · Oct 10, 2020 · Updated 5 years ago
microsoft / ExperimentTools
XTlib is an API and command line tool for scaling and managing ML experiments. The goal of XTLib is to enable you to effortlessly organi…
☆15 · Jul 5, 2023 · Updated 3 years ago
RaRe-Technologies / talks
Presentations & notebooks from our talks/workshops/meetups/etc
☆24 · Mar 23, 2018 · Updated 8 years ago
angeloskath / supervised-lda
A flexible variational inference LDA library.
☆23 · Mar 15, 2019 · Updated 7 years ago
epwalsh / python-registrable
Python module for registering and instantiating classes by name
☆13 · Dec 11, 2019 · Updated 6 years ago
Sandeep42 / anuvada
Interpretable Models for NLP using PyTorch
☆18 · Jan 22, 2018 · Updated 8 years ago
mllocs / standard-notes-chrome-extension
Unofficial chrome extension for Standard Notes
☆14 · Dec 5, 2020 · Updated 5 years ago
computing-mq / mlrg
Machine Learning Reading Group
☆11 · Sep 15, 2023 · Updated 2 years ago
JeremyAlain / imitation_learning_from_language_feedback
This repository contains some of the code used in the paper "Training Language Models with Language Feedback at Scale"
☆26 · Mar 30, 2023 · Updated 3 years ago
zhaoxiaoyu1995 / fpn_HSL-TFP
A Surrogate Model with Data Augmentation and Deep Transfer Learning for Temperature Field Prediction of Heat Source Layout
☆11 · Nov 25, 2020 · Updated 5 years ago
allenai / allennlp-hub
A collection of selected models built with AllenNLP.
☆26 · Feb 20, 2020 · Updated 6 years ago
gumption / pydata-simpsons
Content associated with a PyData Seattle 2017 tutorial on Unevenly spaced time series analysis of The Simpsons using pandas
☆15 · Jul 6, 2017 · Updated 9 years ago
rejcom / maps
☆11 · Oct 27, 2022 · Updated 3 years ago
argilla-io / argilla
Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets
☆5,063 · Updated this week
explosion / catalogue
Super lightweight function registries for your library
☆183 · Mar 27, 2026 · Updated 4 months ago
alexbrandsen / jsonl2bio
Script that converts JSONL output from Doccano to the BIO format
☆10 · Jul 5, 2019 · Updated 7 years ago
LAION-AI / riverbed
Tools for content datamining and NLP at scale
☆45 · Jun 20, 2024 · Updated 2 years ago
k-tahiro / bert-summarizer
BERT-based text summarizer.
☆10 · Updated this week
ajb129 / KeyakiTreebank
Keyaki Treebank Parsed Corpus
☆10 · May 15, 2019 · Updated 7 years ago
socialmediaie / SocialMediaIE
A toolkit for social media information extraction using multi-task learning and active learning
☆19 · Dec 27, 2022 · Updated 3 years ago
LuisaMaerz / KnowMAN
KnowMAN: Weakly Supervised Multinomial Adversarial Networks
☆12 · Nov 9, 2021 · Updated 4 years ago
SiphuLangeni / tortus
A PyPI package for easy text annotation in a Jupyter Notebook.
☆29 · Aug 9, 2021 · Updated 4 years ago
agoose77 / literary
Literate Python package development with Jupyter
☆12 · Aug 18, 2025 · Updated 11 months ago
megagonlabs / ruler
Data Programming by Demonstration (DPBD) for Document Classification
☆35 · Jun 17, 2021 · Updated 5 years ago
klainfo / awesome-text-summarization
A curated list of resources dedicated to text summarization
☆11 · Mar 28, 2018 · Updated 8 years ago
svenwiegand / typed-intl
Typed internationalization (intl/i18n) library for TypeScript/JavaScript apps.
☆19 · Dec 9, 2022 · Updated 3 years ago
dhlab-epfl / dhSegment-text
Fork of dhSegment for experiments on visual and textual feature combination.
☆15 · Jan 30, 2021 · Updated 5 years ago
mjendrusch / torchsupport
Supporting tools for PyTorch in biology research.
☆19 · Mar 10, 2022 · Updated 4 years ago
zackchase / label_shift
A simple algorithm to identify and correct for label shift.
☆21 · Feb 4, 2018 · Updated 8 years ago
probcomp / notebook
jupyter/datascience-notebook with probcomp libraries
☆16 · Sep 17, 2020 · Updated 5 years ago
casutton / bayes-qnet
Code for Bayesian inference for queueing networks with incomplete data
☆12 · Jul 5, 2017 · Updated 9 years ago
rrmenon10 / ExEnt
[ACL 2022] CLUES: A Benchmark for Learning Classifiers using Natural Language Explanations
☆10 · Jun 5, 2022 · Updated 4 years ago
tastyminerals / ccrawl
Simple CORPORA list crawler
☆11 · Dec 2, 2016 · Updated 9 years ago
shivaditya-meduri / transvalorResearchProject
Graph Neural Network-based Surrogate Models for Finite Element Analysis
☆10 · Sep 25, 2022 · Updated 3 years ago
webis-de / small-text
Active Learning for Text Classification in Python
☆648 · May 24, 2026 · Updated 2 months ago