flairNLP/fundus

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/flairNLP/fundus)

flairNLP / fundus

A very simple news crawler with a funny name

☆468

Alternatives and similar repositories for fundus

Users that are interested in fundus are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

flairNLP / transformer-ranker
View on GitHub
Efficiently find the best-suited language model (LM) for your NLP task
☆134Jul 26, 2025Updated 11 months ago
lm-pub-quiz / lm-pub-quiz
View on GitHub
Evaluate language models using multiple choice items
☆13Mar 6, 2026Updated 4 months ago
flairNLP / fabricator
View on GitHub
[EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.
☆110May 16, 2024Updated 2 years ago
flairNLP / zelda
View on GitHub
A comprehensive benchmark for entity disambiguation
☆29Jun 29, 2023Updated 3 years ago
oceanumeric / EnteRAG
View on GitHub
A RAG that can scale 🧑🏻‍💻
☆11May 28, 2024Updated 2 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
Pleias / OCRoscope
View on GitHub
Small python package to measure OCR quality and other related metrics.
☆26Feb 19, 2024Updated 2 years ago
fhamborg / news-please
View on GitHub
news-please - an integrated web crawler and information extractor for news that just works
☆2,472Apr 14, 2026Updated 3 months ago
tomaarsen / SpanMarkerNER
View on GitHub
SpanMarker for Named Entity Recognition
☆477Apr 10, 2026Updated 3 months ago
LSX-UniWue / SuperGLEBer
View on GitHub
German Language Understanding Evaluation Benchmark @NAACL24
☆22Dec 11, 2025Updated 7 months ago
cohere-ai / DiskVectorIndex
View on GitHub
☆209Jun 26, 2025Updated last year
huggingface / setfit
View on GitHub
Efficient few-shot learning with Sentence Transformers
☆2,777May 26, 2026Updated last month
flairNLP / familiarity
View on GitHub
Label shift estimation for transfer difficulty with Familiarity.
☆10Feb 4, 2025Updated last year
NathanGodey / headless-lm
View on GitHub
Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…
☆29Apr 17, 2024Updated 2 years ago
flairNLP / CleanCoNLL
View on GitHub
The CleanCoNLL dataset from our EMNLP 2023 paper where we corrected annotation errors and inconsistencies in CoNLL-03.
☆25Jul 2, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
THUNLP-MT / DirectQuote
View on GitHub
A Dataset for Direct Quotation Extraction and Attribution in News Articles.
☆14Sep 28, 2021Updated 4 years ago
boschresearch / adversarial_meta_embeddings
View on GitHub
Resources related to EMNLP 2021 paper "FAME: Feature-Based Adversarial Meta-Embeddings for Robust Input Representations"
☆13Dec 14, 2021Updated 4 years ago
helpmefindaname / transformer-smaller-training-vocab
View on GitHub
Temporary remove unused tokens during training to save ram and speed.
☆23Jun 15, 2025Updated last year
argilla-io / argilla
View on GitHub
Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets
☆5,048Updated this week
adbar / trafilatura
View on GitHub
Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XM…
☆6,334Updated this week
biaslyze-dev / biaslyze
View on GitHub
The NLP Bias Identification Toolkit
☆39Sep 8, 2023Updated 2 years ago
KRLabsOrg / rulechef
View on GitHub
Learn rule-based models from examples using LLM-powered synthesis. Replace expensive LLM calls with fast, deterministic, inspectable rege…
☆31Jul 10, 2026Updated 2 weeks ago
borisdayma / sora-mini
View on GitHub
☆18Feb 16, 2024Updated 2 years ago
HLasse / TextDescriptives
View on GitHub
A Python library for calculating a large variety of metrics from text
☆366May 5, 2026Updated 2 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
IBM / zshot
View on GitHub
Zero and Few shot named entity & relationships recognition
☆400Sep 17, 2025Updated 10 months ago
nateraw / spaces-docker-templates
View on GitHub
🚀🤗 A collection of templates for Hugging Face Spaces
☆35Oct 9, 2023Updated 2 years ago
lukasgarbas / can-we-tune-together
View on GitHub
Combining encoder-based language models
☆11Nov 11, 2021Updated 4 years ago
HallerPatrick / pecc
View on GitHub
[LREC-Coling 2024] PECC: Problem Extraction and Coding Challenges
☆14May 30, 2024Updated 2 years ago
thiippal / MoodCat
View on GitHub
MoodCat😼 classifies the mood of English sentences.
☆14Jun 19, 2022Updated 4 years ago
urchade / GLiNER
View on GitHub
Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts)
☆3,427Updated this week
opinionscience / BERTransfer
View on GitHub
A BERT-based application for reusable text classification at scale
☆37Jul 23, 2023Updated 3 years ago
webis-de / small-text
View on GitHub
Active Learning for Text Classification in Python
☆646May 24, 2026Updated 2 months ago
Hellisotherpeople / DebateSum
View on GitHub
Corresponding code repo for the paper at COLING 2020 - ARGMIN 2020: "DebateSum: A large-scale argument mining and summarization dataset"
☆55Dec 2, 2021Updated 4 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
stephantul / skeletoken
View on GitHub
Datamodels for hugging face tokenizers
☆109Jun 18, 2026Updated last month
maxdotio / mighty-batch
View on GitHub
Highly concurrent and fast content processing for Mighty Inference Server
☆10Feb 6, 2023Updated 3 years ago
mixedbread-ai / batched
View on GitHub
The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…
☆161Jul 14, 2025Updated last year
kensho-technologies / pathpiece
View on GitHub
PathPiece tokenizer
☆14Nov 10, 2024Updated last year
CODAIT / Identifying-Incorrect-Labels-In-CoNLL-2003
View on GitHub
Research into identifying and correcting incorrect labels in the CoNLL-2003 corpus.
☆12May 11, 2021Updated 5 years ago
huggingface / text-clustering
View on GitHub
Easily embed, cluster and semantically label text datasets
☆610Mar 28, 2024Updated 2 years ago
explosion / spacy-vectors-builder
View on GitHub
🌸 Train floret vectors
☆18May 4, 2023Updated 3 years ago