A very simple news crawler with a funny name
☆443Mar 17, 2026Updated last week
Alternatives and similar repositories for fundus
Users that are interested in fundus are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Efficiently find the best-suited language model (LM) for your NLP task☆135Jul 26, 2025Updated 7 months ago
- Evaluate language models using multiple choice items☆13Mar 6, 2026Updated 2 weeks ago
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆110May 16, 2024Updated last year
- A RAG that can scale 🧑🏻💻☆11May 28, 2024Updated last year
- Small python package to measure OCR quality and other related metrics.☆27Feb 19, 2024Updated 2 years ago
- news-please - an integrated web crawler and information extractor for news that just works☆2,401Sep 21, 2025Updated 6 months ago
- SpanMarker for Named Entity Recognition☆465Jan 8, 2025Updated last year
- German Language Understanding Evaluation Benchmark @NAACL24☆22Dec 11, 2025Updated 3 months ago
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…☆28Apr 17, 2024Updated last year
- ☆209Jun 26, 2025Updated 8 months ago
- Efficient few-shot learning with Sentence Transformers☆2,699Dec 11, 2025Updated 3 months ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆23Mar 12, 2024Updated 2 years ago
- Label shift estimation for transfer difficulty with Familiarity.☆10Feb 4, 2025Updated last year
- The CleanCoNLL dataset from our EMNLP 2023 paper where we corrected annotation errors and inconsistencies in CoNLL-03.☆25Jul 2, 2024Updated last year
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets☆4,905Updated this week
- Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024☆2,961Updated this week
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…☆160Jul 14, 2025Updated 8 months ago
- Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XM…☆5,569Sep 12, 2025Updated 6 months ago
- Visualize expert firing frequencies across sentences in the Mixtral MoE model☆18Dec 22, 2023Updated 2 years ago
- The NLP Bias Identification Toolkit☆39Sep 8, 2023Updated 2 years ago
- ☆17Feb 16, 2024Updated 2 years ago
- Temporary remove unused tokens during training to save ram and speed.☆23Jun 15, 2025Updated 9 months ago
- A Python library for calculating a large variety of metrics from text☆361Updated this week
- 🚀🤗 A collection of templates for Hugging Face Spaces☆35Oct 9, 2023Updated 2 years ago
- Easily embed, cluster and semantically label text datasets☆600Mar 28, 2024Updated last year
- MoodCat😼 classifies the mood of English sentences.☆14Jun 19, 2022Updated 3 years ago
- A BERT-based application for reusable text classification at scale☆38Jul 23, 2023Updated 2 years ago
- Combining encoder-based language models☆11Nov 11, 2021Updated 4 years ago
- Highly concurrent and fast content processing for Mighty Inference Server☆10Feb 6, 2023Updated 3 years ago
- Corresponding code repo for the paper at COLING 2020 - ARGMIN 2020: "DebateSum: A large-scale argument mining and summarization dataset"☆55Dec 2, 2021Updated 4 years ago
- Simply, faster, sentence-transformers☆144Aug 27, 2024Updated last year
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆3,131Mar 16, 2026Updated last week
- Train LLM on Hugging Face infra☆70Nov 13, 2025Updated 4 months ago
- PathPiece tokenizer☆14Nov 10, 2024Updated last year
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆81Feb 10, 2026Updated last month
- Research into identifying and correcting incorrect labels in the CoNLL-2003 corpus.☆12May 11, 2021Updated 4 years ago
- ☆31Nov 14, 2024Updated last year
- Experimental tl;dr summaries for datasets on the Hugging Face Hub!☆10Apr 4, 2024Updated last year
- Fast Multimodal Semantic Deduplication & Filtering☆906Jan 20, 2026Updated 2 months ago