Webhose / free-news-datasets
Weekly free datasets from global news sites
☆23 Updated this week
Alternatives and similar repositories for free-news-datasets
Users interested in free-news-datasets are comparing it to the repositories listed below.
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024) ☆76 Updated 7 months ago
- A personal knowledge base that I can dump information to and help me learn ☆24 Updated last week
- spaCy entry points for Curated Transformers ☆31 Updated last week
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min… ☆26 Updated 6 months ago
- A News Article Collection Library ☆22 Updated 2 years ago
- 👩🤝🤖 A curated list of datasets for large language models (LLMs), RLHF and related resources (continually updated) ☆23 Updated 2 years ago
- A pipeline using LLMs for Knowledge Engineering, combining knowledge probing and Wikidata entity mapping. ☆37 Updated 5 months ago
- Agent-based implementation of RAG, incorporating AI agents into the RAG pipeline to orchestrate its components and perform additional act… ☆12 Updated 3 months ago
- This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or Larg… ☆22 Updated 3 months ago
- Next-generation Punkt sentence boundary detection with zero dependencies ☆17 Updated 2 months ago
- Open Access PDF harvester, metadata aggregator and full-text ingester ☆60 Updated last year
- Easiest way to build custom agents, in a no-code notion style editor, using simple macros. ☆27 Updated 6 months ago
- A swarm of LLM agents that will help you test, document, and productionize your code! ☆17 Updated last week
- Tool to apply Legal Matter Specification Standard (LMSS) to documents ☆13 Updated 9 months ago
- AI_Powered_Dev_Search_Engine ☆12 Updated last year
- MER is a software that identifies and highlights manipulative communication in text from human conversations and AI-generated responses. … ☆13 Updated 10 months ago
- TextGraphs + LLMs + graph ML for entity extraction, linking, ranking, and constructing a lemma graph ☆24 Updated last year
- GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction ☆73 Updated 10 months ago
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he… ☆31 Updated last year
- ☆21 Updated 3 months ago
- A text-to-SQL prototype on the northwind sqlite dataset ☆12 Updated 8 months ago
- A BERT-based application for reusable text classification at scale ☆38 Updated last year
- OLMost every training recipe you need to perform data interventions with the OLMo family of models. ☆30 Updated this week
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy ☆63 Updated last year
- AgentFence is an open-source platform for automatically testing AI agent security. It identifies vulnerabilities such as prompt injection… ☆12 Updated 3 months ago
- MoodCat😼 classifies the mood of English sentences. ☆14 Updated 2 years ago
- ☆67 Updated last year
- Accompanying code and SEP dataset for the "Can LLMs Separate Instructions From Data? And What Do We Even Mean By That?" paper. ☆52 Updated 2 months ago
- Python library to use Pleias-RAG models ☆53 Updated last month
- GLiNER model in a FastAPI microservice. ☆44 Updated 5 months ago