Webhose / free-news-datasets
Weekly free datasets from global news sites
☆24, updated last week
Alternatives and similar repositories for free-news-datasets
Users who are interested in free-news-datasets are comparing it to the repositories listed below.
- Chrome Extension for exploring Hugging Face datasets ☆49, updated 11 months ago
- Common crawl extractor ☆78, updated last year
- Automated Document Intelligence Workflow ☆26, updated 8 months ago
- VerifAI initiative to build open-source easy-to-deploy generative question-answering engine that can reference and verify answers for cor… ☆76, updated 6 months ago
- This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or Larg… ☆24, updated 6 months ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min… ☆26, updated 9 months ago
- LLM plugin for clustering embeddings ☆82, updated last year
- Tools to construct and process Common Crawl webgraphs ☆93, updated last week
- TextGraphs + LLMs + graph ML for entity extraction, linking, ranking, and constructing a lemma graph ☆25, updated last year
- A curated list of datasets for large language models (LLMs), RLHF and related resources (continually updated) ☆24, updated 2 years ago
- Automated Qualitative Analysis of LLMs (ICLR 2025) ☆43, updated 2 months ago
- LLM-powered autonomous agent with hierarchical task management ☆50, updated 2 years ago
- Example implementation of Iteration of Thought. Give it a star if you like the project ☆43, updated 8 months ago
- GPT4 based personalized ArXiv paper assistant bot ☆10, updated last year
- Professional Wargaming LLM Toolbox ☆14, updated last month
- Accompanying code and SEP dataset for the "Can LLMs Separate Instructions From Data? And What Do We Even Mean By That?" paper. ☆54, updated 5 months ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024) ☆75, updated 10 months ago
- Visualize any repo or codebase into diagram or animation ☆20, updated 10 months ago
- Python package that adds IntelligentGraph capabilities to RDFLib RDF graph package ☆55, updated last year
- Quick Notebook Tutorials ☆36, updated last month
- Entity resolution, also known as Data Matching or Record linkage is the task of finding a data set that refer to the same or similar real… ☆29, updated 4 months ago
- LLM plugin for models hosted by Anyscale Endpoints ☆35, updated last year
- Thorn in a HaizeStack test for evaluating long-context adversarial robustness. ☆26, updated last year
- The Official NewsCatcher News API V2 SDK for Python ☆20, updated 11 months ago
- Deploy a light and full OpenAI API for production with vLLM to support /v1/embeddings with all embedding models. ☆42, updated last year
- A cog model for the all-mpnet-base-v2 sentence-transformers embedding model. ☆15, updated last year
- Tutorial and template for a semantic search app powered by the Atlas Embedding Database, Langchain, OpenAI and FastAPI ☆115, updated last year
- AI_Powered_Dev_Search_Engine ☆12, updated last year
- Pivotal Token Search ☆123, updated last month
- Prototyping a question and answer bot over PDFs ☆39, updated last year