commoncrawl / cc-citations
Scientific articles using or citing Common Crawl data
☆13Updated 3 weeks ago
Alternatives and similar repositories for cc-citations:
Users who are interested in cc-citations are comparing it to the libraries listed below
- Open Access PDF harvester, metadata aggregator and full-text ingester☆59Updated 9 months ago
- Factored Cognition Primer: How to write compositional language model programs☆48Updated last year
- Demo frontend for OpenAlex☆18Updated this week
- Various Jupyter notebooks about Common Crawl data☆50Updated this week
- Tools to construct and process webgraphs from Common Crawl data☆85Updated 3 weeks ago
- Wrapper for the Crossref Events API☆18Updated last year
- Science GPT is an advanced AI-powered tool designed to generate text based on the content of uploaded scientific PDF files. Leveraging th…☆13Updated last year
- LLM prompt language based on Jinja. Banks provides tools and functions to build prompts text and chat messages from generic blueprints. I…☆77Updated 3 weeks ago
- Solve Geometric & Graph Problems with Large Language Models☆28Updated last year
- Unstract's interface to LLMs, Embeddings and VectorDBs.☆18Updated 6 months ago
- An instruction tuned large language model with extra support for poetry and verse generation☆19Updated last year
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO…☆47Updated 6 months ago
- spaCy entry points for Curated Transformers☆26Updated 4 months ago
- A browser extension providing Open Access bibliographical services☆14Updated 2 years ago
- PDF parser powered by grobid☆25Updated 6 months ago
- Nougat is Meta AI's OCR model designed to transcribe scientific PDFs into an easy-to-use Markdown format.☆22Updated last year
- OVALChat is a customizable Web app aimed at conducting user studies with chatbots☆29Updated last year
- This network-graph-based literature review tool uses the open-source version of Neo4j with Jupyter Notebooks written in Python to import …☆10Updated last year
- 🦦 weasel: A small and easy workflow system☆75Updated 7 months ago
- The OpenCitations RDF Resource Browser☆11Updated last month
- A News Article Collection Library☆22Updated last year
- LLM plugin for embeddings using sentence-transformers☆48Updated last week
- A simple library for training named entity recognition model from partially annotated data☆23Updated last year
- The GitBook documentation site for OpenAlex☆18Updated last month
- A bibliographic reference correction service☆18Updated 2 years ago
- 👩🤝🤖 A curated list of datasets for large language models (LLMs), RLHF and related resources (continually updated)☆23Updated last year