Index of URLs to pdf files all over the internet and scripts
☆24May 2, 2023Updated 3 years ago
Alternatives and similar repositories for CCpdf
Users that are interested in CCpdf are comparing it to the libraries listed below.
- Training data for the NLPContributionGraph Shared Task 11 at SemEval-2021☆14Jan 11, 2021Updated 5 years ago
- Web archiving utility library☆11May 5, 2026Updated last month
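- Analysis of the Pegida Facebook corpus☆10Jan 31, 2015Updated 11 years ago
- weixin125: design and implementation of a personal health data management system, WeChat mini program + SSM backend, graduation project source code case design☆11Feb 28, 2024Updated 2 years ago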
- The code related to the baselines from NeurIPS 2021 paper "DUE: End-to-End Document Understanding Benchmark."☆36Mar 2, 2023Updated 3 years ago
- We enable LLM with personalization capability☆11Nov 16, 2023Updated 2 years ago
- Dataset used to evaluate Skill Extraction systems based on the ESCO skills taxonomy.☆17Jul 18, 2024Updated last year
- a SplineCamera react component☆14Feb 18, 2024Updated 2 years ago
- Benchmark dataset for the evaluation of scientific article representations on the task of citation recommendation across various scientif…☆12Oct 21, 2022Updated 3 years ago
- SVG Differentiable Rendering: Generating vector graphics using neural networks. Support: text-to-SVG, Image-to-SVG, SVG Editing.☆63Feb 25, 2025Updated last year
- multimodal document analysis☆166May 14, 2026Updated last month
- Original VinVL visual backbone with simplified APIs to easily extract features, boxes, object detections, in a few lines of Python code.☆12Nov 27, 2022Updated 3 years ago
- I have created a dataset of Image-Text-Pairs by using the cosine similarity of the CLIP embeddings of the image & its caption derived f…☆16Apr 22, 2021Updated 5 years ago
- utilities for loading and running text embeddings with onnx☆45Aug 16, 2025Updated 9 months ago
- Applying GANs in improving question generation and answering☆12Oct 1, 2017Updated 8 years ago
- Structured Multi-task Learning for Molecular Property Prediction, AISTATS'22 (https://proceedings.mlr.press/v151/liu22e.html)☆14Jul 6, 2022Updated 3 years ago
- Curated list of awesome datasets for various table understanding tasks☆19Sep 5, 2025Updated 9 months ago
- Record animations on HTML5 canvas☆14Apr 16, 2024Updated 2 years ago
- Personalized Response Generation via Generative Split Memory Network☆12Sep 6, 2021Updated 4 years ago
- pytorch crnn with centerloss to solve the near word problem☆16Jan 27, 2022Updated 4 years ago
- Collecting good beginner tasks and project ideas.☆16Apr 23, 2018Updated 8 years ago
- [Paper] Code for the EMNLP2023 (Findings) paper "Global Structure Knowledge-Guided Relation Extraction Method for Visually-Rich Document"☆17Dec 1, 2023Updated 2 years ago
- Official Implementation of "C5T5: Controllable Generation of Organic Molecules with Transformers"☆23Dec 17, 2021Updated 4 years ago
- GC4LM: A Colossal (Biased) language model for German☆13May 2, 2021Updated 5 years ago
- A simple wrapper for lmdb. Support dict-like operations.☆23Apr 20, 2023Updated 3 years ago
- DSIR large-scale data selection framework for language model training☆274Apr 7, 2024Updated 2 years ago
- IPAdic packaged for easy use from Python.☆24Oct 31, 2021Updated 4 years ago
- Python toolbox to load, parse and process Official Journals of the European Union (EU).☆23May 3, 2024Updated 2 years ago
- SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI2023)☆106Mar 31, 2025Updated last year
- Scripts for downloading and pre-processing the `proof-pile`, a high quality dataset of mathematical text and code.☆22Nov 26, 2022Updated 3 years ago
- Scaffolding for multi-user Elm applications via Gulp, Express, and SockJS.☆11Apr 10, 2015Updated 11 years ago
- ☆10Apr 4, 2023Updated 3 years ago
- A script for collecting the PubMed Central dataset in a language modelling friendly format.☆26Feb 16, 2021Updated 5 years ago
- ☆10Jan 20, 2024Updated 2 years ago
- Code for the paper "Text2Math: End-to-end Parsing Text into Math Expressions" accepted by EMNLP 2019☆16Aug 20, 2019Updated 6 years ago
- Program Translator AI built on Pytorch☆15Dec 19, 2019Updated 6 years ago
- A Living Papers article starter template.☆24Nov 3, 2023Updated 2 years ago
- Data and code for the paper "CiteWorth: Cite-Worthiness Detection for Improved Scientific Document Understanding"☆14Sep 8, 2022Updated 3 years ago
- A template primarily for PhD theses but also suitable for Bachelor's or Master's theses☆11Nov 10, 2021Updated 4 years ago