Index of URLs to pdf files all over the internet and scripts
☆25May 2, 2023Updated 2 years ago
Alternatives and similar repositories for CCpdf
Users that are interested in CCpdf are comparing it to the libraries listed below
Sorting:
- Training data for the NLPContributionGraph Shared Task 11 at SemEval-2021☆14Jan 11, 2021Updated 5 years ago
- The codebase for our ACL2023 paper: Did You Read the Instructions? Rethinking the Effectiveness of Task Definitions in Instruction Learni…☆30Jul 16, 2023Updated 2 years ago
- Tool for sentiment analysis annotation☆13Mar 26, 2025Updated 11 months ago
- (Competition) 6th -- Scene-Text-Detection-and-Recognition.☆10Jun 14, 2022Updated 3 years ago
- The WordScape repository contains code for the WordScape pipeline to create datasets to train document understanding models.☆39Dec 7, 2023Updated 2 years ago
- Curated list of awesome datasets for various table understanding tasks☆18Sep 5, 2025Updated 6 months ago
- Analyse des Pegida facebook Korpus☆10Jan 31, 2015Updated 11 years ago
- I have created a dataset of Image-Text-Pairs by using the cosine similarity of the CLIP embeddings of the image & it's caption derrived f…☆16Apr 22, 2021Updated 4 years ago
- Tensorflow implementation of the paper "Fast Compressive Sensing Using Generative Model with Structed Latent Variables"☆10Apr 7, 2020Updated 5 years ago
- Original VinVL visual backbone with simplified APIs to easily extract features, boxes, object detections, in a few lines of Python code.☆11Nov 27, 2022Updated 3 years ago
- multimodal document analysis☆166Updated this week
- DSIR large-scale data selection framework for language model training☆270Apr 7, 2024Updated last year
- Demo of the unit_scaling library, showing how a model can be easily adapted to train in FP8.☆46Jul 17, 2024Updated last year
- ☆10Oct 20, 2021Updated 4 years ago
- ☆10Feb 12, 2024Updated 2 years ago
- Skillset Challenge for the Apprenticeship Program, June 2021.☆10Jan 8, 2022Updated 4 years ago
- Library for the Test-based Calibration Error (TCE) metric to quantify the degree to classifier calibration.☆13Sep 15, 2023Updated 2 years ago
- ☆14Aug 31, 2023Updated 2 years ago
- incremental symbol learning for natural language understanding☆10Jun 12, 2023Updated 2 years ago
- A cross-platform, OpenGL terminal emulator.☆11Mar 14, 2024Updated last year
- Code for "Contextualized Embeddings in Named-Entity Recognition", ECIR 2020☆13Jul 25, 2024Updated last year
- Code for the paper "Modeling Information Change in Science Communication with Semantically Matched Paraphrases" from EMNLP 2022☆13Oct 20, 2022Updated 3 years ago
- An approximate implementation of the OpenAI paper - An Empirical Model of Large-Batch Training for MNIST☆11Nov 19, 2022Updated 3 years ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Aug 12, 2023Updated 2 years ago
- Repository for Skill Set Optimization☆14Jul 26, 2024Updated last year
- ☆15Feb 4, 2021Updated 5 years ago
- ☆10Mar 20, 2021Updated 4 years ago
- We enable LLM with personalization capability☆11Nov 16, 2023Updated 2 years ago
- Convert obsidian md file into confluence pages☆14Jan 10, 2025Updated last year
- Code for the UCL Statistical NLP course☆11Jan 19, 2015Updated 11 years ago
- Coalesce 2022 Python models demo with Databricks. Not actively maintained.☆13Dec 4, 2024Updated last year
- ☆15Nov 29, 2020Updated 5 years ago
- JSON Schema format for storing datasets details, documents processed contents, and documents annotations in the document understanding do…☆13Nov 5, 2024Updated last year
- ☆14Jun 28, 2023Updated 2 years ago
- Dockerfile to build the excellent OpenPose software from CMU.☆11Apr 28, 2022Updated 3 years ago
- Defeasible Natural Language Inference☆13Dec 4, 2020Updated 5 years ago
- A template primarily for PhD theses but also suitable for Bachelor's or Master's theses☆11Nov 10, 2021Updated 4 years ago
- Data and code: "Answering legal questions from laymen in German civil law system", Büttner & Habernal, EACL'24☆14Mar 2, 2024Updated 2 years ago
- ☆11Jun 29, 2021Updated 4 years ago