All in one PDF Parser Toolkit
☆17Sep 15, 2023Updated 2 years ago
Alternatives and similar repositories for pdf_parser
Users that are interested in pdf_parser are comparing it to the libraries listed below
Sorting:
- PDF parsing toolkit for preparing academic text corpus☆63Jul 12, 2024Updated last year
- GAKG is a multimodal Geoscience Academic Knowledge Graph (GAKG) framework by fusing papers' illustrations, text, and bibliometric data.☆53Jul 11, 2024Updated last year
- Code and datasets for paper "GeoGalactica: A Scientific Large Language Model in Geoscience"☆40Jun 27, 2024Updated last year
- Geoscience Knowledge Manager☆22Jan 29, 2026Updated last month
- Code and datasets for paper "K2: A Foundation Language Model for Geoscience Knowledge Understanding and Utilization" in WSDM-2024☆209Jun 5, 2024Updated last year
- ☆12Aug 21, 2024Updated last year
- This is the repository for the Master of Science thesis titled "GAN-based Matrix Factorization for Recommender Systems".☆10Aug 10, 2020Updated 5 years ago
- Common conventions for building applications on the GeoDeepDive infrastructure☆16Jul 6, 2018Updated 7 years ago
- A Python module for mapping multiple high-dimensional datasets into a common low-dimensional space.☆10Mar 29, 2018Updated 7 years ago
- Knowledge-driven stochastic modeling of geological geometry features conditioned on drillholes, outcrops, and geophysics☆27Feb 19, 2025Updated last year
- Fast and memory-efficient exact attention ported to rocm☆13Dec 1, 2023Updated 2 years ago
- Code for EMNLP'20 paper "When Hearst Is not Enough: Improving Hypernymy Detection from Corpus with Distributional Models"☆11Nov 10, 2020Updated 5 years ago
- 📖 Full Stack Practice of the Large Language Model Training @ RLChina 2024☆40Oct 15, 2024Updated last year
- 本库为《数据结构与算法》(林劼、刘震、陈端宾、 戴波主编)中的算法全实现,在学习过程中发现无论是书还是ppt中都有不少错误或者不清晰之处,故自行全实现☆12Dec 26, 2019Updated 6 years ago
- 基于simcse的中文句向量生成☆16Jun 8, 2022Updated 3 years ago
- Codes of Modeling Two-Way Selection Preference for Person-Job Fit☆16Dec 25, 2022Updated 3 years ago
- The Science knowledge graph ontologies, a.k.a. SKGO, is a suite of OWL ontology models to capture the knowledge of scientific research da…☆16Jul 3, 2025Updated 8 months ago
- 一个用于训练句子embedding的工具,支持Cosent以及Simcse、infonce☆21Jun 17, 2025Updated 9 months ago
- 不用tensorflow estimator,分别采用字mask和wwm mask在中文领域内finetune bert模型☆24Apr 15, 2020Updated 5 years ago
- Knowledge extraction from semi-structured web.☆13Mar 25, 2024Updated last year
- unofficial impelement of the webformer: The Web-page Transformer for Structure Information Extraction☆13Apr 20, 2023Updated 2 years ago
- Citation Extraction and Classifier☆16Updated this week
- Learning Ontologies Via Embeddings☆12Jul 6, 2023Updated 2 years ago
- A curated, non-exhaustive list of papers in handwritten signature verification using ML/DL techniques.☆24Oct 9, 2022Updated 3 years ago
- ☆16Dec 7, 2021Updated 4 years ago
- Complexity Based Prompting for Multi-Step Reasoning☆17Mar 10, 2023Updated 3 years ago
- ☆10Mar 18, 2023Updated 3 years ago
- Document Classification on COVID-19 Literature using the LitCovid collection and the Hedwig library.☆16Oct 26, 2024Updated last year
- ☆11Oct 6, 2021Updated 4 years ago
- mSimCSE: Multilingual SimCSE☆33Nov 14, 2022Updated 3 years ago
- Advanced Semantics for Commonsense Knowledge Extraction (WWW 2021)☆25Jan 3, 2023Updated 3 years ago
- CO-SKEL dataset, Co-skeletonization and Skeletonization codes for "Object Co-skeletonization with Co-segmentation" paper published in CVP…☆13Jun 16, 2018Updated 7 years ago
- BERT-based nominal Semantic Role Labeling (SRL), both using the Nombank dataset and the Ontonotes dataset.☆20Dec 28, 2022Updated 3 years ago
- This program categorizes a given query's "search intent" via the kinds of SERP features present for the query.☆23Apr 20, 2019Updated 6 years ago
- PyTorch reimplementation of "Keyword Transformer: A Self-Attention Model for Keyword Spotting"☆16Jul 23, 2021Updated 4 years ago
- ☆10Apr 3, 2024Updated last year
- ☆22May 1, 2023Updated 2 years ago
- StyleFlow: Attribute-conditioned Exploration of StyleGAN-generated Images using Conditional Continuous Normalizing Flows (ACM TOG 2021)☆16Apr 1, 2021Updated 4 years ago
- 第二届阿里巴巴大数据智能云上编程大赛冠军解决方案☆31Oct 11, 2019Updated 6 years ago