seljaseppala / eu_corpus_compiler
EU Regulation Corpus Compiler: A pipeline of Python programs that downloads EU regulatory documents from the EUR-Lex portal via the CELLAR endpoint, using the response to a SPARQL query sent to the EU SPARQL endpoint.
☆15 Updated 3 years ago
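The query-then-download flow described above can be sketched roughly as follows. This is a minimal illustration, not the repository's actual code: the endpoint URL, the `cdm:resource_legal_id_celex` property, the `format` request parameter, and all function names are assumptions introduced here. The sketch only builds the request and parses a SPARQL JSON response; the actual HTTP calls to the SPARQL endpoint and to CELLAR are left out.

```python
import json
from urllib.parse import urlencode

# Assumed endpoint URL for the EU SPARQL service (not stated in the listing).
SPARQL_ENDPOINT = "http://publications.europa.eu/webapi/rdf/sparql"


def build_query(limit=10):
    """Return a SPARQL query selecting works that carry a CELEX identifier.

    The CDM property used here is an assumption for illustration.
    """
    return (
        "PREFIX cdm: <http://publications.europa.eu/ontology/cdm#>\n"
        "SELECT DISTINCT ?work ?celex WHERE {\n"
        "  ?work cdm:resource_legal_id_celex ?celex .\n"
        "}\n"
        f"LIMIT {int(limit)}"
    )


def build_request_url(query):
    """Encode the query as a GET request URL (SPARQL protocol `query` parameter).

    The `format` parameter is an assumed, server-specific way to ask for JSON.
    """
    params = {"query": query, "format": "application/sparql-results+json"}
    return SPARQL_ENDPOINT + "?" + urlencode(params)


def extract_work_uris(response_text):
    """Pull the ?work URIs out of a SPARQL JSON results document."""
    data = json.loads(response_text)
    return [row["work"]["value"] for row in data["results"]["bindings"]]


# Hypothetical response body, used here instead of a live request.
SAMPLE_RESPONSE = json.dumps({
    "head": {"vars": ["work", "celex"]},
    "results": {"bindings": [
        {
            "work": {"type": "uri",
                     "value": "http://publications.europa.eu/resource/cellar/example-id"},
            "celex": {"type": "literal", "value": "EXAMPLE-CELEX"},
        }
    ]},
})

if __name__ == "__main__":
    print(build_request_url(build_query(limit=5)))
    print(extract_work_uris(SAMPLE_RESPONSE))
```

Each URI returned by `extract_work_uris` would then be requested from CELLAR to retrieve the document itself; how the repository performs that step is not described in this listing.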
Alternatives and similar repositories for eu_corpus_compiler:
Users who are interested in eu_corpus_compiler are comparing it to the repositories listed below
- Mining Legal Arguments in Court Decisions - Data and software ☆65 Updated last year
- Collection of Datasets for Legal Text Processing ☆83 Updated last year
- Metadata Extractor & Loader (MEL) ■ The NLP-NER Toolkit (TNNT) ☆22 Updated last year
- GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction ☆63 Updated 5 months ago
- spaCy extension for Visual Studio Code ☆27 Updated last year
- Semantic Segmentation of Legal texts that labels sentences with one of 7 rhetorical roles. ☆68 Updated 7 months ago
- A simple web application for searching Word2Vec embeddings derived from approximately 2,000 law reports published by The Incorporated… ☆26 Updated 2 years ago
- Run OCR, extract information from documents and classify them. In addition, annotate documents and build custom NLP and computer vision m… ☆61 Updated this week
- A simple library for segmenting legal texts ☆15 Updated last year
- ☆22 Updated 7 months ago
- LegalCrawler: A tool for automated scraping of English legal corpora ☆53 Updated 2 years ago
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models. ☆104 Updated 8 months ago
- An EUR-Lex parser for Python. ☆29 Updated 6 months ago
- Information extraction from English and German texts based on predicate logic ☆135 Updated last year
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progr… ☆26 Updated 3 weeks ago
- Python package that adds IntelligentGraph capabilities to RDFLib RDF graph package ☆55 Updated last year
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy ☆63 Updated 10 months ago
- ☆18 Updated 3 years ago
- Search through Facebook Research's PyTorch BigGraph Wikidata-dataset with the Weaviate vector search engine ☆31 Updated 3 years ago
- Python text processing, pattern matching, and NLP framework ☆63 Updated last year
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO… ☆46 Updated 5 months ago
- Tool to apply Legal Matter Specification Standard (LMSS) to documents ☆12 Updated 5 months ago
- spaCy pipeline component for generating spaCy KnowledgeBase Alias Candidates for Entity Linking ☆86 Updated 2 years ago
- Get annotation suggestions for the INCEpTION text annotation platform from spaCy, Sentence BERT, scikit-learn and more. Runs as a web-ser… ☆42 Updated 3 months ago
- Named entity recognition for the legal domain ☆41 Updated 3 years ago
- PyTorch implementation of a BiLSTM model for the Wikification project. ☆18 Updated 4 years ago
- XAI based human-in-the-loop framework for automatic rule-learning. ☆47 Updated 6 months ago
- A Benchmark of PDF Information Extraction Tools using a Multi-Task and Multi-Domain Evaluation Framework for Academic Documents ☆22 Updated 2 years ago
- This repository serves as a collection of scrapers procuring and structuring various legal datasets ☆16 Updated last year
- A list of selected resources, methods, and tools dedicated to legal data schemes and ontologies. ☆96 Updated 9 months ago