jbencina / dojreleases
Python scraper of DOJ press releases
☆10Updated 6 years ago
Alternatives and similar repositories for dojreleases:
Users that are interested in dojreleases are comparing it to the libraries listed below
- Code and data for Teddy https://arxiv.org/abs/2001.05171.☆15Updated 2 years ago
- Data and code related to the report "Truth, Lies, and Automation: How Language Models Could Change Disinformation"☆27Updated 3 years ago
- The projects lets you extract glossary words and their definitions from a given piece of text automatically using NLP techniques☆29Updated 4 years ago
- [WIP] Behold, semantic-search, built over sentence-transformers to make it easy for search engineers to evaluate, optimise and deploy mod…☆15Updated last year
- Using Machine Learning to Create Funny Memes☆25Updated 2 years ago
- NSS Capstone project to use natural language modeling, classification, and information extraction to get the exact employee count values …☆15Updated 6 years ago
- ☆10Updated 5 years ago
- An ongoing series of notebooks aimed at helping fellow NLP enthusiasts think about applying new tools and techniques to practical tasks.☆18Updated 4 years ago
- A collection of code, data and information related to our audit of TikTok.☆21Updated last month
- View count predictor using titles, descriptions, and thumbnails.☆34Updated 7 years ago
- This repository implements DSPy programs to tasks in Indian Languages☆13Updated last year
- GPT-2 finetuned on dril twetes☆15Updated 5 years ago
- ☆11Updated 3 years ago
- ☆18Updated last year
- Topic modelling with SpaCy, Gensim and Textacy☆19Updated 7 years ago
- Data and Code for "The Values Encoded in Machine Learning Research"☆44Updated 2 years ago
- ☆22Updated 3 years ago
- Accompanying code for the paper: Totally Looks Like - How Humans Compare, Compared to Machines, by Amir Rosenfeld, Markus D. Solbach and …☆38Updated 6 years ago
- News API - fetch news from CommonCrawl, parse with NewsPlease, enrich with pre-trained machine-learning models, to structured searchable …☆28Updated 2 years ago
- ☆16Updated 7 years ago
- Training a model without a dataset for natural language inference (NLI)☆25Updated 4 years ago
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆22Updated 2 years ago
- In this notebook, I implemented a script to transcribe YouTube videos (and audio files in general) using Google's speech-to-text API.☆13Updated 2 years ago
- Word embeddings for job postings☆13Updated 2 years ago
- Extracting Entities with Limited Evidence☆16Updated 2 years ago
- NLP based Classification Model that predicts a person's personality type as one of the 16 Myers Briggs personality types. Extremely chall…☆32Updated 2 years ago
- Repo containing the Twitter preprocessor module, developed by the AUTH OSWinds team☆28Updated 4 years ago
- This Python package can be used to systematically extract multiple data elements (e.g., title, keywords, text) from news sources around t…☆33Updated 2 years ago
- Text preprocessing tools in python.☆27Updated 7 years ago
- BirdSpotter is a python package which provides an influence and bot detection toolkit for twitter.☆19Updated 4 years ago