JustAnotherArchivist / little-thingsLinks
The little things give you away... A collection of various small helper stuff – Mirror repo only, no longer kept in sync, refer to gitea.arpa.li instead
☆25Updated 4 years ago
Alternatives and similar repositories for little-things
Users that are interested in little-things are comparing it to the libraries listed below
Sorting:
- Extract list of results from search engines pages as CSV with a bookmarklet directly within the browser☆24Updated 3 months ago
- H2O is a web app for creating and reading open educational resources, primarily in the legal field☆39Updated last week
- how hard is it to get a list of all local news sites in the United States (LOL)☆8Updated 5 years ago
- Awesome list dedicated to digital and data preservation tools, sources, services and so on.☆26Updated 2 years ago
- Chrome extension that uses Memento to indicate that a page a user is viewing on the live web has an archived copy and to give the user ac…☆54Updated 3 weeks ago
- A financial disclosure data extraction tool.☆16Updated last year
- Some tools to help analyze the twitter archive☆62Updated last month
- A Memento Aggregator CLI and Server in Go☆65Updated 4 months ago
- Browser version of Hyphe (WIP)☆31Updated 2 months ago
- Matrix-based News Aggregation to Explore Media Bias☆20Updated 7 years ago
- 😎 A community-curated list of awesome lawtech software and learning resources for legal technology and design.☆26Updated 5 years ago
- Information extraction and interactive visualization of textual datasets for investigative data-driven journalism and eDiscovery☆56Updated last year
- React components to render differences between captures at the Wayback Machine☆35Updated 2 months ago
- A helper library full of URL-related heuristics.☆70Updated last month
- Bot for operating snscrape in #archivebot on efnet☆11Updated 5 years ago
- Track changes to GraphQL APIs by git scraping their schemas☆29Updated 3 months ago
- A utility that searches for RSS feeds from a CSV list of URLs☆11Updated 4 years ago
- CommonCrawl keyword scanner. Time for month of CC data on EC2 c5.18xlarge instance for hundreds of keywords takes about 3 hours. LLM (BER…☆15Updated 2 years ago
- Digital Preservation of HTTP in documentary heritage.☆22Updated 2 years ago
- Phantombuster's SDK☆14Updated 9 months ago
- A list of things related to software, literature, and other content for 🕣 Memento☆99Updated last year
- A demonstration transnational register of beneficial ownership data from the UK, Denmark, Slovakia and Armenia☆17Updated 8 months ago
- Generate a list of your GitHub stars by topic - automatically!☆78Updated 2 years ago
- 📚 A compilation of research relevant to Data Together's efforts tackling the general problem of data resilience & interactivity☆95Updated 6 years ago
- Curated list of my GitHub stars☆65Updated 5 years ago
- Javascript/Node wrapper around Mozilla's Readability library so that ArchiveBox can call it as a oneshot CLI command to extract each page…☆40Updated 10 months ago
- Public API client for GETTR, a "non-bias [sic] social network," designed for data archival and analysis.☆93Updated last month
- Tools for bulk indexing of WARC/ARC files on Hadoop, EMR or local file system.☆46Updated 7 years ago
- CLI implementation of httpreserve that can test links and retrieve internet archive replacements☆10Updated 7 months ago
- Everyting you need to know about Aquila Network Neural Search Ecosystem. Official repositories, client libraries, ecosystem projects, boi…☆32Updated 3 years ago