susumuota / arxiv-reddit-summary
Summarize the top 30 most popular arXiv papers on Reddit, Hacker News and Hugging Face in the last 30 days. And post them to Slack, Twitter and Bluesky.
☆19Updated 2 weeks ago
Alternatives and similar repositories for arxiv-reddit-summary:
Users that are interested in arxiv-reddit-summary are comparing it to the libraries listed below
- ☆11Updated 2 months ago
- Efficiently computing & storing token n-grams from large corpora☆23Updated 6 months ago
- Low-Rank Adaptation of Large Language Models clean implementation☆8Updated last year
- An awesome list of AnthropicAI' Claude model☆51Updated 2 years ago
- An open, comprehensive catalog of scholarship, connecting papers, authors, institutions, and journals.☆10Updated last year
- Minimum Description Length probing for neural network representations☆19Updated 2 months ago
- A text-to-SQL prototype on the northwind sqlite dataset☆12Updated 7 months ago
- Public reports detailing responses to sets of prompts by Large Language Models.☆30Updated 3 months ago
- Plugin for LLM adding a Markov chain generating model☆19Updated 9 months ago
- The official Python library for Formulaic☆16Updated last year
- The first AI artist☆32Updated 2 years ago
- A dataset of alignment research and code to reproduce it☆77Updated last year
- Training hybrid models for dummies.☆20Updated 3 months ago
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆30Updated 7 months ago
- ☆22Updated 11 months ago
- Python tools☆12Updated last year
- REBUS: A Robust Evaluation Benchmark of Understanding Symbols☆13Updated 8 months ago
- A clone of OpenAI's Tokenizer page for HuggingFace Models☆45Updated last year
- YouTube Transcript Cleaner is a simple web-based application that improves the readability of YouTube transcripts.☆25Updated last month
- Datasette enrichment for analyzing row data using OpenAI's GPT models☆19Updated 11 months ago
- ☆22Updated last year
- Command-line script for inferencing from models such as LLaMA, in a chat scenario, with LoRA adaptations☆33Updated last year
- A client library for LAION's effort to filter CommonCrawl with CLIP, building a large scale image-text dataset.☆33Updated 2 years ago
- Turn any collection of files into a dataset☆45Updated 2 years ago
- Accompanying codebase for neuroscope.io, a website for displaying max activating dataset examples for language model neurons☆12Updated 2 years ago
- A new way to generate large quantities of high quality synthetic data (on par with GPT-4), with better controllability, at a fraction of …☆22Updated 6 months ago
- Run embedding models using ONNX☆32Updated last year
- Get deterministic output in any format like json from any LLM.☆18Updated 2 years ago
- LLMs sitting on a council together to decide, by consensus, who among them is the best.☆14Updated 2 weeks ago
- assign color hues to a collection of text fragments based on embeddings☆20Updated 10 months ago