AutoLLM / ArxivDigestLinks
ArXiv Digest and Personalized Recommendations using Large Language Models
☆365Updated last year
Alternatives and similar repositories for ArxivDigest
Users that are interested in ArxivDigest are comparing it to the libraries listed below
Sorting:
- GPT4 based personalized ArXiv paper assistant bot☆523Updated last year
- ☆266Updated 4 months ago
- ModuleFormer is a MoE-based architecture that includes two different types of experts: stick-breaking attention heads and feedforward exp…☆221Updated last year
- A puzzle to learn about prompting☆127Updated 2 years ago
- ☆291Updated 11 months ago
- Repo for "Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture"☆550Updated 5 months ago
- Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples"☆302Updated last year
- Build Hierarchical Autonomous Agents through Config. Collaborative Growth of Specialized Agents.☆316Updated last year
- NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day☆254Updated last year
- ☆412Updated last year
- Fast & more realistic evaluation of chat language models. Includes leaderboard.☆187Updated last year
- Scaling Data-Constrained Language Models☆334Updated 8 months ago
- Extend existing LLMs way beyond the original training length with constant memory usage, without retraining☆697Updated last year
- [NeurIPS '23 Spotlight] Thought Cloning: Learning to Think while Acting by Imitating Human Thinking☆268Updated 11 months ago
- [ICML 2024] CLLMs: Consistency Large Language Models☆391Updated 6 months ago
- Code and data for "Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs"☆464Updated last year
- ☆229Updated 9 months ago
- A curated reading list of research in Adaptive Computation, Inference-Time Computation & Mixture of Experts (MoE).☆147Updated 5 months ago
- ☆133Updated last year
- Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'☆193Updated 6 months ago
- [ICLR 2025 Spotlight] An open-sourced LLM judge for evaluating LLM-generated answers.☆366Updated 3 months ago
- Simple next-token-prediction for RLHF☆227Updated last year
- Reverse Instructions to generate instruction tuning data with corpus examples☆212Updated last year
- LOFT: A 1 Million+ Token Long-Context Benchmark☆198Updated last month
- This repository implements the chain of verification paper by Meta AI☆169Updated last year
- ☆654Updated 7 months ago
- [NeurIPS 2023 D&B] Code repository for InterCode benchmark https://arxiv.org/abs/2306.14898☆219Updated last year
- Google Deepmind's PromptBreeder for automated prompt engineering implemented in langchain expression language.☆116Updated 10 months ago
- SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks☆309Updated 7 months ago
- Annotated version of the Mamba paper☆482Updated last year