Code for "Small Models are Valuable Plug-ins for Large Language Models"
β132May 16, 2023Updated 3 years ago
Alternatives and similar repositories for SuperICL
Users that are interested in SuperICL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- π©Ί A collection of ChatGPT evaluation reports on various bechmarks.β50Mar 28, 2023Updated 3 years ago
- Code for EMNLP 2021 paper: Improving Sequence-to-Sequence Pre-training via Sequence Span Rewritingβ17Nov 30, 2021Updated 4 years ago
- All-in-one repository for Fine-tuning & Pretraining (Large) Language Modelsβ15Mar 8, 2023Updated 3 years ago
- β16Jul 20, 2023Updated 2 years ago
- Self-Alignment with Principle-Following Reward Modelsβ170Sep 18, 2025Updated 8 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Momentum Decoding: Open-ended Text Generation as Graph Explorationβ19Jan 27, 2023Updated 3 years ago
- A local search system implementation using Elasticsearch for Wikipedia data indexing and retrieval.β14May 17, 2025Updated last year
- Use the tokenizer in parallel to achieve superior accelerationβ20Mar 21, 2024Updated 2 years ago
- Papers and Related work to help learn ICL conveniently for everyone who interests.β14Feb 28, 2024Updated 2 years ago
- β284Jan 6, 2025Updated last year
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"β17Feb 22, 2024Updated 2 years ago
- About Official PyTorch implementation of "Query-Efficient Black-Box Red Teaming via Bayesian Optimization" (ACL'23)β15Jul 9, 2023Updated 2 years ago
- β18Mar 10, 2023Updated 3 years ago
- [ACL 23] CodeIE: Large Code Generation Models are Better Few-Shot Information Extractorsβ40Dec 14, 2025Updated 5 months ago
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- β13Mar 5, 2024Updated 2 years ago
- Baseline code for NAACL 2021 paper "Blow the Dog Whistle: A Chinese Dataset for Cant Understanding with Common Sense and World Knowledge"β21Jul 6, 2021Updated 4 years ago
- πΌ Official implementation of Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Expertsβ41Sep 29, 2024Updated last year
- Official Implementation of ACL2023: Don't Parse, Choose Spans! Continuous and Discontinuous Constituency Parsing via Autoregressive Span β¦β14Aug 25, 2023Updated 2 years ago
- The code of CIKM 2023 (Oral Presentation) : A Multi-Task Semantic Decomposition Framework with Task-specific Pre-training for Few-Shot NEβ¦β14Jul 19, 2024Updated last year
- β74Apr 2, 2024Updated 2 years ago
- Reading comprehension based question-answering model for news articles.β11Jun 22, 2022Updated 3 years ago
- β102Dec 22, 2023Updated 2 years ago
- self-adaptive in-context learningβ45May 5, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Co-Supervised Learning: Improving Weak-to-Strong Generalization with Hierarchical Mixture of Expertsβ16Feb 26, 2024Updated 2 years ago
- Official code for the paper: "Metadata Archaeology"β19May 10, 2023Updated 3 years ago
- ACL 2022(findings): A Sentence is Worth 128 Pseudo Tokens: A Semantic-Aware Contrastive Learning Framework for Sentence Embeddingsβ18Mar 23, 2022Updated 4 years ago
- β30Apr 6, 2026Updated last month
- [NeurIPS'24] Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Modelsβ67Dec 10, 2024Updated last year
- the instructions and demonstrations for building a formal logical reasoning capable GLMβ54Sep 3, 2024Updated last year
- https://arxiv.org/abs/2209.15162β53Jan 24, 2023Updated 3 years ago
- Repository for ACL 2022 paper Mix and Match: Learning-free Controllable Text Generation using Energy Language Modelsβ46Mar 13, 2022Updated 4 years ago
- Code for ACL2023 paper: Pre-Training to Learn in Contextβ106Jul 26, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Crawl & visualize ICLR papers and reviews.β18Nov 5, 2022Updated 3 years ago
- A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity.β88Mar 7, 2025Updated last year
- β13Feb 7, 2023Updated 3 years ago
- Paper List for In-context Learning π·β876Oct 8, 2024Updated last year
- β14Feb 26, 2024Updated 2 years ago
- Align, a general text alignment functionβ15Dec 7, 2023Updated 2 years ago
- Implementation of "Decoding-time Realignment of Language Models", ICML 2024.β21Jun 17, 2024Updated last year