NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer
☆48Mar 29, 2024Updated 2 years ago
Alternatives and similar repositories for cappy
Users that are interested in cappy are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆95Apr 21, 2026Updated 2 weeks ago
- ☆18Feb 20, 2024Updated 2 years ago
- A guide to structured generation using constrained decoding☆18Jun 9, 2024Updated last year
- Automatic Integration for Neural Spatio-Temporal Point Process models (AI-STPP) is a new paradigm for exact, efficient, non-parametric inf…☆25Oct 14, 2024Updated last year
- Official code for DAM: Dynamic Adapter Merging for Continual Video QA Learning☆15Apr 25, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Pytorch implementation of HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models☆28Mar 22, 2024Updated 2 years ago
- ☆27Jun 6, 2024Updated last year
- Latent Large Language Models☆19Aug 24, 2024Updated last year
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r…☆73Nov 4, 2025Updated 6 months ago
- Bypass browser bot detection in langchain tools☆18Apr 22, 2026Updated 2 weeks ago
- An example SvelteKit project using sveltekit-modal, with a Python server endpoint written with stable diffusion.☆19Apr 17, 2023Updated 3 years ago
- ☆18Aug 19, 2024Updated last year
- Perplexity Lite using Langgraph, Tavily, and GPT-4.☆14Jan 11, 2024Updated 2 years ago
- A framework for event based autonomous multi-agent systems.☆306Sep 18, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in Video-Language Models (ICLR 2024, Official Implementation)☆16Jan 18, 2024Updated 2 years ago
- Audio-visual diarization pipeline used for creating VoxConverse dataset☆21Jun 6, 2025Updated 11 months ago
- Official implementation of Language Models as Compilers: Simulating the Execution Of Pseudocode Improves Algorithmic Reasoning in Languag…☆23Apr 8, 2024Updated 2 years ago
- Train huggingface models on top of Prodigy annotations☆21Feb 19, 2024Updated 2 years ago
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Feb 22, 2024Updated 2 years ago
- NAACL '24 (Best Demo Paper RunnerUp) / MlSys @ NeurIPS '23 - RedCoast: A Lightweight Tool to Automate Distributed Training and Inference☆69Dec 9, 2024Updated last year
- Tuning and Evaluation of RAG pipeline. (Automated optimization to be added soon)☆264Mar 19, 2024Updated 2 years ago
- Extract valuable information from your project github Stars & Forks such as email, company, twitter and then explore it with streamlit🌟☆21Feb 8, 2024Updated 2 years ago
- Self-Alignment with Principle-Following Reward Models☆170Sep 18, 2025Updated 7 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Code Repository for the EMCL-PKDD 2021 "Multitask Recalibrated Aggregation Network for Medical Code Prediction)☆13Sep 7, 2021Updated 4 years ago
- A fast minimalistic implementation of guided generation on Apple Silicon using Outlines and MLX☆59Feb 9, 2024Updated 2 years ago
- The implementation for MLSys 2023 paper: "Cuttlefish: Low-rank Model Training without All The Tuning"☆44May 10, 2023Updated 2 years ago
- 🌏 Modular retrievers for zero-shot multilingual IR.☆30Mar 6, 2024Updated 2 years ago
- I have created a dataset of Image-Text-Pairs by using the cosine similarity of the CLIP embeddings of the image & it's caption derrived f…☆16Apr 22, 2021Updated 5 years ago
- A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew…☆59Apr 20, 2024Updated 2 years ago
- Implementation of the model: "(MC-ViT)" from the paper: "Memory Consolidation Enables Long-Context Video Understanding"☆27Apr 13, 2026Updated 3 weeks ago
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"☆92Feb 27, 2024Updated 2 years ago
- [ACL2025 Findings] Benchmarking Multihop Multimodal Internet Agents☆53Feb 27, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- We conduct a preregistered experiment to investigate whether fact checks provided by a large language model can serve as an effective mis…☆13Dec 14, 2024Updated last year
- The source code and dataset for our paper "Integrating Relation Constraints with Neural Relation Extractors" which is publicated at AAAI …☆15Mar 25, 2020Updated 6 years ago
- NFT Badge for staking on Polygon. This smart contract give a staker a NFT that represents staking period (= vesting period ) which a stak…☆11May 31, 2021Updated 4 years ago
- NavCog is an example app of blelocpp library aimed specifically for the blind to help those people “explore” the world without vision. No…☆10Jan 18, 2017Updated 9 years ago
- Repository to create traveling waves integrate special information through time☆56Aug 8, 2025Updated 9 months ago
- Code for ACL 2024 paper "Soft Self-Consistency Improves Language Model Agents"☆25Sep 11, 2024Updated last year
- ☆21May 11, 2025Updated 11 months ago