NVIDIA / NeMo-Inspector
A tool for analyzing LLM generations.
☆40Updated 3 weeks ago
Alternatives and similar repositories for NeMo-Inspector
Users who are interested in NeMo-Inspector are comparing it to the libraries listed below.
- ☆31Updated last year
- QAlign is a new test-time alignment approach that improves language model performance by using Markov chain Monte Carlo methods.☆24Updated last week
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆67Updated last year
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆58Updated 3 weeks ago
- ☆55Updated last year
- Source code for the collaborative reasoner research project at Meta FAIR.☆103Updated 6 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆100Updated last week
- Codebase accompanying the Summary of a Haystack paper.☆79Updated last year
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆109Updated 4 months ago
- ☆55Updated 8 months ago
- Official repository for the paper "NeuZip: Memory-Efficient Training and Inference with Dynamic Compression of Neural Networks". This rep…☆59Updated last year
- ☆49Updated 9 months ago
- Lego for GRPO☆30Updated 5 months ago
- Supercharge huggingface transformers with model parallelism.☆77Updated 3 months ago
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆76Updated 11 months ago
- ☆48Updated last year
- Lightweight toolkit package to train and fine-tune 1.58bit Language models☆97Updated 5 months ago
- Aioli: A unified optimization framework for language model data mixing☆28Updated 9 months ago
- ☆80Updated this week
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆34Updated 6 months ago
- An open source replication of the strawberry method that leverages Monte Carlo Search with PPO and/or DPO☆29Updated this week
- Nexusflow function call, tool use, and agent benchmarks.☆29Updated 10 months ago
- ☆36Updated 3 months ago
- ☆119Updated last year
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆63Updated 11 months ago
- Official Repository for Task-Circuit Quantization☆24Updated 5 months ago
- Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles☆59Updated 6 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆60Updated last year
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's PyTorch Lightning suite.☆35Updated last year
- ☆62Updated 3 months ago