wandb / aihackercupLinks
A competition to get you started on the NeurIPS AI Hackercup
☆29Updated 11 months ago
Alternatives and similar repositories for aihackercup
Users that are interested in aihackercup are comparing it to the libraries listed below
Sorting:
- Fine-tune an LLM to perform batch inference and online serving.☆112Updated 3 months ago
- ☆80Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated last year
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆55Updated 7 months ago
- A curated list of materials on AI guardails☆40Updated 3 months ago
- A framework for fine-tuning retrieval-augmented generation (RAG) systems.☆128Updated this week
- Simple UI for debugging correlations of text embeddings☆290Updated 3 months ago
- Automating enterprise workflows with multimodal agents☆110Updated 10 months ago
- Build Agentic workflows with function calling using open LLMs☆28Updated last month
- ☆19Updated last year
- ☆124Updated 10 months ago
- ☆67Updated 10 months ago
- Train your own SOTA deductive reasoning model☆105Updated 5 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆88Updated this week
- ☆19Updated last year
- Simple examples using Argilla tools to build AI☆55Updated 9 months ago
- ☆31Updated 9 months ago
- Training-Ready RL Environments + Evals☆65Updated this week
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆102Updated last year
- Fine tune Gemma 3 on an object detection task☆79Updated last month
- An introduction to LLM Sampling☆79Updated 8 months ago
- A set of scripts and notebooks on LLM finetunning and dataset creation☆110Updated 11 months ago
- ☆68Updated 3 months ago
- Just a bunch of benchmark logs for different LLMs☆120Updated last year
- A lightweight evaluation suite tailored specifically for assessing Indic LLMs across a diverse range of tasks☆38Updated last year
- RAG example using DSPy, Gradio, FastAPI☆84Updated last year
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da☆113Updated 5 months ago
- One click away from a locally downloaded, fine-tuned model, hosted on hugging face, with inference built in. In two hours.☆22Updated 5 months ago
- ☆56Updated 2 months ago
- Material for the series of seminars on Large Language Models☆34Updated last year