rajshah4 / LLM-Evaluation
Sample notebooks and prompts for LLM evaluation
☆160 Nov 2, 2025 Updated 3 months ago
Alternatives and similar repositories for LLM-Evaluation
Users who are interested in LLM-Evaluation are comparing it to the libraries listed below.
- Vectorized implementation of a general feedforward neural network in Python ☆10 Jan 22, 2017 Updated 9 years ago
- Learn how to use Transformer-based models for named-entity recognition (NER) tasks and how to analyze various model features, constraints… ☆15 Jun 29, 2022 Updated 3 years ago
- Routing with reinforcement learning ☆10 Apr 9, 2022 Updated 3 years ago
- ☆11 May 14, 2017 Updated 8 years ago
- ☆27 Feb 9, 2026 Updated last week
- Study the temporal performance degradation of machine learning models. ☆16 Jan 26, 2024 Updated 2 years ago
- Code for our paper: Hierarchical RL Using an Ensemble of Proprioceptive Periodic Policies ☆15 Feb 21, 2019 Updated 6 years ago
- ☆19 Jun 26, 2024 Updated last year
- Exploring the classical regression capabilities of LLMs. ☆18 May 20, 2024 Updated last year
- LLM evaluation. ☆16 Nov 7, 2023 Updated 2 years ago
- Scalable Meta-Evaluation of LLMs as Evaluators ☆43 Feb 15, 2024 Updated 2 years ago
- SLIM Models by LLMWare. A streamlit app showing the capabilities for AI Agents and Function Calls. ☆20 Feb 11, 2024 Updated 2 years ago
- This repository stems from our paper, “Cataloguing LLM Evaluations”, and serves as a living, collaborative catalogue of LLM evaluation fr… ☆19 Nov 16, 2023 Updated 2 years ago
- Code for Zero-shot Triplet Extraction by Template Infilling (Kim et al; IJCNLP-AACL 2023) ☆21 Feb 17, 2024 Updated last year
- Real-time data pipeline for AI apps in Azure ☆26 Dec 5, 2023 Updated 2 years ago
- Build Neo4J Knowledge Graphs from Excel files ☆22 Nov 18, 2024 Updated last year
- Evaluating LLMs with fewer examples ☆169 Apr 12, 2024 Updated last year
- An end-to-end benchmark suite of multi-modal DNN applications for system-architecture co-design ☆22 Dec 13, 2024 Updated last year
- Fiddler Auditor is a tool to evaluate language models. ☆189 Mar 11, 2024 Updated last year
- List of Computer Science courses with video lectures. ☆27 Feb 17, 2022 Updated 3 years ago
- Codes, scripts, and notebooks on various aspects of transformer models. ☆27 Feb 27, 2023 Updated 2 years ago
- A skill-sharing platform where users can offer and request skills, match with others, and build a learning community. ☆28 Jul 18, 2025 Updated 6 months ago
- 2024 LlamaIndex RAG Hackathon "1st Place Award" Project ☆70 Feb 16, 2024 Updated 2 years ago
- Foundation model benchmarking tool. Run any model on any AWS platform and benchmark for performance across instance type and serving stac… ☆254 Apr 11, 2025 Updated 10 months ago
- Solving Inverse Physics Problems with Score Matching ☆32 Dec 4, 2023 Updated 2 years ago
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering" ☆86 Aug 12, 2024 Updated last year
- How to train an instance segmentation model with mmdetection ☆29 Oct 13, 2019 Updated 6 years ago
- A tool for evaluating LLMs ☆428 May 10, 2024 Updated last year
- 🦖 X—LLM: Cutting Edge & Easy LLM Finetuning ☆408 Jan 17, 2024 Updated 2 years ago
- Sample applications built on the Graphlit Platform ☆76 Oct 11, 2025 Updated 4 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆12 Nov 14, 2025 Updated 3 months ago
- Streamlit web-app based Bone Fracture detection using YoloV8, FasterRCNN with ResNet, and VGG16 with SSD ☆12 Nov 26, 2024 Updated last year
- In this GitHub repository, we will demonstrate how to utilize MongoDB to build an automated underwriting process to calculate a customize… ☆11 Updated this week
- Pipeline for exploring protein families using homology based or vector representation based methods to generate clusters in sequence spac… ☆18 Oct 8, 2025 Updated 4 months ago
- Demo code for the Custom Copilot Demo ☆11 Dec 3, 2024 Updated last year
- This project aims to show how the Whisper model can be fine-tuned on a language it was not trained on, provided it was trained on a similar language. ☆11 May 10, 2024 Updated last year
- Here, I provided the solution for exercises of IBM Quantum Challenge 2020 ☆10 Oct 27, 2020 Updated 5 years ago
- Match celebrity users with their respective tweets by making use of Semantic Textual Similarity on over 900+ celebrity users' 2.5 million… ☆13 Nov 21, 2023 Updated 2 years ago
- Typewriter component for Svelte that actually "types" one character at a time ☆16 Jan 23, 2026 Updated 3 weeks ago