NanshineLoong / Self-Evolving-BenchmarkView external linksLinks
A framework for evolving and testing question-answering datasets with various models.
☆21Feb 28, 2024Updated last year
Alternatives and similar repositories for Self-Evolving-Benchmark
Users that are interested in Self-Evolving-Benchmark are comparing it to the libraries listed below
Sorting:
- Companion code to https://arxiv.org/abs/2402.15491☆22Sep 18, 2025Updated 4 months ago
- Data Structures with Python(AIX20001) 강의 자료실☆18Jun 14, 2024Updated last year
- This Python project integrates MetaTrader5 with GPT-4 to generate automated trading signals. It analyzes OHLC and tick data to provide re…☆12Aug 25, 2024Updated last year
- A simple Sentiment Analysis API in FastAPI.☆15Dec 17, 2024Updated last year
- An experiment to see if we can process G2 reviews to extract topics from reviews☆10Feb 5, 2024Updated 2 years ago
- A Terminal User Interface (TUI) application that enables interactive conversations with your documents using Large Language Models (LLM) …☆13Dec 11, 2024Updated last year
- [ACL2024] Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios☆69Aug 5, 2025Updated 6 months ago
- [ACL 2024] An Easy-to-use Hallucination Detection Framework for LLMs.☆38Feb 25, 2025Updated 11 months ago
- An implementation of Transformer with Expire-Span, a circuit for learning which memories to retain☆34Oct 30, 2020Updated 5 years ago
- The best way to practice interview questions☆14Apr 25, 2023Updated 2 years ago
- Interact with ChatGPT and GPT-4 in alternative ways☆13Mar 17, 2024Updated last year
- FGLA: Fast Generation-Based Gradient Leakage Attacks against Highly Compressed Gradients☆14Dec 20, 2022Updated 3 years ago
- ☆12Aug 15, 2023Updated 2 years ago
- Implementing BERT + CRF with PyTorch for Chinese NER.☆10Mar 7, 2022Updated 3 years ago
- A Python-based text editor server built with FastMCP that provides tools for file operations. This server enables reading, editing, and m…☆13Aug 21, 2025Updated 5 months ago
- Residual Quantization Autoencoder, used for interpreting LLMs☆13Jan 1, 2025Updated last year
- Amplify your coding capabilities with AI - your smart co-pilot for an elevated coding experience.☆14Feb 9, 2026Updated last week
- Code and data for the paper "Steering Conversational Large Language Models for Long Emotional Support Conversations" along with a UI to v…☆14Apr 14, 2025Updated 10 months ago
- Official repository for "DYPLOC: Dynamic Planning of Content Using Mixed Language Models for Opinion Text Generation"☆10May 20, 2022Updated 3 years ago
- questions on leetcode☆10Mar 11, 2024Updated last year
- USDX indicator calculates and displays the US dollar index in the separate window of any other chart.☆11Aug 8, 2025Updated 6 months ago
- voice ai assistant for linux☆12Aug 29, 2024Updated last year
- How to build an ACP compliant agent that uses MCP as well!☆11May 6, 2025Updated 9 months ago
- Object-Oriented distance-independent Individual Tree Simulator (TreeSim)☆15Dec 7, 2023Updated 2 years ago
- Welcome to Meet GPT! This allows business users to experience Azure Open AI Models within their own environment using a PowerApps client.☆10May 11, 2023Updated 2 years ago
- Software for the Autonomous Agents Terrabot Project☆11Oct 15, 2025Updated 4 months ago
- ⚡ FutureGPT - Application development framework that connects GPT-4 with external data, the internet, other applications and language mod…☆12May 14, 2023Updated 2 years ago
- Development repository for the Digital Terraria Lab implementation of the Sugarscape agent-based societal simulation.☆15Updated this week
- GPT AI Assistant documentation☆19Apr 18, 2025Updated 9 months ago
- An implementation of Compositional Attention: Disentangling Search and Retrieval by MILA☆14Jun 1, 2022Updated 3 years ago
- LCA as Code - Domain-Specific Language for Life-Cycle Analysis☆14Oct 1, 2025Updated 4 months ago
- ROS (Python) package for controlling Yale Grablab Openhands☆12Feb 10, 2022Updated 4 years ago
- Official code for the paper "Does CLIP's Generalization Performance Mainly Stem from High Train-Test Similarity?" (ICLR 2024)☆10Aug 26, 2024Updated last year
- MetaTrader 5 indicator that measures the largest distance between a price (high or low) and a moving average.☆11Oct 9, 2020Updated 5 years ago
- MCP server or onshape CAD☆11Apr 21, 2025Updated 9 months ago
- Prompt-Guided Retrieval For Non-Knowledge-Intensive Tasks☆12Sep 1, 2023Updated 2 years ago
- Python library for Evaluation☆16Updated this week
- ☆12Dec 8, 2024Updated last year
- Practical, hands-on risk modeling, risk assessment and verifications of risk models across major risk classes and understanding risk regu…☆16May 19, 2022Updated 3 years ago