ninehills / langevalView external linksLinks
Evaluation for AI apps and agent
☆44Jan 18, 2024Updated 2 years ago
Alternatives and similar repositories for langeval
Users that are interested in langeval are comparing it to the libraries listed below
Sorting:
- Stream UTF-8 bytes and read grapheme clusters safely☆34Jul 22, 2025Updated 6 months ago
- ☆13Mar 16, 2025Updated 11 months ago
- Tweets about Claude Code on Twitter / X☆24Sep 24, 2025Updated 4 months ago
- 🤗 HF Downloader (Hugging Face Downloader) 📦 A user-friendly GUI tool for downloading Hugging Face resources with enhanced connectivity…☆13Jan 5, 2025Updated last year
- This repository contains the code for implementation of RAG approach with company policies data, evaluation of RAG solution and smart chu…☆15Sep 18, 2025Updated 4 months ago
- 儿童故事常识推理与寓意理解评测(Commonsense Reasoning and Moral Understanding Evaluation in Children's Stories,CRMU)☆18Oct 22, 2024Updated last year
- KDD2024-WhoIsWho-Top3☆16Jun 17, 2024Updated last year
- Fine-tuning embedding models.☆14Nov 25, 2024Updated last year
- Update a binary to its latest version by using the original package manager that was used to install it☆22Aug 31, 2025Updated 5 months ago
- [ACL 24 Findings] Implementation of Resonance RoPE and the PosGen synthetic dataset.☆24Mar 5, 2024Updated last year
- 为 AstrBot 提供一种 Deepresearch 方案☆24Aug 5, 2025Updated 6 months ago
- gpt_server是一个用于生产级部署LLMs、Embedding、Reranker、ASR、TTS、文生图、图片编辑和文生视频的开源框架 。☆244Feb 6, 2026Updated last week
- ☆41Apr 11, 2025Updated 10 months ago
- Word acquisition in neural language models (TACL 2022).☆20Jan 30, 2025Updated last year
- ☆17Dec 12, 2024Updated last year
- Code for Robust Fine-tuning (RbFT)☆17Jan 31, 2025Updated last year
- ☆11Updated this week
- [Findings of EMNLP22] From Mimicking to Integrating: Knowledge Integration for Pre-Trained Language Models☆19Mar 16, 2023Updated 2 years ago
- Some resources about Ray Forward Meetup☆30Dec 25, 2025Updated last month
- QGEval: A Benchmark for Question Generation Evaluation☆19Nov 7, 2024Updated last year
- Empower ChatGPT with the ability to perform mathematical calculations, web scraping, PDF analysis, and more.☆21Apr 1, 2023Updated 2 years ago
- Fast, High-Fidelity LLM Decoding with Regex Constraints☆21Jul 26, 2024Updated last year
- MCP DeepResearch Server: 基于 LangGraph + Ollama + Tavily 的深度研究服务器,支持异步运行、超时控制与进度推送☆31Jun 16, 2025Updated 8 months ago
- g1: Using GPT-4o to create o1-like reasoning chains☆20Sep 17, 2024Updated last year
- Countdown Game Distill&RL☆47Sep 5, 2025Updated 5 months ago
- 使用FastAPI+vLLM部署Qwen2.5☆25Sep 29, 2024Updated last year
- ☆19Jun 25, 2024Updated last year
- A Next.js version of Claude Aritfacts , inspired by llamacoder☆27Sep 26, 2024Updated last year
- Research papers about Chain of Thought (CoT)☆59Oct 25, 2023Updated 2 years ago
- BiLLa: A Bilingual LLaMA with Enhanced Reasoning Ability☆416Jun 1, 2023Updated 2 years ago
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback.☆41Apr 4, 2025Updated 10 months ago
- ☆13Jun 13, 2025Updated 8 months ago
- [EMNLP 2024] LongRAG: A Dual-perspective Retrieval-Augmented Generation Paradigm for Long-Context Question Answering☆120Jan 29, 2025Updated last year
- CCL2020,“小牛杯”幽默计算任务数据发布☆23Aug 27, 2024Updated last year
- Chinese Machine Reading 2021海华AI挑战赛·中文阅读理解·技术组·第三名☆20May 27, 2021Updated 4 years ago
- ☆27Jul 18, 2023Updated 2 years ago
- [ACL 2024 Demo] Official GitHub repo for UltraEval: An open source framework for evaluating foundation models.☆256Oct 30, 2024Updated last year
- ☆120Jun 30, 2024Updated last year
- Fine-Tuning LLM and embedding models☆27Sep 12, 2023Updated 2 years ago