bytedance / BytevalKit-LLMLinks
☆24Updated 2 weeks ago
Alternatives and similar repositories for BytevalKit-LLM
Users that are interested in BytevalKit-LLM are comparing it to the libraries listed below
Sorting:
- ☆12Updated last year
- Code for paper called Self-Training Elicits Concise Reasoning in Large Language Models☆38Updated 2 months ago
- FuseAI Project☆87Updated 5 months ago
- [COLM'24] Corex: Pushing the Boundaries of Complex Reasoning through Multi-Model Collaboration☆29Updated 9 months ago
- Large Language Models Meet NL2Code: A Survey☆35Updated 8 months ago
- ☆68Updated 7 months ago
- Currently collecting some awesome Manus replays. Feel free to share your use cases.☆15Updated 4 months ago
- Evaluating LLMs' multi-round chatting capability via assessing conversations generated by two LLM instances.☆155Updated last month
- [ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement☆185Updated last year
- [ACL 2024 Demo] Official GitHub repo for UltraEval: An open source framework for evaluating foundation models.☆244Updated 8 months ago
- Generative Judge for Evaluating Alignment☆244Updated last year
- Probably one of the lightest native RAG + Agent apps out there,experience the power of Agent-powered models and Agent-driven knowledge ba…☆28Updated last month
- This is the official project of paper: Compress to Impress: Unleashing the Potential of Compressive Memory in Real-World Long-Term Conver…☆19Updated 8 months ago
- ☆88Updated 8 months ago
- Reproducible Language Agent Research☆29Updated 3 weeks ago
- ☆310Updated last year
- Agentless Lite: RAG-based SWE-Bench software engineering scaffold☆34Updated 3 months ago
- LlamaTouch: A Faithful and Scalable Testbed for Mobile UI Task Automation☆62Updated 11 months ago
- WritingBench: A Comprehensive Benchmark for Generative Writing☆94Updated last month
- Contrastive Chain-of-Thought Prompting☆64Updated last year
- ☆71Updated 4 months ago
- ☆27Updated last year
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"☆113Updated 10 months ago
- DSBench: How Far are Data Science Agents from Becoming Data Science Experts?☆58Updated 4 months ago
- Designing Multi-Agent Systems with Zero Supervision☆87Updated last week
- 🌟 SwarmAgent: A framework for simulating social group dynamics using multi-agent collaboration, aiding insights into collective behavior…☆12Updated last year
- ☆102Updated 7 months ago
- [ACL 2024] AutoAct: Automatic Agent Learning from Scratch for QA via Self-Planning☆229Updated 6 months ago
- ☆221Updated 2 months ago
- Scalable Meta-Evaluation of LLMs as Evaluators☆42Updated last year