locuslab / chatllm-vscode
☆62Updated last year
Alternatives and similar repositories for chatllm-vscode:
Users that are interested in chatllm-vscode are comparing it to the libraries listed below
- Score LLM pretraining data with classifiers☆54Updated last year
- Lightweight tools for quick and easy LLM demo's☆26Updated 6 months ago
- Understanding the correlation between different LLM benchmarks☆29Updated last year
- EvaByte: Efficient Byte-level Language Models at Scale☆85Updated last week
- ☆22Updated last year
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆52Updated 3 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆91Updated 2 months ago
- Functional Benchmarks and the Reasoning Gap☆84Updated 6 months ago
- Very minimal (and stateless) agent framework☆41Updated 2 months ago
- Prototype advanced LLM algorithms for reasoning and planning.☆96Updated 8 months ago
- ☆60Updated last year
- Backtracing: Retrieving the Cause of the Query, EACL 2024 Long Paper, Findings.☆88Updated 8 months ago
- Evaluating LLMs with CommonGen-Lite☆89Updated last year
- LLMs as Collaboratively Edited Knowledge Bases☆45Updated last year
- Testing paligemma2 finetuning on reasoning dataset☆18Updated 3 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆39Updated last month
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆80Updated last year
- ☆46Updated last month
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆32Updated this week
- LILO: Library Induction with Language Observations☆85Updated 7 months ago
- ☆67Updated 7 months ago
- Simple GRPO scripts and configurations.☆59Updated last month
- ☆48Updated last year
- Nexusflow function call, tool use, and agent benchmarks.☆19Updated 3 months ago
- ☆81Updated last year
- A new way to generate large quantities of high quality synthetic data (on par with GPT-4), with better controllability, at a fraction of …☆22Updated 6 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆56Updated 2 weeks ago
- Track the progress of LLM context utilisation☆54Updated 8 months ago
- Code repository for the c-BTM paper☆106Updated last year
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆168Updated 2 months ago