The official Python library for Openlayer, the Continuous Model Improvement Platform for AI.
☆16 Updated this week
Alternatives and similar repositories for openlayer-python
Users who are interested in openlayer-python are comparing it to the libraries listed below.
- ☆12 Jan 11, 2026 Updated last month
- A Swedish Natural Language Understanding Benchmark ☆11 Dec 12, 2025 Updated 2 months ago
- [CVPR2024] Learning from Synthetic Human Group Activities ☆14 Feb 24, 2025 Updated last year
- A framework for few-shot evaluation of autoregressive language models. ☆12 Jul 14, 2025 Updated 7 months ago
- DOMAINEVAL is an auto-constructed benchmark for multi-domain code generation that consists of 2k+ subjects (i.e., description, reference … ☆14 Dec 12, 2024 Updated last year
- Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines" ☆11 Oct 11, 2024 Updated last year
- ☆14 Dec 1, 2025 Updated 2 months ago
- ☆10 Dec 3, 2024 Updated last year
- ☆11 Jan 3, 2024 Updated 2 years ago
- ☆11 Oct 15, 2022 Updated 3 years ago
- ☆27 Updated this week
- Dataset for AAAI paper "Natural Language Inference in Context - Investigating Contextual Reasoning over Long Texts" ☆11 Nov 18, 2022 Updated 3 years ago
- Align, a general text alignment function ☆15 Dec 7, 2023 Updated 2 years ago
- Code and Data for GlitchBench ☆13 Feb 27, 2024 Updated 2 years ago
- benchmarks for evaluating MT models ☆11 Jun 26, 2024 Updated last year
- Website for release of TellMeWhy dataset for why question answering ☆14 Nov 11, 2022 Updated 3 years ago
- [KDD24-ADS] R-Eval: A Unified Toolkit for Evaluating Domain Knowledge of Retrieval Augmented Large Language Models ☆11 Apr 9, 2024 Updated last year
- A Shiny app to help people learn (and play with) Hamiltonian Monte Carlo sampling ☆11 Feb 8, 2025 Updated last year
- MaterialX, the next generation of mkdocs-material ☆38 Updated this week
- ☆12 Nov 5, 2024 Updated last year
- Library which aims to generate Kubernetes YAML templates from an Airflow DAG using the Airflow Kubernetes Pod Operator ☆10 May 6, 2021 Updated 4 years ago
- ☆14 May 12, 2018 Updated 7 years ago
- Code for our project CROWN (Conversational Passage Ranking by Reasoning over Word Networks) ☆10 Jan 11, 2024 Updated 2 years ago
- ☆22 Sep 30, 2025 Updated 5 months ago
- Survey of available speech datasets for Polish ASR development ☆17 Jan 1, 2025 Updated last year
- LLM red teaming datasets from the paper 'Student-Teacher Prompting for Red Teaming to Improve Guardrails' for the ART of Safety Workshop … ☆22 Oct 12, 2023 Updated 2 years ago
- ☆10 Jul 12, 2023 Updated 2 years ago
- Chinese-language large-model evaluation benchmark covering several major task categories, with graded evaluation ☆10 May 6, 2024 Updated last year
- ☆12 Mar 5, 2025 Updated 11 months ago
- Nomad Cyclist Problem - A variation of Traveling Salesman Problem (with open tour) adjusted for elevation and factors ☆10 Jul 4, 2025 Updated 7 months ago
- LLM benchmarks ☆13 Feb 22, 2024 Updated 2 years ago
- ☆11 Nov 5, 2024 Updated last year
- ☆11 Updated this week
- Shaping Language Models with Cognitive Insights ☆15 Feb 29, 2024 Updated last year
- A canonical source of GenAI energy benchmark and measurements ☆50 Nov 29, 2025 Updated 3 months ago
- LGEB: Benchmark of Language Generation Evaluation ☆16 Oct 21, 2022 Updated 3 years ago
- ☆14 May 7, 2025 Updated 9 months ago
- An LSTM model implemented by PyTorch to perform sentiment classification on the Stanford Sentiment Treebank (SST-5) dataset. ☆11 Sep 13, 2022 Updated 3 years ago
- Meta-specification framework for AI Agents to generate Spec-driven X toolkits automatically. ☆28 Nov 22, 2025 Updated 3 months ago