DataScienceUIBK/SustainableQA

SustainableQA: A Comprehensive Question Answering Dataset for Corporate Sustainability and EU Taxonomy Reporting

☆45

Alternatives and similar repositories for SustainableQA

Users who are interested in SustainableQA are comparing it to the repositories listed below.

DataScienceUIBK / ArabicaQA
ArabicaQA: Comprehensive Dataset for Arabic Question Answering accepted at SIGIR 2024
☆18 · Jul 28, 2024 · Updated last year
abdoelsayed2016 / Legal-Question-Answering-Review
☆46 · Apr 26, 2023 · Updated 3 years ago
rasyosef / splade-index
Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba
☆38 · Oct 16, 2025 · Updated 9 months ago
evidentlyai / ml_observability_course
Free Open-source ML observability course for data scientists and ML engineers. Learn how to monitor and debug your ML models in productio…
☆107 · Dec 17, 2023 · Updated 2 years ago
7oSkaaa / Competitive-Programming-Session-Content
Competitive Programming Sessions
☆221 · Jun 26, 2026 · Updated 3 weeks ago
ACL2023-Retrieval-LM / ACL2023-Retrieval-LM.github.io
https://acl2023-retrieval-lm.github.io/
☆157 · Oct 18, 2023 · Updated 2 years ago
TutteInstitute / evoc
Embedding Vector Oriented Clustering
☆342 · Jun 2, 2026 · Updated last month
jina-ai / late-chunking
Code for explaining and evaluating late chunking (chunked pooling)
☆532 · Dec 23, 2024 · Updated last year
AI21Labs / in-context-ralm
☆295 · Dec 20, 2023 · Updated 2 years ago
lightonai / pylate
Late Interaction Models Training & Retrieval
☆876 · Jul 13, 2026 · Updated last week
texttron / tevatron
Tevatron - Unified Document Retrieval Toolkit across Scale, Language, and Modality. Demo in SIGIR 2023, SIGIR 2025.
☆742 · Updated this week
ace-agent / ace
Evolve your language agent with Agentic Context Engineering (ACE)
☆1,223 · May 19, 2026 · Updated 2 months ago
dphnAI / sonar
Large-scale LLM inference engine
☆1,808 · Updated this week
xhluca / bm25s
Fast BM25 search in Python, powered by Numpy and Numba
☆1,741 · Updated this week
McGill-NLP / llm2vec
Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'
☆1,705 · Apr 4, 2026 · Updated 3 months ago
agiresearch / OpenAGI
OpenAGI: When LLM Meets Domain Experts
☆2,276 · Nov 28, 2024 · Updated last year
gkamradt / needle-in-a-haystack
Doing simple retrieval from LLM models at various context lengths to measure accuracy
☆2,348 · Jun 8, 2026 · Updated last month
Mega4alik / ollm
☆2,682 · Nov 29, 2025 · Updated 7 months ago
microsoft / PromptWizard
Task-Aware Agent-driven Prompt Optimization Framework
☆3,899 · Oct 13, 2025 · Updated 9 months ago
urchade / GLiNER
Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts)
☆3,415 · Updated this week
Agenta-AI / agenta
The open-source workspace for building and running AI agents. Build agents through chat, share them with your team, and run background ag…
☆4,324 · Updated this week
lumina-ai-inc / chunkr
Vision infrastructure to turn complex documents into RAG/LLM-ready data
☆4,041 · Apr 9, 2026 · Updated 3 months ago
microsoft / mcp-for-beginners
This open-source curriculum introduces the fundamentals of Model Context Protocol (MCP) through real-world, cross-language examples in .N…
☆16,815 · Updated this week
apple / embedding-atlas
Embedding Atlas is a tool that provides interactive visualizations for large embeddings. It allows you to visualize, cross-filter, and se…
☆4,877 · Updated this week
huggingface / alignment-handbook
Robust recipes to align language models with human and AI preferences
☆5,641 · May 26, 2026 · Updated last month
promptslab / Promptify
Prompt Engineering | Prompt Versioning | Use GPT or other prompt based models to get structured output. Join our discord for Prompt-Engin…
☆4,623 · Mar 27, 2026 · Updated 3 months ago
systemdesign42 / system-design-academy
If you want to become good at AI & system design, join this newsletter 👇
☆26,809 · Updated this week
soulmachine / machine-learning-cheat-sheet
Classical equations and diagrams in machine learning
☆8,037 · Jul 30, 2024 · Updated last year
Facico / Chinese-Vicuna
Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model (a low-resource Chinese llama+lora solution, with a structure modeled on alpaca)
☆4,119 · Apr 18, 2025 · Updated last year
arcee-ai / mergekit
Tools for merging pretrained large language models.
☆7,254 · Jun 17, 2026 · Updated last month
clovaai / deep-text-recognition-benchmark
Text recognition (optical character recognition) with deep learning methods, ICCV 2019
☆3,937 · Mar 4, 2024 · Updated 2 years ago
yizhongw / self-instruct
Aligning pretrained language models with instruction data generated by themselves.
☆4,606 · Mar 27, 2023 · Updated 3 years ago
togethercomputer / RedPajama-Data
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
☆4,970 · Jun 3, 2026 · Updated last month
AutoGPTQ / AutoGPTQ
An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
☆5,074 · Apr 11, 2025 · Updated last year
mrdbourke / pytorch-deep-learning
Materials for the Learn PyTorch for Deep Learning: Zero to Mastery course.
☆18,430 · Feb 11, 2026 · Updated 5 months ago
louisfb01 / start-machine-learning
A complete guide to start and improve in machine learning (ML), artificial intelligence (AI) in 2026 without ANY background in the field …
☆5,279 · Jan 23, 2026 · Updated 5 months ago
HandsOnLLM / Hands-On-Large-Language-Models
Official code repo for the O'Reilly Book - "Hands-On Large Language Models"
☆27,725 · Apr 24, 2026 · Updated 2 months ago
youssefHosni / Data-Science-Interview-Questions-Answers
Curated list of data science interview questions and answers
☆5,776 · Sep 29, 2024 · Updated last year
FlagOpen / FlagEmbedding
Retrieval and Retrieval-augmented LLMs
☆11,968 · Apr 22, 2026 · Updated 3 months ago