mjbommar / gpt4-passes-the-barLinks
GPT-4 Passes the Bar
☆26Updated last year
Alternatives and similar repositories for gpt4-passes-the-bar
Users that are interested in gpt4-passes-the-bar are comparing it to the libraries listed below
Sorting:
- A dataset for pretraining language models targeted for legal tasks.☆133Updated 2 years ago
- ☆27Updated 3 years ago
- Production-grade embedding generation, for any length of text, for transformer models.☆23Updated 2 weeks ago
- This repository serves as a collection of scrapers procuring and structuring various legal datasets☆17Updated 2 years ago
- Mining Legal Arguments in Court Decisions - Data and software☆68Updated 2 years ago
- Code for SaGe subword tokenizer (EACL 2023)☆25Updated 6 months ago
- Examples scripts that showcase how to use Private AI Text to de-identify, redact, hash, tokenize, mask and synthesize PII in text.☆82Updated last month
- Collection of Datasets for Legal Text Processing☆106Updated 2 years ago
- A simple web application for searching Word2Vec embeddings derived from approximately 2,000 law reports published by the The Incorporated…☆26Updated 2 years ago
- StAtutory Reasoning Assessment☆13Updated 2 years ago
- Kelvin Legal Data OS - Public Examples☆19Updated last year
- Factored Cognition Primer: How to write compositional language model programs☆49Updated 2 years ago
- Code and data to support "Speak, Memory: An Archaeology of Books Known to ChatGPT/GPT-4"☆69Updated 2 years ago
- Next-generation Punkt sentence boundary detection with zero dependencies☆17Updated 2 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆64Updated last year
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆30Updated 9 months ago
- Large Language Models (LLMs) and Generative Pre-trained Transformers (GPTs) for Legal☆91Updated 2 years ago
- ☆95Updated last year
- ☆18Updated 4 years ago
- A set of utilities for running few-shot prompting experiments on large-language models☆121Updated last year
- A simple library for segmenting legal texts☆17Updated 2 years ago
- Search through Facebook Research's PyTorch BigGraph Wikidata-dataset with the Weaviate vector search engine☆31Updated 3 years ago
- Code for our EMNLP '22 paper "Fixing Model Bugs with Natural Language Patches"☆19Updated 2 years ago
- ☆8Updated 11 months ago
- Small python package to measure OCR quality and other related metrics.☆23Updated last year
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆100Updated last year
- ☆13Updated 2 years ago
- LeXFiles and LegalLAMA: Facilitating English Multinational Legal Language Model Development☆20Updated last year
- [Added T5 support to TRLX] A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)☆47Updated 2 years ago
- C4RepSet: Representative Subset from C4 data for Training Pre-trained LMs☆10Updated 2 years ago