benchmark
Here are 4,463 public repositories matching this topic...
Awesome-LLM-Eval: a curated list of tools, datasets/benchmark, demos, leaderboard, papers, docs and models, mainly for Evaluation on LLMs. 一个由工具、基准/数据、演示、排行榜和大模型等组成的精选列表,主要面向基础大模型评测,旨在探求生成式AI的技术边界.
-
Updated
Jun 12, 2024
HausaHate is a benchmark dataset for Hausa hate speech detection task. it was extracted from West African Facebook pages and comprises 2,000 comments annotated according to a binary class (offensive and non-offensive) and hate speech targets (race, gender and none).
-
Updated
Jun 12, 2024
DICE: Detecting In-distribution Data Contamination with LLM's Internal State
-
Updated
Jun 12, 2024 - Python
MINERS ⛏️: The semantic retrieval benchmark for evaluating multilingual language models.
-
Updated
Jun 12, 2024 - Python
Performance benchmarking and testing framework for .NET applications 📈
-
Updated
Jun 12, 2024 - C#
Framework for benchmarking vector search engines
-
Updated
Jun 11, 2024 - Python
Companion code for FanOutQA: Multi-Hop, Multi-Document Question Answering for Large Language Models (ACL 2024)
-
Updated
Jun 11, 2024 - Python
MTEB: Massive Text Embedding Benchmark
-
Updated
Jun 11, 2024 - Python
A task generation and model evaluation system.
-
Updated
Jun 11, 2024 - Python
The official evaluation suite and dynamic data release for MixEval.
-
Updated
Jun 12, 2024 - Python
Foundation model benchmarking tool. Run any model on Amazon SageMaker and benchmark for performance across instance type and serving stack options.
-
Updated
Jun 11, 2024 - Jupyter Notebook
BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions
-
Updated
Jun 11, 2024 - Python
The most powerful MICROSOFT WINDOWS hardening and benchmark! Work in progress -- experimental. Best security database you will have "2024", "11" parent
-
Updated
Jun 11, 2024 - Batchfile
Take your packages for a jog!
-
Updated
Jun 11, 2024 - Julia
https://db-benchmarks.com website
-
Updated
Jun 11, 2024 - JavaScript
Source for the TechEmpower Framework Benchmarks project
-
Updated
Jun 11, 2024 - Java
Improve this page
Add a description, image, and links to the benchmark topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the benchmark topic, visit your repo's landing page and select "manage topics."