Foundation model benchmarking tool. Run any model on Amazon SageMaker and benchmark for performance across instance type and serving stack options.
Updated Jun 2, 2024 - Jupyter Notebook
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
[ICLR 2024] Official PyTorch implementation of FasterViT: Fast Vision Transformers with Hierarchical Attention
The official evaluation suite and dynamic data release for MixEval.
World Model based Autonomous Driving Platform in CARLA 🚗
Chronos: Pretrained (Language) Models for Probabilistic Time Series Forecasting
Must-read Papers on Knowledge Editing for Large Language Models.
[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.
A task generation and model evaluation system.
Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...
Overview of Japanese LLMs
SaprotHub: Making Protein Modeling Accessible to All Biologists
A curated list of foundation models for vision and language tasks
Investigation of the capabilities of foundation models in the context of time series forecasting
Evaluation framework for oncology foundation models (FMs)
Papers, codes, datasets, applications, tutorials.
This repository contains the Python package for Helical
A Large Short-video Recommendation Dataset with Raw Text/Audio/Image/Videos (Talk Invited by DeepMind).
This repo contains evaluation code for the paper "MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI"