llama
Here are 966 public repositories matching this topic...
A high-throughput and memory-efficient inference and serving engine for LLMs
Updated Jun 12, 2024 · Python
Updated Jun 12, 2024 · C++
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.
Updated Jun 12, 2024 · Python
🏗️ Fine-tune, build, and deploy open-source LLMs easily!
Updated Jun 12, 2024 · Go
Langchain-Chatchat (formerly Langchain-ChatGLM): a local knowledge-base question-answering app built on Langchain and language models such as ChatGLM.
Updated Jun 12, 2024 · Python
🤖 A collection of practical AI repos, tools, websites, papers, and tutorials. A practical AI treasure chest 💎
Updated Jun 12, 2024 · Ruby
Java version of LangChain
Updated Jun 12, 2024 · Java
AGiXT is a dynamic AI Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse AI providers. Combining adaptive memory, smart features, and a versatile plugin system, AGiXT delivers efficient and comprehensive AI solutions.
Updated Jun 12, 2024 · Python
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Updated Jun 12, 2024 · Python
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
Updated Jun 12, 2024 · Python
A high-performance inference system for large language models, designed for production environments.
Updated Jun 12, 2024 · C++
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
Updated Jun 12, 2024 · Python
OpenAI-style API for open large language models: use LLMs just like ChatGPT. Supports LLaMA, LLaMA-2, BLOOM, Falcon, Baichuan, Qwen, Xverse, SqlCoder, CodeLLaMA, ChatGLM, ChatGLM2, ChatGLM3, and others. A unified backend interface for open-source large language models.
Updated Jun 12, 2024 · Python
🤯 Lobe Chat - an open-source, modern-design ChatGPT/LLMs UI/Chat Framework. Supports speech-synthesis, multi-modal, and extensible plugin system. One-click FREE deployment of your private ChatGPT/Gemini/Ollama chat application.
Updated Jun 12, 2024 · TypeScript