logo
Login
aigc
aigc
aigc
No description

Pinned

🤖 Free Search with AI, 💡 Open-Source Perplexity, 📍 Support Ollama/SearXNG, Support Docker deployment. 让AI大模型和搜索引擎回答你的问题,支持本地大模型(Ollama)、聚合搜索引擎SearXNG,支持Docker一键部署。
TypeScript
17500
vLLM官方镜像部署DeepSeek模型,生产环境中提供类OpenAI接口服务。
Markdown
0900
An easy API for making Event Source requests, with all the features of fetch(), Supports browsers and node.js
TypeScript
0000
LLM inference in C/C++ Fork from https://github.com/ggerganov/llama.cpp.git 自动化构建Docker镜像
C++
3000
SGLang is a fast serving framework for large language models and vision language models.
Python
0000
A high-throughput and memory-efficient inference and serving engine for LLMs
Python
0000
Recent updates
🤖 Free Search with AI, 💡 Open-Source Perplexity, 📍 Support Ollama/SearXNG, Support Docker deployment. 让AI大模型和搜索引擎回答你的问题,支持本地大模型(Ollama)、聚合搜索引擎SearXNG,支持Docker一键部署。
TypeScript
17500
AI模型聚合管理中转分发系统,支持将多种大模型转为统一格式调用,支持OpenAI、Claude、Gemini等格式,可供个人或者企业内部管理与分发渠道使用。🍥 The next-generation LLM gateway and AI asset management system supports multiple languages.
JavaScript
Go
0000
LLM inference in C/C++ Fork from https://github.com/ggerganov/llama.cpp.git 自动化构建Docker镜像
C++
3000
A high-throughput and memory-efficient inference and serving engine for LLMs
Python
0000
https://github.com/vllm-project/FlashMLA.git
Cuda
Python
0000
Fast and memory-efficient exact attention
Python
0000
https://github.com/oneapi-src/oneDNN.git
C++
0000
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Python
0000