logo
M.A
Login
M.A

M.A

zacma

Everything as Code.

6Followings
5Followers

Groups

  • aigc
  • qijia
Only show repositories within permission
🤖 Free Search with AI, 💡 Open-Source Perplexity, 📍 Support Ollama/SearXNG, Support Docker deployment. 让AI大模型和搜索引擎回答你的问题,支持本地大模型(Ollama)、聚合搜索引擎SearXNG,支持Docker一键部署。
TypeScript
17500MIT
Python
0000AGPL-3.0
AI模型聚合管理中转分发系统,支持将多种大模型转为统一格式调用,支持OpenAI、Claude、Gemini等格式,可供个人或者企业内部管理与分发渠道使用。🍥 The next-generation LLM gateway and AI asset management system supports multiple languages.
JavaScript
Go
0000NOASSERTION
Coding项目协同数据迁移工具
批量迁移代码托管平台代码仓库到 CNB
Go
0000MIT
LLM inference in C/C++ Fork from https://github.com/ggerganov/llama.cpp.git 自动化构建Docker镜像
C++
3000MIT
HTML
Cuda
0000NOASSERTION
A high-throughput and memory-efficient inference and serving engine for LLMs
Python
0000Apache-2.0
https://github.com/vllm-project/FlashMLA.git
Cuda
Python
0000MIT
Fast and memory-efficient exact attention
Python
0000BSD-3-Clause
Python
0000Apache-2.0
https://github.com/oneapi-src/oneDNN.git
C++
0000Apache-2.0
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Python
0000
vLLM官方镜像部署DeepSeek模型,生产环境中提供类OpenAI接口服务。
Markdown
0900
DeepResearch Agent with LangGraph, using any LLM models, search engine, RAG retrieval.
TypeScript
0000
Xorbits Inference(Xinference) is a powerful and versatile library designed to serve language, speech recognition, and multimodal models. With Xorbits Inference, you can effortlessly deploy and serve your or state-of-the-art built-in models using just a single command. Whether you are a researcher, developer, or data scientist...
Python
0000
SGLang is a fast serving framework for large language models and vision language models.
Python
0000
常用镜像制品库
Markdown
0000