Make every frame speak beautifully
Smart subtitle generation and multilingual translation solution for video/audio files.

This application retains all the features of the original VideoSubtitleGenerator command-line tool, with the following enhancements:
whisper command
Because I develop on an Apple Silicon chip, I do not have a Windows CUDA development environment, so many CUDA scenarios are difficult to cover in both development and testing.
Starting from version 1.20.0, Core ML is supported on Apple Silicon, providing faster speech recognition. If you are using an Apple Silicon chip, please download the mac arm64 version of the release package. It will automatically enable Core ML acceleration.
This project supports various translation services, including Baidu Translation, Volcano Engine Translation, DeepLX, local Ollama models, DeepSeek and OpenAI-style APIs. Using these services requires the appropriate API keys or configurations.
For information on obtaining API keys for services like Baidu Translation and Volcano Engine, please refer to https://bobtranslate.com/service/. Thanks to Bob, an excellent piece of software, for providing this information.
For AI translation, the results depend heavily on the model and the prompt, so try different models and prompts to find the combination that works best for you. You may also want to try the AI aggregation platform DeerAPI, which offers nearly 500 models across multiple platforms, so you can choose a suitable model for translation.
SmartSub now supports configuring custom parameters for each AI translation service, allowing you to precisely control model behavior:
To generate subtitles from video or audio, you need a Whisper model. Whisper models differ in accuracy and processing speed.
- tiny or base models, which may have lower accuracy but faster processing speeds and smaller memory usage
- small or base models to balance accuracy and resource consumption
- large models for higher accuracy
- en models for optimized English processing
- q5 or q8 models, which offer smaller sizes at the cost of slightly reduced accuracy

Please download the appropriate package based on your computer's system, chip, and graphics card.
| System | Chip | Graphics Card | Download Package |
|---|---|---|---|
| Windows | x64 | CUDA >= 11.8.0 < 12.0.0 | windows-x64_cuda11.8.0 |
| Windows | x64 | CUDA >= 12.0.0 | windows-x64_cuda12.4.1 |
| Windows | x64 | CUDA >= 12.2.0 | windows-x64_cuda12.2.0 |
| Windows | x64 | no CUDA | windows-x64_no_cuda |
| Mac | Apple Silicon | Core ML supported | mac-arm64 |
| Mac | Intel | Core ML not supported | mac-x64 |
1️⃣ Clone the project locally
git clone https://github.com/buxuku/SmartSub.git
2️⃣ Install dependencies using yarn install or npm install
cd SmartSub
yarn install
If you are on Windows or an Intel Mac, download the node file from https://github.com/buxuku/whisper.cpp/releases/tag/latest, rename it to 'addon.node', and place it in the 'extraResources/addons/' directory, overwriting the existing file.
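The rename-and-overwrite step can also be scripted. The following is a minimal sketch, assuming the node file has already been downloaded by hand; the function name `install_addon` is hypothetical, and only the `addon.node` file name and the `extraResources/addons/` directory come from the instructions above.

```shell
#!/bin/sh
# Hypothetical helper: copy a manually downloaded .node file into the
# project as extraResources/addons/addon.node, replacing any existing file.
# Usage: install_addon <downloaded-file> [project-root]
install_addon() {
  src="$1"
  root="${2:-.}"
  dest_dir="$root/extraResources/addons"

  # Refuse to continue if the downloaded file is not where we expect it.
  if [ ! -f "$src" ]; then
    echo "source file not found: $src" >&2
    return 1
  fi

  mkdir -p "$dest_dir"
  cp -f "$src" "$dest_dir/addon.node"

  # Print the final path so the caller can confirm where the file went.
  echo "$dest_dir/addon.node"
}
```

Called from the SmartSub project root with the path of the downloaded file as the first argument, the helper prints the path of the installed `addon.node`.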
3️⃣ After installing dependencies, run yarn dev or npm run dev to launch the project
yarn dev
Due to the large size of model files, downloading them through the software may be challenging. You can manually download models and import them into the application. Here are two links for downloading models:
Mirror for users in mainland China (faster download speeds): https://hf-mirror.com/ggerganov/whisper.cpp/tree/main
Hugging Face official source: https://huggingface.co/ggerganov/whisper.cpp/tree/main
If you are using an Apple Silicon chip, you also need to download the corresponding encoder.mlmodelc file. (For q5 or q8 series models, this file is not needed.)
After downloading, you can import the model files into the application using the "Import Model" feature on the "Model Management" page, or copy them directly into the model directory.
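A manual download can also be done from the terminal. The sketch below rests on two assumptions that this README does not state: that model files are named `ggml-<name>.bin`, and that both hosts serve raw files under a `/resolve/main/` path. The function names `model_url` and `download_model` are hypothetical.

```shell
#!/bin/sh
# Build the download URL for a whisper.cpp model file.
# Assumptions (not stated in this README): model files are named
# ggml-<name>.bin and are served under /resolve/main/ on both hosts.
# Usage: model_url <model-name> [official|mirror]
model_url() {
  name="$1"
  case "${2:-official}" in
    mirror) host="https://hf-mirror.com" ;;
    *)      host="https://huggingface.co" ;;
  esac
  echo "$host/ggerganov/whisper.cpp/resolve/main/ggml-$name.bin"
}

# Download a model file into a directory.
# -L follows redirects; --fail makes curl exit non-zero on HTTP errors.
# Usage: download_model <model-name> [official|mirror] [target-dir]
download_model() {
  target_dir="${3:-.}"
  mkdir -p "$target_dir"
  curl -L --fail -o "$target_dir/ggml-$1.bin" "$(model_url "$1" "$2")"
}
```

For example, `download_model base mirror ./models` would try to save `ggml-base.bin` into `./models`, after which the file can be imported through the "Import Model" feature.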
Import steps:
Execute the following command in the terminal:
sudo xattr -dr com.apple.quarantine /Applications/SmartSub.app
Then try running the application again.
👏🏻 Issues and Pull Requests are welcome to help improve this project!
⭐ If you find this project helpful, feel free to give it a star or buy me a cup of coffee (please include your GitHub account in the payment note).
👨‍👨‍👦‍👦 If you run into any problems using the application, you are welcome to join the WeChat group to discuss and learn together.
| Alipay donation code | WeChat donation code | WeChat communication group |
|---|---|---|
| ![]() | ![]() | ![]() |
This project is licensed under the MIT License. See the LICENSE file for details.