离线运行的本地语音识别服务 - CurateClick

https://github.com/jianchang512/stt这是一个离线运行的本地语音识别转文字工具，基于 fast-whipser 模型，将视频/音频中的人类声音识别并转为文字，可选文字输出格式:json、srt字幕、纯文字。可用于自行部署后替代 openai 的语音识别接口或百度语音识别等，准确率基本等同openai官方api接口。## 特点1. 本地离线工作2. 提供 api 调用接口3. 使用 openai-whisper 开源模型4. win下提供预编译exe版本，双击即可使用，无需部署5. 支持 win/mac/linux 源码部署## 预览视频https://github.com/jianchang512/stt/assets/3378335/d716acb6-c20c-4174-9620-f574a7ff095d![image](https://github.com/jianchang512/stt/assets/3378335/0f724ff1-21b3-4960-b6ba-5aa994ea414c)## api 接口接口地址: http://127.0.0.1:9977/api请求方法: POST请求参数: language: 语言代码:可选如下 > > 中文：zh > 英语：en > 法语：fr > 德语：de > 日语：ja > 韩语：ko > 俄语：ru > 西班牙语：es > 泰国语：th > 意大利语：it > 葡萄牙语：pt > 越南语：vi > 阿拉伯语：ar > 土耳其语：tr > model: 模型名称，可选如下 > > base 对应于 models/base.pt > small 对应于 models/small.pt > medium 对应于 models/medium.pt > large 对应于 models/large.pt > large-v3 对应于 models/large-v3.pt > response_format: 返回的字幕格式，可选 text|json|srt file: 音视频文件，二进制上传Api 请求示例python import requests # 请求地址 url = "http://127.0.0.1:9977/api" # 请求参数 file:音视频文件，language：语言代码，model：模型，response_format:text|json|srt # 返回 code==0 成功，其他失败，msg==成功为ok，其他失败原因，data=识别后返回文字 files = {"file": open("C:\\Users\\c1\\Videos\\2.wav", "rb")} data={"language":"zh","model":"base","response_format":"json"} response = requests.request("POST", url, timeout=600, data=data,files=files) print(response.json())

Latest Weekly Picks

make ink

Weekly Pick

Your AI tattoo generator for pro-grade concepts

make ink

Nov 13, 2025

Leo Wade

Sellfy

Weekly Pick

A code-free online store builder to turn views into revenue. Sell digital products, subscriptions, and merch, without fees or hassle.

Sellfy

Nov 7, 2025

Maris

Video To Blog

Weekly Pick

Convert videos into awesome blog posts.

Video To Blog Source

Nov 5, 2025

Video To Blog

SellerPic

Weekly Pick

SellerPic is the all-in-one AI design platform for e-commerce

SellerPic Source

Oct 23, 2025

SellerPic

Fast Image AI

Weekly Pick

Fast Image AI instantly transforms your photos into stunning styles like Ghibli, Sketch, and Pixar. Effortlessly control image elements and create amazing effects with just one click.

Fast Image AI

Oct 20, 2025

Fast AI Team

LinkedInPro

Weekly Pick

AI-powered tool that transforms casual photos into professional LinkedIn headshots instantly. No photographer needed—just upload and download.

LinkedInPro

Oct 19, 2025

Gabriel

Crevas AI

Weekly Pick

Crevas unifies Veo 3, Sora 2, Nano Banana and more into one intuitive canvas — so filmmakers can script, prompt, and generate cinematic stories without switching tools or losing consistency.

Crevas AI

Oct 17, 2025

Spark Alpha

AI Foto Edit

Weekly Pick

AI Foto Edit - Text to Image & Image Edit

AI Foto Edit Source

Oct 14, 2025

foto miniatur