3 个版本

0.1.5	2023年9月13日
0.1.4	2023年5月31日
0.1.0	2023年5月31日

#2596 in 命令行工具

24 每月下载量

MIT 许可证

28KB
712 代码行

一个使用 Rust 构建的 Whisper CLI

该项目试图使用 Rust 构建一个简单的 Whisper CLI，以替代基本的 Python 版本。它底层使用 whisper.cpp，这使得它在 M1 系统上运行速度更快。

安装

您可以从最新版本下载适用于您操作系统的二进制文件，或者使用 cargo install whisper_cli 从头开始构建。

在任何地方运行

将 whisper 二进制文件放在 Unix 系统（Mac/Linux）的 /usr/local/bin 中，并确保它具有执行权限（在终端中使用 chmod +x whisper）。

关闭并重新打开终端，通过输入 whisper --help 来测试它。它应该输出以下内容。

用法

$ whisper --help
Generate a transcript of an audio file using the Whisper speech-to-text engine. The transcript will be saved as a .txt, .vtt, and .srt file in the same directory as the audio file.

Usage: whisper [OPTIONS] <AUDIO>

Arguments:
  <AUDIO> Path to the audio file to transcribe

Options:
  -m, --model <MODEL>
          Name of the Whisper model to use

          [default: medium]
          [possible values: tiny.en, tiny, base.en, base, small.en, small, medium.en, medium, large, large-v1]

  -l, --lang <LANG>
          Language spoken in the audio. Attempts to auto-detect by default

          [possible values: auto, en, zh, de, es, ru, ko, fr, ja, pt, tr, pl, ca, nl, ar, sv, it, id, hi, fi, vi, he, uk, el, ms, cs, ro, da, hu, ta, no, th, ur, hr, bg, lt, la, mi, ml, cy, sk, te, fa, lv, bn, sr, az, sl, kn, et, mk, br, eu, is, hy, ne, mn, bs, kk, sq, sw, gl, mr, pa, si, km, sn, yo, so, af, oc, ka, be, tg, sd, gu, am, yi, lo, uz, fo, ht, ps, tk, nn, mt, sa, lb, my, bo, tl, mg, as, tt, haw, ln, ha, ba, jw, su]

  -t, --translate
          Toggle translation

  -k, --karaoke
          Generate timestamps for each word

  -h, --help
          Print help information (use `-h` for a summary)

  -V, --version
          Print version information

开发

确保您已安装最新版本的 rust（使用 rustup）。然后，您可以通过运行 cargo build 来构建项目，并通过 cargo run 运行它。

许可证

该项目受 MIT 许可证的许可 - 有关详细信息，请参阅 LICENSE 文件。

依赖关系

~12–29MB
~424K SLoC