3 个版本

0.1.3	2023 年 12 月 5 日
0.1.2	2023 年 11 月 26 日
0.1.1	2023 年 11 月 25 日
0.1.0	~~2023 年 11 月 24 日~~

403 在机器学习中

每月 30 次下载

MIT 许可证

23KB
559 代码行

rusty-whisper

Rust 实现 Whisper。有关模型的更多信息：https://github.com/openai/whisper

该软件包基于 tract。

快速开始

下载权重和示例音频文件并运行简单的推理代码

use rusty_whisper::Whisper;

fn main() {
    let whisper = Whisper::new(
        "weights/encoder.onnx",
        "weights/decoder.onnx",
        "weights/multilingual.tiktoken",
        "weights/positional_embedding.npz",
        "weights/mel_filters.npz",
    );
    let result = whisper.recognize_from_audio("data/audio.wav");
    println!("{}", result);
}

该模型仅与 16 位 WAV 文件兼容，因此请在运行工具之前确保将输入转换。例如，您可以使用 ffmpeg 如下所示

ffmpeg -i input.mp3 -ar 16000 -ac 1 -c:a pcm_s16le output.wav

依赖项

~29–41MB
~537K SLoC