改進ASR錯誤判斷相似音詞彙的問題

SherryLiu c99787da72 backup reviewed dictionary 10 kuukautta sitten
README.md c1583d2700 save outputs 10 kuukautta sitten
asr_api.py 16e5e2f5c6 added API 10 kuukautta sitten
dictionary.txt 8e3bbd0456 added jieba dictionary 10 kuukautta sitten
dictionary_reviewed.txt c99787da72 backup reviewed dictionary 10 kuukautta sitten
knowledge_graph_test_rows.csv 8e3bbd0456 added jieba dictionary 10 kuukautta sitten
llama_asr.py 892fdb36f4 commit before switch branch 10 kuukautta sitten
requirements.txt fed1681308 added tone 10 kuukautta sitten
soundex.py 70fff6defb improved soundex index and error handling 10 kuukautta sitten
whisper.py 8e3bbd0456 added jieba dictionary 10 kuukautta sitten

README.md

語音轉文字 ASR

Prerequisite

  • 語音檔存放需放在data/
  • environment.env放openai key

To run the code

pip install -r requirements.txt

執行所有在data目錄下的語音檔 python whisper.py

執行單個語音檔 python whisper.py --file 語音檔名