dongwenbo6: ECAPA-TDNN NPU

Commit ba3198ae, created on June 25, 2023

Language Model

This folder contains a recipe for training language models (LMs). It supports both an RNN-based LM and a Transformer-based LM. The scripts rely on the HuggingFace datasets library, which manages data reading and loading from large text corpora. Training an LM on large text corpora might take weeks (or months), even on modern GPUs. In this template, for simplicity, we use only the training transcriptions of the mini-librispeech dataset. The recipes assume that you have already run the tokenizer training (see ../Tokenizer).
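
As a rough illustration of the data loading described above, the sketch below reads a plain-text file of transcriptions with the HuggingFace datasets library, one example per line. The file name train.txt and both helper functions are hypothetical and are not taken from train.py; the actual recipe may prepare its data differently.

```python
from pathlib import Path


def write_toy_corpus(path, lines):
    """Write one transcription per line (hypothetical helper for this demo)."""
    path = Path(path)
    path.write_text("\n".join(lines) + "\n", encoding="utf-8")
    return str(path)


def load_text_corpus(train_file):
    """Load a text file with the generic "text" loader of HuggingFace datasets.

    Each line of the file becomes one example with a single "text" field.
    """
    # Imported lazily so this module still imports when datasets is missing.
    from datasets import load_dataset

    return load_dataset("text", data_files={"train": train_file})


if __name__ == "__main__":
    # Assumption: a tiny two-line corpus stands in for the real transcriptions.
    corpus = write_toy_corpus("train.txt", ["hello world", "good morning"])
    dataset = load_text_corpus(corpus)
    print(dataset["train"][0]["text"])
```

Loading through the library rather than reading the file by hand keeps the same code path usable when the corpus grows well beyond a toy file.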

Extra Dependency:

Make sure the HuggingFace datasets library is installed. If it is not, run: pip install datasets
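
One way to check for the dependency before launching training is sketched below. It uses only the Python standard library; the helper name is_installed is hypothetical and is not part of this recipe.

```python
import importlib.util


def is_installed(module_name):
    """Return True if a top-level module can be located without importing it."""
    return importlib.util.find_spec(module_name) is not None


if __name__ == "__main__":
    if is_installed("datasets"):
        print("datasets is available")
    else:
        print("datasets is missing; install it with: pip install datasets")
```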

How to run:

python train.py RNNLM.yaml