Aligners

Collection of Aligner models

Wav2Vec2.0 Aligner

AlignerWAV2VEC2

 AlignerWAV2VEC2 (text_normalizer, device='cuda')

Initialize self. See help(type(self)) for accurate signature.

source

Point

 Point (token_index:int, time_index:int, score:float)

source

Segment

 Segment (label:str, start:int, end:int, score:float)

Usage

text_normalizer = TTSTextNormalizer().english_cleaners
aligner = AlignerWAV2VEC2(text_normalizer, device='cpu') # for CI on cpu
wav_path = "../data/en/LibriTTS/test-clean/1089/134686/1089_134686_000015_000001.wav"
txt_path = "../data/en/LibriTTS/test-clean/1089/134686/1089_134686_000015_000001.original.txt"
wav, sr = torchaudio.load(wav_path)
with open(txt_path, 'r') as f: txt = f.read()
alignments = aligner.get_alignments(wav, txt)