Citrinet

Attention Enhanced Citrinet for Speech Recognition

By X Wu · 2022 - Abstract: Citrinet is an end-to-end convolutional Connectionist Temporal Classification (CTC) based automatic speech recognition (ASR) model.

Citrinet: Closing the Gap between Non

By S Majumdar · 2021 · Cited by 58 - Citrinet is a deep residual neural model which uses 1D time-channel separable convolutions combined with sub-word encoding and squeeze-and- ...
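A 1D time-channel separable convolution is usually described as a depthwise convolution along time (one kernel per channel) followed by a pointwise 1x1 convolution across channels. The sketch below only compares weight counts for the two layer types. The channel count and kernel width are illustrative assumptions, not values taken from Citrinet, and bias terms are ignored.

```python
def conv1d_params(c_in: int, c_out: int, kernel: int) -> int:
    """Weights in a standard 1D convolution (bias ignored)."""
    return c_in * c_out * kernel


def separable_conv1d_params(c_in: int, c_out: int, kernel: int) -> int:
    """Weights in a time-channel separable 1D convolution (bias ignored)."""
    depthwise = c_in * kernel   # one length-`kernel` filter per input channel
    pointwise = c_in * c_out    # 1x1 convolution that mixes channels
    return depthwise + pointwise


# Illustrative sizes only (assumed for this example).
standard = conv1d_params(256, 256, 11)
separable = separable_conv1d_params(256, 256, 11)
print(standard, separable)
```

For these assumed sizes the separable layer needs roughly a tenth of the weights of the standard one, which is the usual motivation for the factorization.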

nvidia/stt_en_citrinet_1024_gamma_0_25

Nov 18, 2022 - Streaming Citrinet-1024 model is a non-autoregressive, streaming variant of Citrinet model [1] for Automatic Speech Recognition which uses ...

nvidia/stt_en_citrinet_384_ls

Citrinet-CTC model is a non-autoregressive variant of Citrinet model [1] for Automatic Speech Recognition which uses CTC loss/decoding instead of Transducer Loss.
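To make "CTC decoding" concrete, here is a minimal sketch of greedy (best-path) CTC decoding: take the most likely label per frame, merge consecutive repeats, then drop the blank symbol. The toy vocabulary, the frame sequence, and the choice of `0` as the blank index are assumptions for illustration; this is not the decoding code shipped with any Citrinet checkpoint.

```python
from itertools import groupby


def ctc_greedy_decode(frame_ids, blank_id=0):
    """Collapse consecutive repeated labels, then remove blanks."""
    collapsed = [label for label, _ in groupby(frame_ids)]
    return [label for label in collapsed if label != blank_id]


# Hypothetical per-frame argmax labels over a toy vocabulary.
vocab = {1: "c", 2: "a", 3: "t"}
frames = [0, 1, 1, 0, 2, 2, 2, 0, 0, 3, 3]

decoded = ctc_greedy_decode(frames)
text = "".join(vocab[i] for i in decoded)
print(text)
```

Because every frame is labeled independently of the others, the whole sequence can be decoded in one pass, which is why CTC models are described as non-autoregressive.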

Speech Recognition With CitriNet

Sep 5, 2023 - The NVIDIA TAO Toolkit eliminates the time-consuming process of building and fine-tuning DNNs from scratch for IVA applications.

Speech to Text English Citrinet - NGC | Catalog

CitriNet is an end-to-end architecture that is trained using CTC loss. These model checkpoints are intended to be used with the Transfer Learning Toolkit (TLT).