AI & ML interests

This repo contains models and datasets for Japanese ASR. See our main model https://huggingface.co/kotoba-tech/kotoba-whisper-v1.0.

Japanese ASR

This repository contains all the models and datasets for train/evaluate the Japanese ASR dataset generated through the process of achieving kotoba-whisper models. Following table shows CER comparison with different data size of ReazonSpeech used to distill openai/whisper-large-v3. The model names follows japanese-asr/distil-whisper-large-v3-ja-reazonspeech-{size of reazonspeech}.

CER

Please find more detailed results at kotoba-whisper codebase.