This project is an education project to Learning Chinese characters and words( single or multiple character)
The input will be a Chinese subtitle file (.srt): time stamps and Chinese character
The output will be various list of Chinese characters or words, based on the used request:
1- the top 10 or 50 character used (in the subtitle file)
2- the character also part of he HSK-1 list ( or HSK-2,..6)
3- or the character part of a customized list
Export cvs file (format will be provided) basically: char,pinyin,simple translation,
This program could be a web based or self contain application.
Full documentation of the code need to be provided.
In python will be nice, as it is at the end of the day just comparing 2 files and making a new one!
must speak Chinese.