Letter Frequency Analysis of Lithuanian and Other Languages Using the Latin Alphabet
Gintautas Grigas (Institute of Mathematics and Informatics, Vilnius University, Lithuania)
Anita Juškevičienė (Institute of Mathematics and Informatics, Vilnius University, Lithuania)
Anita Juškevičienė (Institute of Mathematics and Informatics, Vilnius University, Lithuania)
Abstract
It is important to evaluate specificities of alphabets, particularly the letter frequencies while designing keyboards, analyzing texts, designing games based on alphabets, and doing some text mining. In order to adequately compare lettter frequences of Lithuanian language to other languages in the Internet space, Wikipedia source was selected which content is common to different languages. The method of letter frequency jumps is used. The main attention is paid to the analysis of letter frequencies at the boundary between native letters and foreign letters used in Lithuanian and other languages.
Article in:
Lithuanian
Article published:
2015-12-28
Keyword(s): alphabet; diacritic; keyboard; Latin script; letter; letter frequency; Lithuanian language; mobile phone; text input.
DOI: 10.3846/cpe.2015.271
Coactivity: Philology, Educology / Santalka: Filologija, Edukologija ISSN 2351-714X, eISSN 2335-7711
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 License.