This paper gives an overview of the current state of the digitization of Mari linguistic resources. After a brief account of the history of the Mari literary languages, it presents samples of different types of Mari text and describes the procedural steps of character encoding conversion. It then gives an account of the Turku-Tokyo project to create UTF-8 encoded corpora of the two Mari literary languages. Finally, the future prospects of corpus-based Mari linguistics are briefly discussed.