One of the options in SCUMMtr is to export hexadecimal char codes, which seems like the most likely candidate for converting into readable text, but the output still has a bunch of apparent garbage characters in it (e.g. {, n, I) that I can't figure out what to do with.
Here are the first 10 lines of the SOMI text dump in English, with the matching Japanese text transcribed from the game as reference:
jungle : ジャングル
rowboat and oars : ボートとオール
jungle : ジャングル
jungle : ジャングル
jungle : ジャングル
It's a gross old bottle.\255\003I don't want it. : 薄汚れた古いガラスびんだ。\255\003こんな物いらない。
bottle : ガラスびん
note : メモ
Herman Toothrot : ハーマン・トゥースロット
There's a message in it! : 中に手紙が入ってる!
The corresponding Japanese hexadecimal codes exported by SCUMMtr:
\x83W\x83\x83\x83\x93\x83O\x83\x8B
\x83{\x81[\x83g\x82\xC6\x83I\x81[\x83\x8B
\x83W\x83\x83\x83\x93\x83O\x83\x8B
\x83W\x83\x83\x83\x93\x83O\x83\x8B
\x83W\x83\x83\x83\x93\x83O\x83\x8B
\x94\x96\x89\x98\x82\xEA\x82\xBD\x8C\xC3\x82\xA2\x83K\x83\x89\x83X\x82\xD1\x82\xF1\x82\xBE\x81B\xFF\x03\x82\xB1\x82\xF1\x82\xC8\x95\xA8\x82\xA2\x82\xE7\x82\xC8\x82\xA2\x81B
\x83K\x83\x89\x83X\x82\xD1\x82\xF1
\x8E\xE8\x8E\x86
\x83n\x81[\x83}\x83\x93\x81E\x83g\x83D\x81[\x83X\x83\x8D\x83b\x83g
\x92\x86\x82\xC9\x8E\xE8\x8E\x86\x82\xAA\x93\xFC\x82\xC1\x82\xC4\x82\xE9\x81I
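My best guess so far is that these bytes are Shift-JIS, in which case characters like {, n and I would not be garbage at all but the second byte of a two-byte character (\x83{ would be ボ, \x83n would be ハ, \x81I would be ！). Here is a minimal Python sketch of that idea, not anything SCUMMtr itself provides. The function names are my own, Python's cp932 codec stands in for Shift-JIS, and I'm assuming a 0xFF byte is always a SCUMM escape followed by exactly one code byte, as in the \xFF\x03 example above. Escapes that carry extra argument bytes, and however a literal backslash byte is written in the dump, are not handled.

```python
import re

# Matches the \xHH escapes used in the hex dump.
_HEX_ESCAPE = re.compile(r"\\x([0-9A-Fa-f]{2})")


def dump_line_to_bytes(line: str) -> bytes:
    """Turn one dump line into raw bytes.

    \\xHH escapes become the byte HH; every other character is assumed
    to be plain ASCII and is kept as its own byte value.
    """
    out = bytearray()
    pos = 0
    for match in _HEX_ESCAPE.finditer(line):
        out += line[pos:match.start()].encode("ascii")
        out.append(int(match.group(1), 16))
        pos = match.end()
    out += line[pos:].encode("ascii")
    return bytes(out)


def decode_line(line: str, encoding: str = "cp932") -> str:
    """Decode one dump line, keeping 0xFF escapes in \\255\\NNN form.

    Assumption: 0xFF is followed by exactly one code byte. 0xFF is never
    a valid Shift-JIS lead or trail byte, so splitting on it is safe.
    """
    raw = dump_line_to_bytes(line)
    parts = []
    start = 0
    i = 0
    while i < len(raw):
        if raw[i] == 0xFF and i + 1 < len(raw):
            parts.append(raw[start:i].decode(encoding))
            parts.append("\\%03d\\%03d" % (raw[i], raw[i + 1]))
            i += 2
            start = i
        else:
            i += 1
    parts.append(raw[start:].decode(encoding))
    return "".join(parts)


print(decode_line(r"\x83W\x83\x83\x83\x93\x83O\x83\x8B"))  # ジャングル
print(decode_line(r"\x83{\x81[\x83g\x82\xC6\x83I\x81[\x83\x8B"))  # ボートとオール
```

Run over the whole dump line by line, this should produce text in the same layout as the English dump, with the \255\003 escapes left in place.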
I've attached the exported hex code dumps from both games, plus a sample file comparing the first 10 lines of the SOMI text dump using SCUMMtr's other export options, in case they help. MI2's other dumps are pretty much identical to SOMI's, except that for some reason SCUMMtr kept generating UTF-16 text files instead of ANSI when that option was used.
Is it possible to get a properly encoded Japanese text dump out of this, or is SCUMMtr just not the right tool for the job?