If you use iMessage, you can access and query the database where all of the messages are stored. All you need is a Mac and some basic SQL skills to begin!
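As a rough sketch of what such a query can look like: on macOS the Messages database is a SQLite file, typically at `~/Library/Messages/chat.db`. The `message` and `handle` tables and the columns used below are assumptions about that schema, which varies between macOS versions; on recent versions `text` can be empty for some messages, and reading the file may require granting your terminal Full Disk Access. The helper names `open_readonly` and `recent_messages` are hypothetical.

```python
import sqlite3
from pathlib import Path

# Assumed default location of the Messages database on macOS.
DEFAULT_DB = Path.home() / "Library" / "Messages" / "chat.db"

# Assumed schema: message(text, handle_id, is_from_me, date), handle(ROWID, id).
QUERY = """
SELECT handle.id AS sender, message.is_from_me, message.text
FROM message
LEFT JOIN handle ON message.handle_id = handle.ROWID
ORDER BY message.date DESC
LIMIT ?
"""


def open_readonly(path=DEFAULT_DB):
    """Open the database read-only so the query cannot modify any messages."""
    uri = Path(path).expanduser().resolve().as_uri() + "?mode=ro"
    return sqlite3.connect(uri, uri=True)


def recent_messages(conn, limit=10):
    """Return (sender, is_from_me, text) tuples for the newest messages."""
    return conn.execute(QUERY, (limit,)).fetchall()
```

To try it, call `recent_messages(open_readonly(), 5)` on a Mac. Working on a copy of the database file is a safer choice than querying the live one.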
Converting from UTF-8 to GSM-7 is usually lossy. We came up with a transliteration strategy that preserves SMS message content when converting UTF-8 to GSM-7.
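To illustrate the general idea of transliteration (this is a minimal sketch, not necessarily the strategy the post describes): keep characters that GSM-7 already supports, map common typographic punctuation to ASCII, strip accents from the rest, and fall back to `?` only as a last resort. The `REPLACEMENTS` table and the function name `to_gsm7` are assumptions made for this example.

```python
import unicodedata

# GSM 03.38 default alphabet (printable characters) plus its extension table.
GSM7_BASIC = (
    "@£$¥èéùìòÇ\nØø\rÅåΔ_ΦΓΛΩΠΨΣΘΞÆæßÉ !\"#¤%&'()*+,-./0123456789:;<=>?"
    "¡ABCDEFGHIJKLMNOPQRSTUVWXYZÄÖÑÜ§¿abcdefghijklmnopqrstuvwxyzäöñüà"
)
GSM7_EXTENDED = "\f^{}\\[~]|€"
GSM7_CHARS = set(GSM7_BASIC) | set(GSM7_EXTENDED)

# Hypothetical hand-picked substitutions for common "smart" punctuation.
REPLACEMENTS = {
    "\u2018": "'", "\u2019": "'",   # curly single quotes
    "\u201c": '"', "\u201d": '"',   # curly double quotes
    "\u2013": "-", "\u2014": "-",   # en and em dashes
    "\u2026": "...",                # ellipsis
    "\u00a0": " ",                  # non-breaking space
}


def to_gsm7(text, fallback="?"):
    """Return text containing only characters representable in GSM-7."""
    out = []
    # Compose first so that "e" + combining accent becomes a single "é".
    for ch in unicodedata.normalize("NFC", text):
        if ch in GSM7_CHARS:
            out.append(ch)
        elif ch in REPLACEMENTS:
            out.append(REPLACEMENTS[ch])
        else:
            # Decompose and drop combining marks, e.g. "ï" becomes "i".
            # A lone combining mark decomposes to "" and is simply dropped.
            base = "".join(
                c for c in unicodedata.normalize("NFKD", ch)
                if not unicodedata.combining(c)
            )
            if all(c in GSM7_CHARS for c in base):
                out.append(base)
            else:
                out.append(fallback)
    return "".join(out)
```

With this sketch, accented letters that GSM-7 supports (such as `é`) pass through untouched, while unsupported ones lose only their accent rather than the whole character.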
Figuring out maximum character counts for concatenated SMS messages is complicated. Throw character encodings into the mix, and everything can become very muddled.
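A simplified sketch of the arithmetic, assuming the usual limits: a single GSM-7 message holds 160 characters and each part of a concatenated message holds 153, while a single UCS-2 message holds 70 characters and each concatenated part holds 67. Characters from the GSM-7 extension table (such as `€`) cost two units. The function name `sms_segments` is hypothetical, and the estimate ignores one edge case: an escape pair or surrogate pair cannot be split across parts, which can occasionally add a segment.

```python
import math

# GSM 03.38 default alphabet (printable characters) and extension table.
GSM7_BASIC = set(
    "@£$¥èéùìòÇ\nØø\rÅåΔ_ΦΓΛΩΠΨΣΘΞÆæßÉ !\"#¤%&'()*+,-./0123456789:;<=>?"
    "¡ABCDEFGHIJKLMNOPQRSTUVWXYZÄÖÑÜ§¿abcdefghijklmnopqrstuvwxyzäöñüà"
)
GSM7_EXTENDED = set("\f^{}\\[~]|€")


def sms_segments(text):
    """Return (encoding, units, segments) for an SMS body."""
    if all(ch in GSM7_BASIC or ch in GSM7_EXTENDED for ch in text):
        encoding, single, multi = "GSM-7", 160, 153
        # Extension characters are sent as ESC + character: two septets.
        units = sum(2 if ch in GSM7_EXTENDED else 1 for ch in text)
    else:
        encoding, single, multi = "UCS-2", 70, 67
        # Count UTF-16 code units, so characters outside the BMP cost two.
        units = len(text.encode("utf-16-be")) // 2
    segments = 1 if units <= single else math.ceil(units / multi)
    return encoding, units, segments
```

Note how a single non-GSM character changes the whole calculation: adding one emoji to a 100-character message switches it to UCS-2 and pushes it from one segment to two.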
Unfortunately, character encodings are widely misunderstood, and implementations vary considerably. A single program may handle even closely related encodings in very different ways.