I am now pacing myself and fixing a few issues every night.
Three oddball things I ran into lately, each of which corrupted some part of the conversion:
1) I have the entire board set to deliver up to 100 messages per page, so almost no threads should be paged. EXCEPT that the dummy Yuku user ID I have been using to download the test batches still sees 20 messages per page. This is ALMOST no problem, except that this guy's stupid script seems to duplicate the first post of the thread for each page of a multipage thread. I think I have a workaround.
2) Apparently I overlooked character encoding issues. See this thread:
http://openitforum.yuku.com?topic=7979
In the quoted block: "We tell our vendors, if you’re asking for 15 days" all of the apostrophes were three-byte UTF-8 sequences; it is a DIFFERENT apostrophe character than the normal one. But the SMF database was set for (geez) Latin-1 with Swedish collation (?), presumably MySQL's default latin1_swedish_ci. So each of these characters was being displayed in the new board as three gibberish characters. I figured out how to force UTF-8 mode for all imported and exported data, which seems to fix the issue. I've seen the same Unicode trash characters (improper display) in several other posts I checked at random, all of which were copy and pastes from other web sites. The whole domain of compatibility issues with Unicode and different character sets is enough to make my head explode.
3) A few posts contain the same <span> tag that I trigger on to detect the post title. The symptom was that some messages came through blank. I fixed it by making the search for the post title extremely specific, adding more of the surrounding tags. Seems fixed.
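For item 1, here is a minimal sketch in Python of one possible workaround, not necessarily the one used here. It assumes each downloaded page has already been parsed into a list of (author, text) tuples; the function name merge_pages and that data layout are made up for illustration.

```python
def merge_pages(pages):
    """Join the pages of one thread, dropping the repeated opening post.

    `pages` is a list of pages; each page is a list of (author, text)
    tuples. This layout is an assumption made for the example.
    """
    merged = []
    for page_number, posts in enumerate(pages):
        # On every page after the first, skip a leading post that is
        # identical to the thread's opening post.
        if page_number > 0 and merged and posts and posts[0] == merged[0]:
            posts = posts[1:]
        merged.extend(posts)
    return merged


# Two pages of the same thread, with the opening post repeated on page 2.
pages = [
    [("alice", "opening post"), ("bob", "reply 1")],
    [("alice", "opening post"), ("carol", "reply 2")],
]
thread = merge_pages(pages)
```

Only the first post of each later page is compared, so a genuine reply that happens to repeat the opening text further down a page is left alone.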
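For item 2, the three gibberish characters can be reproduced in a few lines of Python. This is only an illustration of the symptom, assuming the curly apostrophe is U+2019 and that the Latin-1 side behaves like Windows-1252 (cp1252); it is not the converter's actual code.

```python
# The "different" apostrophe from the quoted block (assumed to be U+2019).
curly = "\u2019"

# In UTF-8 this single character is stored as three bytes.
raw = curly.encode("utf-8")

# Reading those bytes as a single-byte encoding turns each byte into its
# own character, which is the gibberish seen on the new board.
garbled = raw.decode("cp1252")

# Treating the data as UTF-8 on both ends reverses the damage.
repaired = garbled.encode("cp1252").decode("utf-8")

print(len(raw), repr(garbled), repr(repaired))
```

The round trip at the end works only because nothing was lost along the way; if the database had already replaced bytes it could not store, the original text would not be recoverable this way.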
Some people do this stuff for a living. I guess it falls under the domain of data warehousing. I don't envy 'em.

This is grueling, fussy work.