I have a French language Word doc (.docx) that I pasted into CKEditor 3.5 and it ended up creating an HTML file that was 978KB (and caused CKEditor to run very slow and almost crash my browser).
After manually cleaning up the file, it was only 64KB, more than 10x smaller.
The problem is that it created rather odd code like for every bullet/paragraph in the original Word:
<p align="JUSTIFY" style="text-indent: 0in; margin-bottom: 0.08in;"><font size="2"><span lang="fr-FR"><b>1.</b></span></font><font size="2"><span lang="fr-FR"> </span></font><font size="2"><span lang="fr-FR"><u><b>Restrictions du Client</b></u></span></font><font size="2"><span lang="fr-FR">. </span></font></p>
While I know it can't do everything, all of those SPANs do nothing and this works just as fine.
<p align="JUSTIFY"><b>1.</b> <u><b>Restrictions du Client</b></u>. </p>
And an embedded table in Word was really odd looking and was repeated over 100 times:
<dl> <li> <ul> <li> <ul> <li> <ul> <li> <ul> <li> <ul> <li> <dl> <li> <ul> <li> <ul> <li> <ul> <li> <ul> <li> <ul> <li> <dl> <dd> <table border="1" cellpadding="7" cellspacing="0" frame="VOID" rules="NONE" width="325"> <col width="311" /> <tbody> <tr> <td height="4" rowspan="8" valign="TOP" width="311"> <p lang="fr-FR" style="text-indent: 0in; margin-bottom: 0in;"> </p> <table border="0" cellpadding="7" cellspacing="0" width="276"> <col width="262" /> <tbody> <tr> <td height="1" rowspan="8" valign="TOP" width="262"> <p style="text-indent: 0in; margin-bottom: 0in;"><font color="#000000"><font size="2"><span lang="fr-FR"><u><b>Sites Web </b></u></span></font></font><font color="#000000"><font size="2"><span lang="fr-FR"><u><b>des Services d’assistance clientèle </b></u></span></font></font><font color="#000000"><font size="2"><span lang="fr-FR"><b>:</b></span></font></font></p> <p lang="fr-FR" style="text-indent: 0in; margin-bottom: 0in;"> </p> <p style="text-indent: 0in; margin-bottom: 0in;"><font color="#000000"><font size="2"><b>Service Pro</b></font></font></p> <p style="margin-right: 0.1in; text-indent: 0in; margin-bottom: 0in;"><font color="#0000ff"><u><a href="http://test.com/"><font size="2"><span lang="fr-FR">test.com</span></font></a></u></font></p> <p style="margin-right: 0.1in; text-indent: 0in; margin-bottom: 0in;"><font color="#0000ff"><u><a href="http://test.de/"><font size="2"><span lang="fr-FR">test.de</span></font></a></u></font><font color="#000000"><font size="2"><span lang="fr-FR"><u> (</u></span></font></font><font size="2"><span lang="fr-FR">Allemagne</span></font><font color="#000000"><font size="2"><span lang="fr-FR">)</span></font></font><font color="#000000"><font size="2"><span lang="fr-FR"><u> </u></span></font></font></p> <p style="margin-right: 0.1in; text-indent: 0in; margin-bottom: 0in;"><font color="#0000ff"><u><a href="http://test.fr/"><font size="2"><span lang="fr-FR">test.fr</span></font></a></u></font><font color="#000000"><font size="2"><span lang="fr-FR"><u> </u></span></font></font><font color="#000000"><font size="2"><span lang="fr-FR">(France)</span></font></font></p> <p lang="fr-FR" style="margin-right: 0.1in; text-indent: 0in; margin-bottom: 0in;"> </p> <p style="margin-right: 0.1in; text-indent: 0in; margin-bottom: 0in;"><font color="#000000"><font size="2"><b>Service Manage </b></font></font></p> <p style="margin-right: 0.1in; text-indent: 0in; margin-bottom: 0in;"><font color="#0000ff"><u><a href="http://test.com/help"><font size="2">test.com/help</font></a></u></font><font color="#0000ff"><font size="2"> </font></font><font color="#000000"><font size="2"><b> </b></font></font></p> <p style="margin-right: 0.1in; text-indent: 0in; margin-bottom: 0in;"> </p> <p style="margin-right: 0.1in; text-indent: 0in; margin-bottom: 0in;"><font color="#000000"><font size="2"><b>Services X, </b></font></font></p> <p style="margin-right: 0.1in; text-indent: 0in; margin-bottom: 0in;"><font color="#000000"><font size="2"><b>Y</b></font></font></p> <p style="margin-right: 0.1in; text-indent: 0in; margin-bottom: 0in;"><font color="#000000"><font size="2"><b>et Z</b></font></font></p> <p style="margin-right: 0.1in; text-indent: 0in; margin-bottom: 0in;"><font color="#0000ff"><u><a href="http://test.com/"><font size="2"><span lang="fr-FR">http://test.com</span></font></a></u></font></p> <p style="margin-right: 0.1in; text-indent: 0in; margin-bottom: 0in;"><font color="#0000ff"><u><a href="http://test.de/"><font size="2"><span lang="fr-FR">http://test.de</span></font></a></u></font><font color="#1f497d"><font size="2"><span lang="fr-FR"> </span></font></font><font size="2"><span lang="fr-FR">(</span></font><font size="2"><span lang="fr-FR">Allemagne</span></font><font size="2"><span lang="fr-FR">)</span></font></p> <p style="margin-right: 0.1in; text-indent: 0in; margin-bottom: 0in;"><font color="#0000ff"><u><a href="http://test.fr/"><font size="2"><span lang="fr-FR">http://test.fr</span></font></a></u></font><font color="#1f497d"><font size="2"><span lang="fr-FR"> </span></font></font><font size="2"><span lang="fr-FR">(France)</span></font></p> <p lang="fr-FR" style="margin-right: 0.1in; text-indent: 0in; margin-bottom: 0in;"> </p> <p style="margin-right: 0.1in; text-indent: 0in; margin-bottom: 0in;"><font color="#000000"><font size="2"><b>Service Z</b></font></font></p> <p style="margin-right: 0.1in; text-indent: 0in; margin-bottom: 0in;"><font color="#0000ff"><u><a href="http://test.com/"><font size="2">http://test.com</font></a></u></font></p> <p style="text-indent: 0in; margin-bottom: 0in;"> </p> <p style="text-indent: 0in;"><font size="2"><b>E-mail</b></font><font color="#000000"><font size="2"><b> :</b></font></font><font color="#000000"><font size="2"> </font></font><font color="#0000ff"><u><a href="mailto:support@test.com"><font size="2">support@test.com</font></a></u></font></p> </td> </tr> <tr> </tr> <tr> </tr> <tr> </tr> <tr> </tr> <tr> </tr> <tr> </tr> <tr> </tr> </tbody> </table> </td> </tr> <tr> </tr> <tr> </tr> <tr> </tr> <tr> </tr> <tr> </tr> <tr> </tr> <tr> </tr> </tbody> </table> </dd> </dl> </li> </ul> </li>
Then it does it again over 100 times...
Re: French paste from word result is huge
If I do a Paste-From-Word, it seems better and that same paragraph looks better:
However, the repeating table problem still seems to be there, showing up over 100 times in a row.