![]() |
Unusually large html file
Can someone explain why this page:
www.simleaguebaseball.com/history/allstar.htm Is nearly 1.5 MB in size while this page: www.simleaguebaseball.com/all_stat.htm is only 189k? Both are all text. The first has some color formatting and a few tables, but it seems unusally large. Can someone check the page source and tell me if there is something weird inbedded in it? |
There not both text, the first is html, the second one is just text in an htm file. view the source and you can see.
|
The first one was also created with Word. Word has a nasty habit of seriously bloating HTML with a bunch of extra crap.
|
They're two totally differnt pages. Just look at the crappy color formatting.
|
right, I did indeed edit the first one with Word because every time I tried cutting and pasting text from a notepad document into it in Frontpage, the formatting went away. Is there any way I can "un-bloat" the page?
|
Un-bloat and KEEP your MS Word formatting ? Not likely.
The reason MS Word makes it so huge is their heavy use of XML for all the pretty classes they can use for the formatting and such. problem is, Microsoft LOVES to specify the classes used, and every last bit of formatting on each and every individual cell, or if they are being conservative, each and every row. Most coders will simply apply a format to the entire table, and only change the one atribute they want for that row, such as a colour, rather than re-specifying the font, size, colour, spacing, etc for each and every cell. The good news: You can get the HTML version to have the exact same pretty formatting without all the Microsoft Bloat. The bad news: You cannot just cut and paste from the finished MS Word you have, into an HTML format. If you had made this project from the start in Dreamweaver, or Adobe GoLive, the formatting would all be there and it would work great, but now you have to pretty much re-invent the wheel to put all that formatting into the existing document. The good part is you have the raw table already that you can just dump into Dreamweaver or GoLive, and just do the formatting in one of those programs. Man I hate Microsoft, apparently their Word Webpages now include such things as the name of the person the computer was registered to (M***a K***g) and the fact that it was done on a Dell computer. Why the heck would anyone need to include that sort of info on a webpage ? |
right. I built the page using MS Frontpage, but for some reason, cutting and pasting formatted (ie spaced out) Courier New text from Notepad (the boxscores) into Frontpage kept losing the formatting and using MS Word was the only way I could get it to work. Had I only known...
|
Those non breaking spaces( :) are killer too. They are basically a 6 byte equivalent of a 1 byte space. I don't think you need them at all since you're using the Pre(pre-formatted text) tags. (I'd open it in notepad and edit->replace all of them with a space. Do it on a copy just in case it messes up the formatting too much to fix. It shouldn't though)
Dreamweaver has a clean up Word HTML command that will drastically reduce the size. I can't tell you by how much because opening the file in dreamweaver crashes my computer. I've seen it reduce the size by over half and with that much word formatting I bet it'll reduce it by at least that much. Also.. Anyone not using the exact same font, font size, screen resolution etc as you isn't going to see those tables aligned correctly. I don't know if they're aligned for anyone else but for me it's off. If you're wanting compatibility with other computers you might want to put those in real tables. It's not going to be a fast fix. Consider it a learning experience. I know I've spent plenty of time fixing my own HTML. (Try to overlook grammar, typos, and all other mis-information. It's VERY, VERY late.. or early..) |
Sadly, I don't have Dreamweaver. And I won't unless it's free somewhere. I'm stuck with MS Frontpage.
|
Try NVU.
Its open source and free. I make no promises if you'll like it, or it's what you want, but it doesn't hurt to know about the software out there. |
I went back and rebuilt the page by basically copying all the "formatted" text into Notepad, then copying and pasting it into a new page.
The exact same content went from 1.27 GB to 330k! Thanks all! |
All times are GMT -8. The time now is 03:37 AM. |
Powered by vBulletin® Version 3.8.7
Copyright ©2000 - 2025, vBulletin Solutions, Inc.
Search Engine Optimization by vBSEO 3.6.0 PL2
© 2002-2012 Tilted Forum Project