06-13-2006, 06:47 PM | #1 (permalink) |
Who You Crappin?
Location: Everywhere and Nowhere
|
Unusually large html file
Can someone explain why this page:
www.simleaguebaseball.com/history/allstar.htm Is nearly 1.5 MB in size while this page: www.simleaguebaseball.com/all_stat.htm is only 189k? Both are all text. The first has some color formatting and a few tables, but it seems unusally large. Can someone check the page source and tell me if there is something weird inbedded in it?
__________________
"You can't shoot a country until it becomes a democracy." - Willravel |
06-13-2006, 07:03 PM | #2 (permalink) |
Devils Cabana Boy
Location: Central Coast CA
|
There not both text, the first is html, the second one is just text in an htm file. view the source and you can see.
__________________
Donate Blood! "Love is not finding the perfect person, but learning to see an imperfect person perfectly." -Sam Keen |
06-14-2006, 05:04 AM | #5 (permalink) |
Who You Crappin?
Location: Everywhere and Nowhere
|
right, I did indeed edit the first one with Word because every time I tried cutting and pasting text from a notepad document into it in Frontpage, the formatting went away. Is there any way I can "un-bloat" the page?
__________________
"You can't shoot a country until it becomes a democracy." - Willravel |
06-14-2006, 05:29 PM | #6 (permalink) |
Tilted
|
Un-bloat and KEEP your MS Word formatting ? Not likely.
The reason MS Word makes it so huge is their heavy use of XML for all the pretty classes they can use for the formatting and such. problem is, Microsoft LOVES to specify the classes used, and every last bit of formatting on each and every individual cell, or if they are being conservative, each and every row. Most coders will simply apply a format to the entire table, and only change the one atribute they want for that row, such as a colour, rather than re-specifying the font, size, colour, spacing, etc for each and every cell. The good news: You can get the HTML version to have the exact same pretty formatting without all the Microsoft Bloat. The bad news: You cannot just cut and paste from the finished MS Word you have, into an HTML format. If you had made this project from the start in Dreamweaver, or Adobe GoLive, the formatting would all be there and it would work great, but now you have to pretty much re-invent the wheel to put all that formatting into the existing document. The good part is you have the raw table already that you can just dump into Dreamweaver or GoLive, and just do the formatting in one of those programs. Man I hate Microsoft, apparently their Word Webpages now include such things as the name of the person the computer was registered to (M***a K***g) and the fact that it was done on a Dell computer. Why the heck would anyone need to include that sort of info on a webpage ? Last edited by 3Z3VH; 06-14-2006 at 05:37 PM.. |
06-14-2006, 06:24 PM | #7 (permalink) |
Who You Crappin?
Location: Everywhere and Nowhere
|
right. I built the page using MS Frontpage, but for some reason, cutting and pasting formatted (ie spaced out) Courier New text from Notepad (the boxscores) into Frontpage kept losing the formatting and using MS Word was the only way I could get it to work. Had I only known...
__________________
"You can't shoot a country until it becomes a democracy." - Willravel |
06-15-2006, 04:10 AM | #8 (permalink) |
Delicious
|
Those non breaking spaces( :) are killer too. They are basically a 6 byte equivalent of a 1 byte space. I don't think you need them at all since you're using the Pre(pre-formatted text) tags. (I'd open it in notepad and edit->replace all of them with a space. Do it on a copy just in case it messes up the formatting too much to fix. It shouldn't though)
Dreamweaver has a clean up Word HTML command that will drastically reduce the size. I can't tell you by how much because opening the file in dreamweaver crashes my computer. I've seen it reduce the size by over half and with that much word formatting I bet it'll reduce it by at least that much. Also.. Anyone not using the exact same font, font size, screen resolution etc as you isn't going to see those tables aligned correctly. I don't know if they're aligned for anyone else but for me it's off. If you're wanting compatibility with other computers you might want to put those in real tables. It's not going to be a fast fix. Consider it a learning experience. I know I've spent plenty of time fixing my own HTML. (Try to overlook grammar, typos, and all other mis-information. It's VERY, VERY late.. or early..)
__________________
“It is better to be rich and healthy than poor and sick” - Dave Barry |
06-22-2006, 06:29 AM | #11 (permalink) |
Who You Crappin?
Location: Everywhere and Nowhere
|
I went back and rebuilt the page by basically copying all the "formatted" text into Notepad, then copying and pasting it into a new page.
The exact same content went from 1.27 GB to 330k! Thanks all!
__________________
"You can't shoot a country until it becomes a democracy." - Willravel |
Tags |
file, html, large, unusually |
|
|