RE: [squid-users] Webpage partially downloaded via squid

From: Gerard Saunders <gerard.saunders_at_deakin.edu.au>
Date: Wed, 28 Oct 2009 17:35:16 +1100

Hi Amos,

Thanks for getting back to me on this.

Interesting that you actually get the entire page in your testing, which is only possible if you initiate a refresh from the browser in our environment.

This is the bit that I don't understand. If you manually refresh the page when using squid it works, however the initial download fails. Do you have any idea why the second attempt, which also goes through the proxy server succeeds?

Here is some more detail on what is happening:

squid logs. Initial request. No refresh:

1256518175.833 2 128.184.148.154 TCP_IMS_HIT/304 334 GET http://www.movebank.org/community/info_icon.gif gerards NONE/- image/gif
1256518176.074 1046 128.184.148.154 TCP_MISS/200 19179 GET http://www.movebank.org/community gerards DIRECT/128.112.89.94 text/html
1256518176.594 519 128.184.148.154 TCP_MISS/404 638 GET http://www.movebank.org/bar_1_10.gif gerards DIRECT/128.112.89.94 text/html

squid log. Refresh from browser:

1256518384.321 1435 128.184.148.154 TCP_MISS/200 25382 GET http://www.movebank.org/community gerards DIRECT/128.112.89.94 text/html
1256518384.367 557 128.184.148.154 TCP_REFRESH_MISS/200 4042 GET http://www.movebank.org/community/IEFixes.css gerards DIRECT/128.112.89.94 text/css
1256518384.367 557 128.184.148.154 TCP_REFRESH_MISS/200 1830 GET http://www.movebank.org/community/plone_minwidth.js gerards DIRECT/128.112.89.94 application/x-javascript
1256518385.720 1911 128.184.148.154 TCP_REFRESH_MISS/200 67129 GET http://www.movebank.org/community/portal_javascripts/Plone%20Tableless/ploneScripts3251.js gerards DIRECT/128.112.89.94 application/x-javascript
1256518386.462 720 128.184.148.154 TCP_REFRESH_MISS/200 681 GET http://www.movebank.org/community/portal_css/Plone%20Tableless/ploneStyles1912.css gerards DIRECT/128.112.89.94 text/css
1256518387.491 1752 128.184.148.154 TCP_REFRESH_MISS/200 77641 GET http://www.movebank.org/community/portal_css/Plone%20Tableless/ploneStyles2899.css gerards DIRECT/128.112.89.94 text/css
1256518388.042 536 128.184.148.154 TCP_REFRESH_MISS/200 681 GET http://www.movebank.org/community/portal_css/Plone%20Tableless/ploneStyles2817.css gerards DIRECT/128.112.89.94 text/css
1256518388.583 538 128.184.148.154 TCP_REFRESH_MISS/200 676 GET http://www.movebank.org/community/portal_css/Plone%20Tableless/ploneStyles4785.css gerards DIRECT/128.112.89.94 text/css
1256518389.155 565 128.184.148.154 TCP_REFRESH_HIT/304 336 GET http://www.movebank.org/community/bar_1_10.gif gerards DIRECT/128.112.89.94 -
1256518389.163 575 128.184.148.154 TCP_REFRESH_HIT/304 336 GET http://www.movebank.org/community/newsitem_icon.gif gerards DIRECT/128.112.89.94 -
1256518389.165 577 128.184.148.154 TCP_REFRESH_HIT/304 336 GET http://www.movebank.org/community/info_icon.gif gerards DIRECT/128.112.89.94 -
1256518389.170 580 128.184.148.154 TCP_REFRESH_HIT/304 336 GET http://www.movebank.org/community/user.gif gerards DIRECT/128.112.89.94 -
1256518389.177 589 128.184.148.154 TCP_REFRESH_HIT/304 336 GET http://www.movebank.org/community/print_icon.gif gerards DIRECT/128.112.89.94 -
1256518389.180 585 128.184.148.154 TCP_REFRESH_HIT/304 336 GET http://www.movebank.org/community/linkTransparent.gif gerards DIRECT/128.112.89.94 -
1256518389.684 528 128.184.148.154 TCP_REFRESH_HIT/304 336 GET http://www.movebank.org/community/bullet.gif gerards DIRECT/128.112.89.94 -
1256518389.700 529 128.184.148.154 TCP_REFRESH_HIT/304 405 GET http://www.movebank.org/community/logo.gif gerards DIRECT/128.112.89.94 -
1256518389.720 541 128.184.148.154 TCP_REFRESH_HIT/304 336 GET http://www.movebank.org/community/input_background.gif gerards DIRECT/128.112

You can see from the above that the first request doesn't get any of the javascript/CSS. The refresh picks this up. This behaviour can be reproduced across all of our squid servers.

The fact that is works on your system, despite the garbage headers etc.. indicates to me that the problem *may* just be with our implementation of squid. Although I have had a pretty good look and there is nothing obvious that would result in the results we are seeing.

Cheers,
Gerard.

I can confirm that.

Duplicate Expires and Content-Language headers, the first ones filled
with complete garbage.

I do get the full page content though.

squidclient -h www.movebank.org -p 80 -H 'Host: www.movebank.org\n'
/community

HTTP/1.1 200 OK
Date: Tue, 27 Oct 2009 06:38:19 GMT
Server: Zope/(Zope 2.9.8-final, python 2.4.4, linux2) ZServer/1.1
Plone/2.5.4-2
Expires: Sat, 01 Jan 2000 00:00:00 GMT, Sat, 01 Jan 2000 00:00:00 GMT,
Sat, 01 Jan 2000 00:00:00 GMT, Sat, 01 Jan 2000 00:00:00 GMT, Sat, 01 Ja
n 2000 00:00:00 GMT, Sat, 01 Jan 2000 00:00:00 GMT, Sat, 01 Jan 2000
00:00:00 GMT, Sat, 01 Jan 2000 00:00:00 GMT, Sat, 01 Jan 2000 00:00:00 GMT
, Sat, 01 Jan 2000 00:00:00 GMT, Sat, 01 Jan 2000 00:00:00 GMT, Sat, 01
Jan 2000 00:00:00 GMT, Sat, 01 Jan 2000 00:00:00 GMT, Sat, 01 Jan 2000
00:00:00 GMT, Sat, 01 Jan 2000 00:00:00 GMT, Sat, 01 Jan 2000 00:00:00
GMT, Sat, 01 Jan 2000 00:00:00 GMT, Sat, 01 Jan 2000 00:00:00 GMT, Sat,
01 Jan 2000 00:00:00 GMT, Sat, 01 Jan 2000 00:00:00 GMT, Sat, 01 Jan
2000 00:00:00 GMT, Sat, 01 Jan 2000 00:00:00 GMT, Sat, 01 Jan 2000 00:00:0
0 GMT, Sat, 01 Jan 2000 00:00:00 GMT, Sat, 01 Jan 2000 00:00:00 GMT,
Sat, 01 Jan 2000 00:00:00 GMT, Sat, 01 Jan 2000 00:00:00 GMT, Sat, 01 Jan
2000 00:00:00 GMT, Sat, 01 Jan 2000 00:00:00 GMT, Sat, 01 Jan 2000
00:00:00 GMT, Sat, 01 Jan 2000 00:00:00 GMT, Sat, 01 Jan 2000 00:00:00 GMT,
Sat, 01 Jan 2000 00:00:00 GMT, Sat, 01 Jan 2000 00:00:00 GMT, Sat, 01
Jan 2000 00:00:00 GMT, Sat, 01 Jan 2000 00:00:00 GMT, Sat, 01 Jan 2000 00
:00:00 GMT, Sat, 01 Jan 2000 00:00:00 GMT, Sat, 01 Jan 2000 00:00:00
GMT, Sat, 01 Jan 2000 00:00:00 GMT, Sat, 01 Jan 2000 00:00:00 GMT, Sat, 01
  Jan 2000 00:00:00 GMT, Sat, 01 Jan 2000 00:00:00 GMT, Sat, 01 Jan 2000
00:00:00 GMT, Sat, 01 Jan 2000 00:00:00 GMT, Sat, 01 Jan 2000 00:00:00
GMT, Sat, 01 Jan 2000 00:00:00 GMT, Sat, 01 Jan 2000 00:00:00 GMT, Sat,
01 Jan 2000 00:00:00 GMT, Sat, 01 Jan 2000 00:00:00 GMT, Sat, 01 Jan 20
00 00:00:00 GMT, Sat, 01 Jan 2000 00:00:00 GMT, Sat, 01 Jan 2000
00:00:00 GMT, Sat, 01 Jan 2000 00:00:00 GMT, Sat, 01 Jan 2000 00:00:00
GMT, Sa
t, 01 Jan 2000 00:00:00 GMT, Sat, 01 Jan 2000 00:00:00 GMT, Sat, 01 Jan
2000 00:00:00 GMT, Sat, 01 Jan 2000 00:00:00 GMT, Sat, 01 Jan 2000 00:0
0:00 GMT, Sat, 01 Jan 2000 00:00:00 GMT, Sat, 01 Jan 2000 00:00:00 GMT,
Sat, 01 Jan 2000 00:00:00 GMT, Sat, 01 Jan 2000 00:00:00 GMT, Sat, 01 J
an 2000 00:00:00 GMT, Sat, 01 Jan 2000 00:00:00 GMT, Sat, 01 Jan 2000
00:00:00 GMT, Sat, 01 Jan 2000 00:00:00 GMT, Sat, 01 Jan 2000 00:00:00 GM
T
Content-Language: en, en, en, en, en, en, en, en, en, en, en, en, en,
en, en, en, en, en, en, en, en, en, en, en, en, en, en, en, en, en, en, e
n, en, en, en, en, en, en, en, en, en, en, en, en, en, en, en, en, en,
en, en, en, en, en, en, en, en, en, en, en, en, en, en, en, en, en, en,
en, en
Content-Length: 24931
Expires: Sat, 01 Jan 2000 00:00:00 GMT
Content-Type: text/html;charset=utf-8
Content-Language: en
Connection: close

Amos

-- 
Please be using
   Current Stable Squid 2.7.STABLE7 or 3.0.STABLE19
   Current Beta Squid 3.1.0.14
Received on Wed Oct 28 2009 - 06:35:28 MDT

This archive was generated by hypermail 2.2.0 : Wed Oct 28 2009 - 12:00:03 MDT